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Preface 


This  is  the  third  edition  of  this  text  on  survival  analysis, 
originally  published  in  1996.  As  in  the  first  and  second 
editions,  each  chapter  contains  a  presentation  of  its  topic 
in  “lecture-book"  format  together  with  objectives,  an  out¬ 
line,  key  formulae,  practice  exercises,  and  a  test.  The  “lec¬ 
ture-book”  format  has  a  sequence  of  illustrations  and 
formulae  in  the  left  column  of  each  page  and  a  script  in 
the  right  column.  This  format  allows  you  to  read  the  script 
in  conjunction  with  the  illustrations  and  formulae  that 
highlight  the  main  points,  formulae,  or  examples  being 
presented. 

This  third  edition  has  expanded  the  second  edition  by 
adding  one  new  chapter,  additional  sections  and  clarifica¬ 
tions  to  several  chapters,  and  a  revised  computer  appendix. 

The  new  chapter  is  Chapter  10,  "Design  Issues  for 
Randomized  Trials,"  which  considers  how  to  compute 
sample  size  when  designing  a  randomized  trial  involving 
time-to-event  data. 

We  have  expanded  Chapter  1  to  clarify  the  distinction 
between  random,  independent,  and  noninformative  cen¬ 
soring  assumptions  often  made  about  survival  data.  We 
also  added  a  section  in  Chapter  1  that  introduces  the 
Counting  Process  data  layout  that  is  discussed  in  later 
chapters  (3,  6,  and  8). 

We  added  sections  in  Chapter  2  to  describe  how  to  obtain 
confidence  intervals  for  the  Kaplan-Meier  (KM)  curve  and 
the  median  survival  time  obtained  from  a  KM  curve. 

We  have  expanded  Chapter  3  on  the  Cox  Proportional 
Hazards  (PH)  Model  by  describing  the  use  of  age  as  the 
time  scale  instead  of  time-on-follow-up  as  the  outcome 
variable.  We  also  added  a  section  that  clarifies  how  to 
obtain  confidence  intervals  for  PH  models  that  contain 
product  terms  that  reflect  effect  modification  of  exposure 
variables  of  interest. 


viii  Preface 


Suggestions 
for  Use 


We  have  added  sections  that  describe  the  derivation  of  the 
(partial)  likelihood  functions  for  the  stratified  Cox  (SC)  model 
in  Chapter  5  and  the  extended  Cox  model  in  Chapter  6. 

We  have  expanded  Chapter  9  on  competing  risks  to 
describe  the  Fine  and  Gray  model  for  a  subdistribution 
hazard  that  allows  for  a  multivariable  analysis  involving  a 
cumulative  incidence  curve  (CIC).  We  also  added  a  numer¬ 
ical  example  to  illustrate  the  calculation  of  a  conditional 
probability  curve  (CPC)  defined  from  a  CIC. 

The  Computer  Appendix  in  the  second  edition  of  this  text 
provided  step-by-step  instructions  for  using  the  computer 
packages  STATA,  SAS,  and  SPSS  to  carry  out  the  survival 
analyses  presented  in  the  main  text.  We  expanded  this 
Appendix  to  include  the  free  internet-based  computer  soft¬ 
ware  package  call  R.  We  have  also  updated  our  description 
of  STATA  (version  10.0),  SAS  (version  9.2),  and  SPSS 
(version  PASW  18).  The  application  of  these  computer 
packages  to  survival  data  is  described  in  separate  self- 
contained  sections  of  the  Computer  Appendix,  with  the 
analysis  of  the  same  datasets  illustrated  in  each  section. 

In  addition  to  the  above  new  material,  the  original  nine 
chapters  have  been  modified  slightly  to  correct  for  errata 
in  the  second  edition  and  to  add  or  modify  exercises 
provided  at  the  end  of  some  chapters. 

The  authors’  Web  site  for  this  textbook  has  the  following 
Web-link:  http://www.sph.emory.edu/dklein/surv3.htm. 

This  Web  site  includes  information  on  how  to  order  this 
second  edition  from  the  publisher  and  a  freely  download¬ 
able  zip-file  containing  data-files  for  examples  used  in  the 
textbook. 


This  text  was  originally  intended  for  self-study,  but  in  the 
15  years  since  the  first  edition  was  published,  it  has  also 
been  effectively  used  as  a  text  in  a  standard  lecture-type 
classroom  format.  The  text  may  also  be  used  to  supplement 
material  covered  in  a  course  or  to  review  previously 
learned  material  in  a  self-instructional  course  or  self- 
planned  learning  activity.  A  more  individualized  learning 
program  may  be  particularly  suitable  to  a  working  profes¬ 
sional  who  does  not  have  the  time  to  participate  in  a  regu¬ 
larly  scheduled  course. 
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Recommended 

Preparation 


In  working  with  any  chapter,  the  learner  is  encouraged 
first  to  read  the  abbreviated  outline  and  the  objectives 
and  then  work  through  the  presentation.  The  reader  is 
then  encouraged  to  read  the  detailed  outline  for  a  summary 
of  the  presentation,  work  through  the  practice  exercises, 
and,  finally,  complete  the  test  to  check  what  has  been 
learned. 


The  ideal  preparation  for  this  text  on  survival  analysis  is  a 
course  on  quantitative  methods  in  epidemiology  and  a 
course  in  applied  multiple  regression.  Also,  knowledge  of 
logistic  regression,  modeling  strategies,  and  maximum- 
likelihood  techniques  is  crucial  for  the  material  on  the 
Cox  and  parametric  models  described  in  Chapters  3-9. 

Recommended  references  on  these  subjects,  with  sug¬ 
gested  chapter  readings  are: 

Kleinbaum  D,  Kupper  L,  Nizam  A,  and  Muller  K,  Applied 
Regression  Analysis  and  Other  Multivariable  Methods, 
Fourth  Edition,  Cengage  Publishers,  2007,  Chapters  1-16, 
22-23. 


Kleinbaum  D,  Kupper  L  and  Morgenstern  H,  Epidemio¬ 
logic  Research:  Principles  and  Quantitative  Methods,  John 
Wiley  and  Sons,  Publishers,  New  York,  1982,  Chapters 
20-24. 


Kleinbaum  D  and  Klein  M,  Logistic  Regression:  A  Self- 
Learning  Text,  Third  Edition,  Springer  Publishers, 
New  York,  2010,  Chapters  4-7,  11. 

Kleinbaum  D,  ActivEpi-A  CD  Rom  Electronic  Textbook  on 
Fundamentals  of  Epidemiology,  Springer  Publishers, 
New  York,  2002,  Chapters  13-15. 

A  first  course  on  the  principles  of  epidemiologic  research 
would  be  helpful,  since  all  chapters  in  this  text  are  written 
from  the  perspective  of  epidemiologic  research.  In  parti¬ 
cular,  the  reader  should  be  familiar  with  the  basic  charac¬ 
teristics  of  epidemiologic  study  designs,  and  should  have 
some  idea  of  the  frequently  encountered  problem  of 
controlling  for  confounding  and  assessing  interaction/ 
effect  modification.  The  above  reference,  ActivEpi,  pro¬ 
vides  a  convenient  and  hopefully  enjoyable  way  to  review 
epidemiology. 
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Introduction 


Abbreviated 

Outline 


This  introduction  to  survival  analysis  gives  a  descriptive 
overview  of  the  data  analytic  approach  called  survival 
analysis.  This  approach  includes  the  type  of  problem 
addressed  by  survival  analysis,  the  outcome  variable  con¬ 
sidered,  the  need  to  take  into  account  “censored  data,” 
what  a  survival  function  and  a  hazard  function  represent, 
basic  data  layouts  for  a  survival  analysis,  the  goals  of  sur¬ 
vival  analysis,  and  some  examples  of  survival  analysis. 

Because  this  chapter  is  primarily  descriptive  in  content,  no 
prerequisite  mathematical,  statistical,  or  epidemiologic 
concepts  are  absolutely  necessary.  A  first  course  on  the 
principles  of  epidemiologic  research  would  be  helpful.  It 
would  also  be  helpful  if  the  reader  has  had  some  experi¬ 
ence  reading  mathematical  notation  and  formulae. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  What  is  survival  analysis?  (pages  4-5) 

II.  Censored  data  (pages  5-8) 

III.  Terminology  and  notation  (pages  9-15) 

IV.  Goals  of  survival  analysis  (page  16) 

V.  Basic  data  layout  for  computer  (pages  16-23) 

VI.  Basic  data  layout  for  understanding  analysis 
(pages  23-28) 

VII.  Descriptive  measures  of  survival  experience 

(pages  28-30) 

VIII.  Example:  Extended  remission  data  (pages  30-33) 
IX.  Multivariable  example  (pages  33-35) 

X.  Math  models  in  survival  analysis  (pages  35-37) 
XI.  Censoring  assumptions  (pages  37-43) 
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Objectives 


Upon  completing  the  chapter,  the  learner  should  be  able  to: 

1 .  Recognize  or  describe  the  type  of  problem  addressed 
by  a  survival  analysis. 

2.  Define  what  is  meant  by  censored  data. 

3.  Define  or  recognize  right-censored  data. 

4.  Give  three  reasons  why  data  may  be  censored. 

5.  Define,  recognize,  or  interpret  a  survivor  function. 

6.  Define,  recognize,  or  interpret  a  hazard  function. 

7.  Describe  the  relationship  between  a  survivor  function 
and  a  hazard  function. 

8.  State  three  goals  of  a  survival  analysis. 

9.  Identify  or  recognize  the  basic  data  layout  for  the 
computer;  in  particular,  put  a  given  set  of  survival 
data  into  this  layout. 

10.  Identify  or  recognize  the  basic  data  layout,  or 
components  thereof,  for  understanding  modeling 
theory;  in  particular,  put  a  given  set  of  survival  data 
into  this  layout. 

1 1 .  Interpret  or  compare  examples  of  survivor  curves  or 
hazard  functions. 

12.  Given  a  problem  situation,  state  the  goal  of  a  survival 
analysis  in  terms  of  describing  how  explanatory 
variables  relate  to  survival  time. 

13.  Compute  or  interpret  average  survival  and/or  average 
hazard  measures  from  a  set  of  survival  data. 

14.  Define  or  interpret  the  hazard  ratio  defined  from 
comparing  two  groups  of  survival  data. 
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Presentation 


This  presentation  gives  a  general  introduction 
to  survival  analysis,  a  popular  data  analysis 
approach  for  certain  kinds  of  epidemiologic 
and  other  data.  Here  we  focus  on  the  problem 
addressed  by  survival  analysis,  the  goals  of  a 
survival  analysis,  key  notation  and  terminol¬ 
ogy,  the  basic  data  layout,  and  some  examples. 


I.  What  Is  Survival 
Analysis? 


Outcome  variable:  Time  until  an 
event  occurs 


We  begin  by  describing  the  type  of  analytic 
problem  addressed  by  survival  analysis.  Gener¬ 
ally,  survival  analysis  is  a  collection  of  statisti¬ 
cal  procedures  for  data  analysis  for  which  the 
outcome  variable  of  interest  is  time  until  an 
event  occurs. 


start  follow-up  TIME  Event  By  time,  we  mean  years,  months,  weeks,  or 

days  from  the  beginning  of  follow-up  of  an 
individual  until  an  event  occurs;  alternatively, 
time  can  refer  to  the  age  of  an  individual  when 
an  event  occurs. 


Event:  death 

disease 

relapse 

recovery 

Assume  1  event 

_ 1 

Recurrent  event 

>  1  event 

or 

Competing  risk 

By  event,  we  mean  death,  disease  incidence, 
relapse  from  remission,  recovery  (e.g.,  return  to 
work)  or  any  designated  experience  of  interest 
that  may  happen  to  an  individual. 

Although  more  than  one  event  may  be  consid¬ 
ered  in  the  same  analysis,  we  will  assume  that 
only  one  event  is  of  designated  interest.  When 
more  than  one  event  is  considered  (e.g.,  death 
from  any  of  several  causes),  the  statistical  prob¬ 
lem  can  be  characterized  as  either  a  recurrent 
event  or  a  competing  risk  problem,  which  are 
discussed  in  Chaps.  8  and  9,  respectively. 


Time  =  survival  time  In  a  survival  analysis,  we  usually  refer  to  the 

time  variable  as  survival  time,  because  it  gives 
Event  =  failure  the  time  that  an  individual  has  “survived”  over 

some  follow-up  period.  We  also  typically  refer 
to  the  event  as  a  failure,  because  the  event  of 
interest  usually  is  death,  disease  incidence,  or 
some  other  negative  individual  experience. 
However,  survival  time  may  be  “time  to  return 
to  work  after  an  elective  surgical  procedure,”  in 
which  case  failure  is  a  positive  event. 
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EXAMPLE 


1 .  Leukemia  patients/time  in  remission 
(weeks) 

2.  Disease-free  cohort/time  until  heart 
disease  (years) 

3.  Elderly  (60+)  population/time  until 
death  (years) 

4.  Parolees  (recidivism  study)/time 
until  rearrest  (weeks) 

5.  Heart  transplants/time  until  death 
(months) 


Five  examples  of  survival  analysis  problems 
are  briefly  mentioned  here.  The  first  is  a  study 
that  follows  leukemia  patients  in  remission 
over  several  weeks  to  see  how  long  they  stay 
in  remission.  The  second  example  follows  a 
disease-free  cohort  of  individuals  over  several 
years  to  see  who  develops  heart  disease.  A  third 
example  considers  a  13 -year  follow-up  of  an 
elderly  population  (60+  years)  to  see  how  long 
subjects  remain  alive.  A  fourth  example  follows 
newly  released  parolees  for  several  weeks  to 
see  whether  they  get  rearrested.  This  type  of 
problem  is  called  a  recidivism  study.  The  fifth 
example  traces  how  long  patients  survive  after 
receiving  a  heart  transplant. 


All  of  the  above  examples  are  survival  analysis 
problems  because  the  outcome  variable  is  time 
until  an  event  occurs.  In  the  first  example, 
involving  leukemia  patients,  the  event  of  inter¬ 
est  (i.e.,  failure)  is  “going  out  of  remission,” 
and  the  outcome  is  “time  in  weeks  until  a 
person  goes  out  of  remission.”  In  the  second 
example,  the  event  is  “developing  heart  dis¬ 
ease,”  and  the  outcome  is  “time  in  years  until 
a  person  develops  heart  disease.”  In  the  third 
example,  the  event  is  “death"  and  the  outcome 
is  “time  in  years  until  death.”  Example  four, 
a  sociological  rather  than  a  medical  study,  con¬ 
siders  the  event  of  recidivism  (i.e.,  getting  rear¬ 
rested),  and  the  outcome  is  “time  in  weeks  until 
rearrest.”  Finally,  the  fifth  example  considers 
the  event  “death,”  with  the  outcome  being 
"time  until  death  (in  months  from  receiving 
a  transplant).” 


We  will  return  to  some  of  these  examples  later 
in  this  presentation  and  in  later  presentations. 


II.  Censored  Data  Most  survival  analyses  must  consider  a  key 

analytical  problem  called  censoring.  In  essence, 
censoring  occurs  when  we  have  some  informa- 
Censoring:  don’t  know  survival  tion  about  individual  survival  time,  but  we  don’t 
time  exactly  know  the  survival  time  exactly. 
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As  a  simple  example  of  censoring,  consider 
leukemia  patients  followed  until  they  go  out 
of  remission,  shown  here  as  X.  If  for  a  given 
patient,  the  study  ends  while  the  patient  is  still 
in  remission  (i.e.,  doesn’t  get  the  event),  then 
that  patient’s  survival  time  is  considered  cen¬ 
sored.  We  know  that,  for  this  person,  the  sur¬ 
vival  time  is  at  least  as  long  as  the  period  that 
the  person  has  been  followed,  but  if  the  person 
goes  out  of  remission  after  the  study  ends, 
we  do  not  know  the  complete  survival  time. 

There  are  generally  three  reasons  why  censor¬ 
ing  may  occur: 

(1)  a  person  does  not  experience  the  event 
before  the  study  ends; 

(2)  a  person  is  lost  to  follow-up  during  the 
study  period; 

(3)  a  person  withdraws  from  the  study 

because  of  death  (if  death  is  not  the  event 
of  interest)  or  some  other  reason  (e.g., 
adverse  drug  reaction  or  other  competing 
risk) 

These  situations  are  graphically  illustrated 
here.  The  graph  describes  the  experience  of 
several  persons  followed  over  time.  An  X 
denotes  a  person  who  got  the  event. 


Person  A,  for  example,  is  followed  from  the 
start  of  the  study  until  getting  the  event  at 
week  5;  his  survival  time  is  5  weeks  and  is  not 
censored. 


Person  B  also  is  observed  from  the  start  of  the 
study  but  is  followed  to  the  end  of  the  12-week 
study  period  without  getting  the  event;  the  sur¬ 
vival  time  here  is  censored  because  we  can  say 
only  that  it  is  at  least  12  weeks. 

X  =>  Event  occurs  Person  C  enters  the  study  between  the  second 

and  3rd  week  and  is  followed  until  he  with¬ 
draws  from  the  study  at  6  weeks;  this  person’s 
survival  time  is  censored  after  3.5  weeks. 

Person  D  enters  at  week  4  and  is  followed  for 
the  remainder  of  the  study  without  getting  the 
event;  this  person’s  censored  time  is  8  weeks. 


Why  censor? 

1 .  study  ends  -  no  event 

2.  lost  to  follow-up 

3.  withdraws 


EXAMPLE  ( 

Weeks - ► 

2  4  6  8  10  12 

A 

1  1  1  1  1 

T=  5 

1  X 

R 

T=  12 

C 

Withdrawn 

D 

T  =  8 

Study  end 

E 

T  =  6  Lost 

T- 3.5  „ 

F 

- X 

EXAMPLE  1 

Leukei 

1 

mia  patients  in  remission: 

> 

s  *  X 

Study 

start 

Study 

end 
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Person  E  enters  the  study  at  week  3  and 
is  followed  until  week  9,  when  he  is  lost  to 
follow-up;  his  censored  time  is  6  weeks. 

Person  F  enters  at  week  8  and  is  followed  until 
getting  the  event  at  week  11.5.  As  with  person 
A,  there  is  no  censoring  here;  the  survival  time 
is  3.5  weeks. 


SUMMARY  In  summary,  of  the  six  persons  observed,  two 

Event:  A,  F  get  the  event  (persons  A  and  F)  and  four  are 

Censored:  B,  C,  D,  E  censored  (B,  C,  D,  and  E). 


Person 

Survival 

time 

Failed  (1); 
Censored  (0) 

A 

5 

1 

B 

12 

0 

3.5 

D 

8 

0 

E 

6 

0 

<d2in 

3.5 

Pp; 

Right-censored:  true  survival  time 
is  equal  to  or  greater  than  observed 
survival  time 


Weeks  - 
2  4  6 


8  10  12 


Study  end 
'▼> _ 


-  Withdrawn 


-Lost' 


4 


RIGHT 

CENSORED 


Study  end 


A  table  of  the  survival  time  data  for  the  six 
persons  in  the  graph  is  now  presented.  For 
each  person,  we  have  given  the  corresponding 
survival  time  up  to  the  event’s  occurrence  or  up 
to  censorship.  We  have  indicated  in  the  last 
column  whether  this  time  was  censored  or  not 
(with  1  denoting  failed  and  0  denoting  cen¬ 
sored).  For  example,  the  data  for  person  C  is  a 
survival  time  of  3 . 5  and  a  censorship  indicator  of 
0,  whereas  for  person  F  the  survival  time  is  3.5 
and  the  censorship  indicator  is  1 .  This  table  is  a 
simplified  illustration  of  the  type  of  data  to  be 
analyzed  in  a  survival  analysis. 

Notice  in  our  example  that  for  each  of  the  four 
persons  censored,  we  know  that  the  person’s 
true  survival  time  becomes  incomplete  at  the 
right  side  of  the  follow-up  period,  occurring 
when  the  study  ends  or  when  the  person  is 
lost  to  follow-up  or  is  withdrawn.  We  generally 
refer  to  this  kind  of  data  as  right-censored. 
For  these  data,  the  complete  survival  time 
interval,  which  we  don’t  really  know,  has  been 
cut  off  (i.e.,  censored)  at  the  right  side  of  the 
observed  survival  time  interval.  Although  data 
can  also  be  left-censored,  most  survival  data  is 
right-censored. 


F 
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Left-censored:  true  survival  time 
is  less  than  or  equal  to  the  observed 
survival  time 


h 

0 


HIV  exposure 


t 


HIV  + 
test 

— I - Time 

t 


Event  occurs  between  0  and  t 
but 

do  not  know  the  exact  time. 


Interval-censored:  true  survival 
time  is  within  a  known  time  interval 

HIV  -  HIV  + 

test  test 

- 1 - 5 - 1 -  Time 

0  h  1  t2 

HIV  exposure 


Left  censoring  >  ti  =  0,  t2  =  upper  bound 
Right  censoring  =►  ti  =  lower  bound,  t2  =  oo 


Right-censored  due  to  competing  risk, 
e.g.,  death  from  another  cause 
(1 

t2  =  OO 

gives  upper  bound  for  true  survival  time 
assuming  that  competing  risk  had  not 
occurred. 


Left-censored:  data  can  occur  when  a  person’s 
true  survival  time  is  less  than  or  equal  to  that 
person’s  observed  survival  time.  For  example, 
if  we  are  following  persons  until  they  become 
HIV  positive,  we  may  record  a  failure  when  a 
subject  first  tests  positive  for  the  virus.  How¬ 
ever,  we  may  not  know  the  exact  time  of  first 
exposure  to  the  virus,  and  therefore  do  not 
know  exactly  when  the  failure  occurred.  Thus, 
the  survival  time  is  censored  on  the  left  side 
since  the  true  survival  time,  which  ends  at 
exposure,  is  shorter  than  the  follow-up  time, 
which  ends  when  the  subject’s  test  is  positive. 

In  other  words,  if  a  person  is  left-censored 
at  time  t,  we  know  they  had  an  event  between 
time  0  and  t,  but  we  do  not  know  the  exact  time 
of  event. 

Survival  analysis  data  can  also  be  interval- 
censored,  which  can  occur  if  a  subject’s  true 
(but  unobserved)  survival  time  is  within  a 
certain  known  specified  time  interval.  As  an 
example,  again  considering  HIV  surveillance, 
a  subject  may  have  had  two  HIV  tests,  where 
he/she  was  HIV  negative  at  the  time  (say,  ti)  of 
the  first  test  and  HIV  positive  at  the  time  (t2) 
of  the  second  test.  In  such  a  case,  the  subject’s 
true  survival  time  occurred  after  time  t|  and 
before  time  t2,  i.e.,  the  subject  is  interval- 
censored  in  the  time  interval  (ti,  t2). 

Interval-censoring  actually  incorporates  both 
right-censoring  and  left-censoring  as  special 
cases.  Left-censored  data  occur  whenever  the 
value  of  H  is  0  and  t2  is  a  known  upper  bound 
on  the  true  survival  time.  In  contrast,  right- 
censored  data  occurs  whenever  the  value  of  t2 
is  infinity,  and  ti  is  a  known  lower  bound  on 
the  true  survival  time. 

If  an  individual  is  right-censored  due  to  a  com¬ 
peting  event  (e.g.,  death  from  another  cause), 
then  in  this  context,  we  consider  what  the  true 
survival  time  would  have  been  if  the  competing 
event  had  not  occurred.  In  other  words,  when  we 
state  that  the  value  of  the  upper  bound  for  the 
time  survival  time  is  infinity  for  right-censored 
data,  we  are  considering  what  would  have 
occurred  in  the  absence  of  a  competing  risk. 
Competing  risks  are  hilly  discussed  in  Chapter  9. 
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III.  Terminology  and 
Notation 


T=  survival  time  (T>  0) 


random  variable 


We  are  now  ready  to  introduce  basic  mathe¬ 
matical  terminology  and  notation  for  survival 
analysis.  First,  we  denote  by  a  capital  T  the 
random  variable  for  a  person’s  survival  time. 
Since  T  denotes  time,  its  possible  values  include 
all  nonnegative  numbers;  that  is,  T  can  be  any 
number  equal  to  or  greater  than  zero. 


t  =  specific  value  for  T 


EXAMPLE 


Survives  >  5  years? 
T  >  t  =  5 


Next,  we  denote  by  a  small  letter  t  any  spe¬ 
cific  value  of  interest  for  the  random  variable 
capital  T.  For  example,  if  we  are  interested  in 
evaluating  whether  a  person  survives  for  more 
than  5  years  after  undergoing  cancer  therapy, 
small  t  equals  5;  we  then  ask  whether  capital  T 
exceeds  5. 


d  =  (0,  1)  random  variable 

f  1  if  failure 
|  0  censored 

•  study  ends 

•  lost  to  follow-up 

•  withdraws 


Finally,  we  denote  the  small  letter  d  to  define  a 
(0,1)  random  variable  indicating  either  failure 
or  censorship.  That  is,  d  =  1  for  failure  if  the 
event  occurs  during  the  study  period,  or  d  = 
0  if  the  survival  time  is  censored  by  the  end  of 
the  study  period.  Note  that  if  a  person  does  not 
fail,  that  is,  does  not  get  the  event  during  the 
study  period,  censorship  is  the  only  remaining 
possibility  for  that  person’s  survival  time.  That 
is,  d  =  0  if  and  only  if  one  of  the  following 
happens:  a  person  survives  until  the  study 
ends,  a  person  is  lost  to  follow-up,  or  a  person 
withdraws  during  the  study  period. 


Sit)  =  survivor  function  We  next  introduce  and  describe  two  quantitative 

h{t)  =  hazard  function  terms  considered  in  any  survival  analysis.  These 

are  the  survivor  function,  denoted  by  Sit),  and 
the  hazard  function,  denoted  by  hit). 


S{t )  =  PiT  >  t) 


t _ Sit) 

1  S(l)  =  PiT  >  1) 

2  S(2)  =  PiT  >  2) 

3  S(3)  =  PiT  >  3) 


The  survivor  function  Sit)  gives  the  probability 
that  a  person  survives  longer  than  some  speci¬ 
fied  time  t:  that  is,  Sit)  gives  the  probability 
that  the  random  variable  T  exceeds  the  speci¬ 
fied  time  t. 

The  survivor  function  is  fundamental  to  a 
survival  analysis,  because  obtaining  survival 
probabilities  for  different  values  of  t  provides 
crucial  summary  information  from  survival 
data. 
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Theoretical  S(t ): 


Theoretically,  as  t  ranges  from  0  up  to  infinity, 
the  survivor  function  can  be  graphed  as  a 
smooth  curve.  As  illustrated  by  the  graph, 
where  t  identifies  the  X-axis,  all  survivor  func¬ 
tions  have  the  following  characteristics: 

•  they  are  nonincreasing;  that  is,  they  head 
downward  as  t  increases; 

•  at  time  t  =  0,  S(t)  =  S(0)  =  1;  that  is,  at  the 
start  of  the  study,  since  no  one  has  gotten 
the  event  yet,  the  probability  of  surviving 
past  time  0  is  one; 

•  at  time  t  =  oo,  S(t)  =  S(oc)  =  0;  that  is, 
theoretically,  if  the  study  period  increased 
without  limit,  eventually  nobody  would 
survive,  so  the  survivor  curve  must 
eventually  fall  to  zero. 


Note  that  these  are  theoretical  properties  of 
survivor  curves. 


S (f)  in  practice: 


In  practice,  when  using  actual  data,  we  usually 
obtain  graphs  that  are  step  functions,  as  illu¬ 
strated  here,  rather  than  smooth  curves.  More¬ 
over,  because  the  study  period  is  never  infinite 
in  length  and  there  may  be  competing  risks  for 
failure,  it  is  possible  that  not  everyone  studied 
gets  the  event.  The  estimated  survivor  function, 
denoted  by  a  caret  over  the  S  in  the  graph,  thus 
may  not  go  all  the  way  down  to  zero  at  the  end 
of  the  study. 


hit)  =  lim 

x  7  At-»0 


P(t  <T  <  t  +  At\T  >  t) 
At 


The  hazard  function,  denoted  by  h(t),  is  given 
by  the  formula:  hit)  equals  the  limit,  as  At 
approaches  zero,  of  a  probability  statement 
about  survival,  divided  by  At,  where  At  denotes 
a  small  interval  of  time.  This  mathematical  for¬ 
mula  is  difficult  to  explain  in  practical  terms. 
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h{t)  =  instantaneous  potential 


Before  getting  into  the  specifics  of  the  formula, 
we  give  a  conceptual  interpretation.  The  haz¬ 
ard  function  hit)  gives  the  instantaneous 
potential  per  unit  time  for  the  event  to 
occur,  given  that  the  individual  has  survived 
up  to  time  t.  Note  that,  in  contrast  to  the  survi¬ 
vor  function,  which  focuses  on  not  failing,  the 
hazard  function  focuses  on  failing,  that  is,  on 
the  event  occurring.  Thus,  in  some  sense,  the 
hazard  function  can  be  considered  as  giving 
the  opposite  side  of  the  information  given  by 
the  survivor  function. 

To  get  an  idea  of  what  we  mean  by  instanta¬ 
neous  potential,  consider  the  concept  of  veloc¬ 
ity.  If,  for  example,  you  are  driving  in  your  car 
and  you  see  that  your  speedometer  is  register¬ 
ing  60  mph,  what  does  this  reading  mean? 
It  means  that  if  in  the  next  hour,  you  continue 
to  drive  this  way,  with  the  speedometer  exactly 
on  60,  you  would  cover  60  miles.  This  reading 
gives  the  potential,  at  the  moment  you  have 
looked  at  your  speedometer,  for  how  many 
miles  you  will  travel  in  the  next  hour.  However, 
because  you  may  slow  down  or  speed  up  or 
even  stop  during  the  next  hour,  the  60-mph 
speedometer  reading  does  not  tell  you  the 
number  of  miles  you  really  will  cover  in  the 
next  hour.  The  speedometer  tells  you  only 
how  fast  you  are  going  at  a  given  moment ; 
that  is,  the  instrument  gives  your  instanta¬ 
neous  potential  or  velocity. 


Instantaneous  potential 


Similar  to  the  idea  of  velocity,  a  hazard  func¬ 
tion  hit)  gives  the  instantaneous  potential  at 
time  t  for  getting  an  event,  like  death  or  some 
disease  of  interest,  given  survival  up  to  time  t. 
The  “given”  part,  that  is,  surviving  up  to  time  t, 
is  analogous  to  recognizing  in  the  velocity 
example  that  the  speedometer  reading  at  a 
point  in  time  inherently  assumes  that  you  have 
already  traveled  some  distance  (i.e.,  survived) 
up  to  the  time  of  the  reading. 
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Given 


h(  t)  =  lim  - 
At  — >0 


P(t<T<  t  +  A t  |  T>t) 
At 


Conditional  probabilities:  P(A\B ) 

P(t  <  T  <  t  +  At  |  T  >  t) 

=  P(individual  fails  in  the  interval 
\t,  t  +  At]  |  survival  up  to  time  t) 


Hazard  function  =  conditional 
failure  rate 


P(t  <T<  t  +  At  \  T  >t) 


At 


Probability  per  unit  time 
Rate:  0  to  oo 


P  =  P{t  <T  <t  +  At\T  >  t ) 

P  At  P/At  =  rate 


if  =  °'67/day 


1 

14 


-week 


1/3 

1/14 


=  4.67/week 


In  mathematical  terms,  the  given  part  of  the 
formula  for  the  hazard  function  is  found  in 
the  probability  statement  in  the  numerator  to 
the  right  of  the  limit  sign.  This  statement  is  a 
conditional  probability  because  it  is  of  the 
form,  “P  of  A,  given  B,”  where  the  P  denotes 
probability  and  where  the  long  vertical  line 
separating  A  from  B  denotes  "given.”  In  the 
hazard  formula,  the  conditional  probability 
gives  the  probability  that  a  person’s  survival 
time,  T,  will  lie  in  the  time  interval  between 
t  and  t  +  At,  given  that  the  survival  time  is 
greater  than  or  equal  to  t.  Because  of  the 
given  sign  here,  the  hazard  function  is  some¬ 
times  called  a  conditional  failure  rate. 

We  now  explain  why  the  hazard  is  a  rate  rather 
than  a  probability.  Note  that  in  the  hazard 
function  formula,  the  expression  to  the  right 
of  the  limit  sign  gives  the  ratio  of  two  quanti¬ 
ties.  The  numerator  is  the  conditional  proba¬ 
bility  we  just  discussed.  The  denominator  is  At, 
which  denotes  a  small  time  interval.  By  this 
division,  we  obtain  a  probability  per  unit 
time,  which  is  no  longer  a  probability  but  a 
rate.  In  particular,  the  scale  for  this  ratio  is 
not  0  to  1,  as  for  a  probability,  but  rather 
ranges  between  0  and  infinity,  and  depends 
on  whether  time  is  measured  in  days,  weeks, 
months,  or  years,  etc. 

For  example,  if  the  probability,  denoted  here 
by  P,  is  1/3,  and  the  time  interval  is  one-half 
a  day,  then  the  probability  divided  by  the 
time  interval  is  1/3  divided  by  1/2,  which  equals 
0.67  per  day.  As  another  example,  suppose,  for 
the  same  probability  of  1/3,  that  the  time  inter¬ 
val  is  considered  in  weeks,  so  that  1/2  day 
equals  1/14  of  a  week.  Then  the  probability 
divided  by  the  time  interval  becomes  1/3  over 
1/14,  which  equals  14/3,  or  4.67  per  week.  The 
point  is  simply  that  the  expression  P  divided  by 
At  at  the  right  of  the  limit  sign  does  not  give 
a  probability.  The  value  obtained  will  give  a 
different  number  depending  on  the  units  of 
time  used,  and  may  even  give  a  number 
larger  than  one. 
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h(t)  = 


dim  \  P(t<T<  t  +A t  I  T>t) 
At 


Gives 

instantaneous 

potential 


When  we  take  the  limit  of  the  right-side  expres¬ 
sion  as  the  time  interval  approaches  zero,  we 
are  essentially  getting  an  expression  for  the 
instantaneous  probability  of  failing  at  time  t 
per  unit  time.  Another  way  of  saying  this  is 
that  the  conditional  failure  rate  or  hazard  func¬ 
tion  hit)  gives  the  instantaneous  potential  for 
failing  at  time  t  per  unit  time,  given  survival  up 
to  time  t. 


Hazrd  functions 


h(t) 


0 


t 


•  h(t)  >  0 

•  hit)  has  no  upper  bound 


As  with  a  survivor  function,  the  hazard  func¬ 
tion  h{t)  can  be  graphed  as  t  ranges  over  vari¬ 
ous  values.  The  graph  at  the  left  illustrates 
three  different  hazards.  In  contrast  to  a  survi¬ 
vor  function,  the  graph  of  hit)  does  not  have  to 
start  at  1  and  go  down  to  zero,  but  rather  can 
start  anywhere  and  go  up  and  down  in  any 
direction  over  time.  In  particular,  for  a  speci¬ 
fied  value  of  t,  the  hazard  function  hit)  has  the 
following  characteristics: 

•  it  is  always  nonnegative,  that  is,  equal  to  or 
greater  than  zero; 

•  it  has  no  upper  bound. 

These  two  features  follow  from  the  ratio 
expression  in  the  formula  for  hit),  because 
both  the  probability  in  the  numerator  and  the 
At  in  the  denominator  are  nonnegative,  and 
since  At  can  range  between  0  and  oo. 


EXAMPLE 


® 

Constant  hazard 

(exponential  model) 


h(t)  for  healthy  . 
persons 


t 


Now  we  show  some  graphs  of  different  types  of 
hazard  functions.  The  first  graph  given  shows  a 
constant  hazard  for  a  study  of  healthy  persons. 
In  this  graph,  no  matter  what  value  of  t  is  spe¬ 
cified,  hit)  equals  the  same  value — in  this  exam¬ 
ple,  X.  Note  that  for  a  person  who  continues 
to  be  healthy  throughout  the  study  period,  his/ 
her  instantaneous  potential  for  becoming  ill 
at  any  time  during  the  period  remains  constant 
throughout  the  follow-up  period.  When  the 
hazard  function  is  constant,  we  say  that  the 
survival  model  is  exponential.  This  term  fol¬ 
lows  from  the  relationship  between  the  survivor 
function  and  the  hazard  function.  We  will 
return  to  this  relationship  later. 
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1  EXAMPLE:  (continued)  1 

© 

T  Weibull 

h(t )  for  leukemia 
patients 

t 

© 

i  Weibull 

h(t )  for  Persons 
recovering  from 
surgery 

t 

© 

T  J,  lognormal 

h{t)  for  TB 
patients 

t 

The  second  graph  shows  a  hazard  function  that 
is  increasing  over  time.  An  example  of  this  kind 
of  graph  is  called  an  increasing  Weibull 
model.  Such  a  graph  might  be  expected  for 
leukemia  patients  not  responding  to  treatment, 
where  the  event  of  interest  is  death.  As  survival 
time  increases  for  such  a  patient,  and  as  the 
prognosis  accordingly  worsens,  the  patient’s 
potential  for  dying  of  the  disease  also  increases. 

In  the  third  graph,  the  hazard  function  is 
decreasing  over  time.  An  example  of  this  kind 
of  graph  is  called  a  decreasing  Weibull.  Such 
a  graph  might  be  expected  when  the  event 
is  death  in  persons  who  are  recovering  from 
surgery,  because  the  potential  for  dying  after 
surgery  usually  decreases  as  the  time  after  sur¬ 
gery  increases. 

The  fourth  graph  given  shows  a  hazard  func¬ 
tion  that  is  first  increasing  and  then  decreas¬ 
ing.  An  example  of  this  type  of  graph  is  the 
lognormal  survival  model.  We  can  expect 
such  a  graph  for  tuberculosis  patients,  since 
their  potential  for  dying  increases  early  in  the 
disease  and  decreases  later. 


S(t):  directly  describes  survival 
hit):  •  a  measure  of 

instantaneous  potential 

•  identify  specific  model 
form 

•  math  model  for  survival 
analysis 


Of  the  two  functions  we  have  considered,  S{t) 
and  hit),  the  survivor  function  is  more  natu¬ 
rally  appealing  for  analysis  of  survival  data, 
simply  because  S(f)  directly  describes  the  sur¬ 
vival  experience  of  a  study  cohort. 

However,  the  hazard  function  is  also  of  interest 
for  the  following  reasons: 


•  it  is  a  measure  of  instantaneous  potential 
whereas  a  survival  curve  is  a  cumulative 
measure  over  time; 

•  it  may  be  used  to  identify  a  specific  model 
form,  such  as  an  exponential,  a  Weibull,  or 
a  lognormal  curve  that  fits  one’s  data; 

•  it  is  the  vehicle  by  which  mathematical 
modeling  of  survival  data  is  carried  out; 
that  is,  the  survival  model  is  usually  written 
in  terms  of  the  hazard  function. 
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Relationship  of  S(t)  and  h(t): 

If  you  know  one,  you  can  deter¬ 
mine  the  other. 


EXAMPLE 


h(t)  =  X  if  and  only  if  S(t)  =  e  u 


Regardless  of  which  function  Sit)  or  hit )  one 
prefers,  there  is  a  clearly  defined  relation¬ 
ship  between  the  two.  In  fact,  if  one  knows 
the  form  of  Sit),  one  can  derive  the  corres¬ 
ponding  h(t),  and  vice  versa.  For  example, 
if  the  hazard  function  is  constant,  i.e.,  h(t)  = 
X,  for  some  specific  value  X,  then  it  can  be 
shown  that  the  corresponding  survival  func¬ 
tion  is  given  by  the  following  formula:  Sit) 
equals  e  to  the  power  minus  X  times  t. 


General  formulae: 


S{t)  =  exp 


hit) 


d  S{t)/dt 

.  W)  . 


More  generally,  the  relationship  between  Sit) 
and  hit)  can  be  expressed  equivalently  in  either 
of  two  calculus  formulae  shown  here. 

The  first  of  these  formulae  describes  how  the 
survivor  function  Sit)  can  be  written  in  terms 
of  an  integral  involving  the  hazard  function. 
The  formula  says  that  Sit)  equals  the  expo¬ 
nential  of  the  negative  integral  of  the  hazard 
function  between  integration  limits  of  0  and  t. 


The  second  formula  describes  how  the  hazard 
function  hit)  can  be  written  in  terms  of  a  deriv¬ 
ative  involving  the  survivor  function.  This  for¬ 
mula  says  that  hit)  equals  minus  the  derivative 
of  Sit)  with  respect  to  t  divided  by  Sit). 


In  any  actual  data  analysis,  a  computer  pro¬ 
gram  can  make  the  numerical  transformation 
from  Sit)  to  hit),  or  vice  versa,  without  the  user 
ever  having  to  use  either  formula.  The  point 
here  is  simply  that  if  you  know  either  Sit)  or 
hit),  you  can  get  the  other  directly. 


SUMMARY 

T  =  survival  time  random 
variable 

t  =  specific  value  of  T 
d  =  (0.1)  variable  for  failure/ 
censorship 

Sit)  =  survivor  function 
hit)  =  hazard  function 


At  this  point,  we  have  completed  our  dis¬ 
cussion  of  key  terminology  and  notation.  The 

key  notation  is  T  for  the  survival  time  vari¬ 
able,  t  for  a  specified  value  of  T,  and  d  for 
the  dichotomous  variable  indicating  event 
occurrence  or  censorship.  The  key  terms 
are  the  survivor  function  S(t)  and  the 
hazard  function  h(t),  which  are  in  essence 
opposed  concepts,  in  that  the  survivor  func¬ 
tion  focuses  on  surviving  whereas  the  hazard 
function  focuses  on  failing,  given  survival  up 
to  a  certain  time  point. 
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IV.  Goals  of  Survival  We  now  state  the  basic  goals  of  survival  analysis. 

Analysis 

Goal  1:  To  estimate  and  interpret  survivor  and/ 
or  hazard  functions  from  survival  data. 


Goal  2:  To  compare  survivor  and/or  hazard 
functions. 


Goal  3:  To  assess  the  relationship  of  explana¬ 
tory  variables  to  survival  time. 


Treatment 


Regarding  the  first  goal,  consider,  for  example, 
the  two  survivor  functions  pictured  at  the  left, 
which  give  very  different  interpretations .  The 
function  farther  on  the  left  shows  a  quick  drop 
in  survival  probabilities  early  in  follow-up  but  a 
leveling  off  thereafter.  The  function  on  the 
right,  in  contrast,  shows  a  very  slow  decrease 
in  survival  probabilities  early  in  follow-up  but  a 
sharp  decrease  later  on. 

We  compare  survivor  functions  for  a  treat¬ 
ment  group  and  a  placebo  group  by  graphing 
these  functions  on  the  same  axis.  Note  that  up 
to  6  weeks,  the  survivor  function  for  the  treat¬ 
ment  group  lies  above  that  for  the  placebo 
group,  but  thereafter  the  two  functions  are  at 
about  the  same  level.  This  dual  graph  indicates 
that  up  to  6  weeks  the  treatment  is  more 
effective  for  survival  than  the  placebo  but  has 
about  the  same  effect  thereafter. 


Goal  3:  Use  math  modeling,  e.g., 

Cox  proportional  hazards  Goal  3  usually  requires  using  some  form  of 

mathematical  modeling,  for  example,  the  Cox 
proportional  hazards  approach,  which  will  be 
the  subject  of  subsequent  chapters. 


V.  Basic  Data  Layout 
for  Computer 


Two  types  of  data  layouts: 

•  for  computer  use 

•  for  understanding 


We  previously  considered  some  examples  of 
survival  analysis  problems  and  a  simple  data 
set  involving  six  persons.  We  now  consider  the 
general  data  layout  for  a  survival  analysis. 
We  will  provide  two  types  of  data  layouts,  one 
giving  the  form  appropriate  for  computer  use, 
and  the  other  giving  the  form  that  helps  us 
understand  how  a  survival  analysis  works. 
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For  computer: 


Indiv.  # 

t 

d 

*i 

*2’ 

1 

h 

d\ 

*11 

*12  ' 

‘  *  *1 „ 

2 

h 

d2 

*21 

*22 

'  *  *2p 

We  start  by  providing,  in  the  table  shown  here, 
the  basic  data  layout  for  the  computer.  Assume 
that  we  have  a  data  set  consisting  of  n  persons. 
The  first  column  of  the  table  identifies  each 
person  from  1,  starting  at  the  top,  to  n,  at  the 
bottom. 


The  remaining  columns  after  the  first  one  pro¬ 
vide  survival  time  and  other  information  for 
each  person.  The  second  column  gives  the  sur¬ 
vival  time  information,  which  is  denoted  t\  for 
individual  1,  t2  for  individual  2,  and  so  on,  up 
to  tn  for  individual  n.  Each  of  these  t’s  gives  the 
observed  survival  time  regardless  of  whether 
the  person  got  the  event  or  is  censored.  For 
example,  if  person  5  got  the  event  at  3  weeks 
of  follow-up,  then  t5  =  3;  on  the  other  hand, 
if  person  8  was  censored  at  3  weeks,  without 
getting  the  event,  then  ts  =  3  also. 


To  distinguish  persons  who  get  the  event  from 
those  who  are  censored,  we  turn  to  the  third 
column,  which  gives  the  information  for  status 
(i.e.,  d)  the  dichotomous  variable  that  indicates 
censorship  status. 


Thus,  d\  is  1  if  person  1  gets  the  event  or  is  0  if 
person  1  is  censored;  d2  is  1  or  0  similarly,  and 
so  on,  up  through  dn.  In  the  example  just  con¬ 
sidered,  person  5,  who  failed  at  3  weeks,  has  a  d 
of  1;  that  is,  d5  equals  1.  In  contrast,  person  8, 
who  was  censored  at  3  weeks,  has  a  d  of  0;  that 
is,  ds  equals  0. 

Note  that  if  all  of  the  d  values  in  this  column  are 
added  up,  their  sum  will  be  the  total  number  of 
failures  in  the  data  set.  This  total  will  be  some 
number  equal  to  or  less  than  n,  because  not 
every  one  may  fail. 

The  remainder  of  the  information  in  the  table 
gives  values  for  explanatory  variables  of  inter¬ 
est.  An  explanatory  variable,  X;  is  any  variable 
like  age  or  exposure  status,  E,  or  a  product 
term  like  age  x  race  that  the  investigator 
wishes  to  consider  to  predict  survival  time. 
These  variables  are  listed  at  the  top  of  the 
table  as  Xj,  X2,  and  so  on,  up  to  Xp.  Below 
each  variable  are  the  values  observed  for  that 
variable  on  each  person  in  the  data  set. 
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Columns 


r 

'I 

# 

t 

d 

X 

x"'  x 

r  1 

h 

dx 

Xi 

xv  •  •  •  Xp 

2 

h 

d2 

Xi 

X2  • • • Xp 

• 

• 

• 

• 

• 

• 

(t 

ti 

di 

X, 

*a  XP) 

• 

. 

• 

. 

* 

n 

d„ 

Xi 

Xi"  '  Xp 

EXAMPLE 

The  data:  Remission  times  (in  weeks) 
for  two  groups  of  leukemia  patients 

Group  1 

(Treatment)  n  =  21 

Group  2 

(Placebo)  n  =  21 

6,  6,  6,  7,  10, 

13,  16,  22,  23, 

6+,  9+,  10+,  11+, 
17+,  19+,  20+, 

25+,  32+,  32+, 

34+, 35+ 

I,  1,  2,  2,  3, 

4,  4,  5,  5, 

8,  8,  8,  8, 

II,  11,  12,  12, 

15,  17,  22,  23 

In  remission 
at  study  end 

+  denotes 

.  Lost  to 

censored 

follow-up 

Withdraws 

For  example,  in  the  column  corresponding  to  X\ 
are  the  values  observed  on  this  variable  for  all 
n  persons.  These  values  are  denoted  as  Xn,  X2\, 
and  so  on,  up  to Xnl;  the  first  subscript  indicates 
the  person  number,  and  the  second  subscript,  a 
one  in  each  case  here,  indicates  the  variable 
number.  Similarly,  the  column  corresponding 
to  variable  X2  gives  the  values  observed  on  X2 
for  all  n  persons.  This  notation  continues  for  the 
other  X  variables  up  through  Xp. 

We  have  thus  described  the  basic  data  layout 
by  columns.  Alternatively,  we  can  look  at  the 
table  line  by  line,  that  is,  by  rows.  For  each  line 
or  row,  we  have  the  information  obtained  on 
a  given  individual.  Thus,  for  individual  i, 
the  observed  information  is  given  by  the 
values  t{,  dit  Xn,  Xi2,  etc.,  up  to  Xip.  This  is  how 
the  information  is  read  into  the  computer, 
that  is,  line  by  line,  until  all  persons  are 
included  for  analysis. 

As  an  example  of  this  data  layout,  consider  the 
following  set  of  data  for  two  groups  of  leuke¬ 
mia  patients:  one  group  of  21  persons  has 
received  a  certain  treatment;  the  other  group 
of  21  persons  has  received  a  placebo.  The  data 
come  from  Freireich  et  al.,  Blood,  1963. 

As  presented  here,  the  data  are  not  yet  in 
tabular  form  for  the  computer,  as  we  will  see 
shortly.  The  values  given  for  each  group  consist 
of  time  in  weeks  a  patient  is  in  remission,  up  to 
the  point  of  the  patient’s  either  going  out  of 
remission  or  being  censored.  Here,  going  out 
of  remission  is  a  failure.  A  person  is  censored  if 
he  or  she  remains  in  remission  until  the  end  of 
the  study,  is  lost  to  follow-up,  or  withdraws 
before  the  end  of  the  study.  The  censored  data 
here  are  denoted  by  a  plus  sign  next  to  the 
survival  time. 
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1  EXAMPLE:  (continued)  I 

Group  1 

Group  2 

(Treatment) 

n  =21 

(Placebo) 

n  =  21 

6,  6,  6,  7,  10, 

1,  1,  2,  2, 

3, 

13,  16,  22,  23, 

4,  4,  5,  5, 

6+,  9+,  10+,  11+, 

8,  8,  8,  8, 

17+,  19+,  20+, 

11,  11,  12,  12, 

25+,  32+,  32+, 

15,  17,  22,  23 

34+, 35+ 

#  failed 

#  censored 

Total 

Group  1 

9 

12 

21 

Group  2 

21 

0 

21 

Indiv. 

d  (failed  or 

X 

# 

t  (weeks) 

censored) 

(Group) 

1 

6 

1 

1 

2 

6 

1 

1 

© 

6 

1 

1 

4 

7 

1 

1 

5 

10 

1 

1 

6 

13 

1 

1 

7 

16 

1 

1 

8 

22 

1 

1 

GROUP  9 

23 

1 

1 

1  10 

6 

0 

1 

11 

9 

0 

1 

12 

10 

0 

1 

13 

11 

0 

1 

@ 

17 

0 

1 

15 

19 

0 

1 

16 

20 

0 

1 

17 

25 

0 

1 

18 

32 

0 

1 

19 

32 

0 

1 

20 

34 

0 

1 

21 

35 

0 

1 

Here  are  the  data  again: 

Notice  that  the  first  three  persons  in  group  1 
went  out  of  remission  at  6  weeks;  the  next 
6  persons  also  went  out  of  remission,  but  at 
failure  times  ranging  from  7  to  23.  All  of  the 
remaining  persons  in  group  1  with  pluses  next 
to  their  survival  times  are  censored.  For  exam¬ 
ple,  on  line  three  the  first  person  who  has  a  plus 
sign  next  to  a  6  is  censored  at  6  weeks.  The 
remaining  persons  in  group  1  are  also  cen¬ 
sored,  but  at  times  ranging  from  9  to  35  weeks. 

Thus,  of  the  2 1  persons  in  group  1 ,  nine  failed 
during  the  study  period,  whereas  the  last  12 
were  censored.  Notice  also  that  none  of  the 
data  in  group  2  is  censored;  that  is,  all  21  per¬ 
sons  in  this  group  went  out  of  remission  during 
the  study  period. 

We  now  put  this  data  in  tabular  form  for  the 
computer,  as  shown  at  the  left.  The  list  starts 
with  the  21  persons  in  group  1  (listed  1-21)  and 
follows  (on  the  next  page)  with  the  2 1  persons 
in  group  2  (listed  22-42).  Our  n  for  the  com¬ 
posite  group  is  42. 

The  second  column  of  the  table  gives  the  sur¬ 
vival  times  in  weeks  for  all  42  persons.  The 
third  column  indicates  failure  or  censorship 
for  each  person.  Finally,  the  fourth  column 
lists  the  values  of  the  only  explanatory  variable 
we  have  considered  so  far,  namely,  group  sta¬ 
tus,  with  1  denoting  treatment  and  0  denoting 
placebo. 

If  we  pick  out  any  individual  and  read  across 
the  table,  we  obtain  the  line  of  data  for  that 
person  that  gets  entered  in  the  computer. 
For  example,  person  #3  has  a  survival  time  of 
6  weeks,  and  since  d=  1,  this  person  failed,  that 
is,  went  out  of  remission.  The  X  value  is  1 
because  person  #3  is  in  group  1.  As  a  second 
example,  person  #14,  who  has  an  observed  sur¬ 
vival  time  of  17  weeks,  was  censored  at  this 
time  because  d  =  0.  The  X  value  is  again  1 
because  person  #14  is  also  in  group  1. 
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EXAMPLE:  (continued) 


Indiv.  #  t  (weeks) 

d  (failed 
or 

censored) 

X 

(Group) 

22 

1 

1 

0 

23 

1 

1 

0 

24 

2 

1 

0 

25 

2 

1 

0 

26 

3 

1 

0 

27 

4 

1 

0 

GROUP  28 

4 

1 

0 

2  29 

5 

1 

0 

30 

5 

1 

0 

31 

8 

1 

0 

© 

8 

1 

0 

33 

8 

1 

0 

34 

8 

1 

0 

35 

11 

1 

0 

36 

11 

1 

0 

37 

12 

1 

0 

38 

12 

1 

0 

39 

15 

1 

0 

40 

17 

1 

0 

41 

22 

1 

0 

42 

23 

1 

0 

As  one  more  example,  this  time  from  group  2, 
person  #32  survived  8  weeks  and  then  failed, 
because  d  =  1 ;  the  X  value  is  0  because  person 
#32  is  in  group  2. 


Alternative  Data  Layout:  Counting  An  alternative  format  for  the  computer  is  called 
Process  (Start,  Stop)  Format  the  Counting  Process  (CP)  format. 


CP  Format:  applies  to  more  com¬ 
plicated  survival  analysis 

•  Age-at  follow-up  is  outcome 

•  Time-dependent  variables 

•  Recurrent  events 

•  Gaps  in  follow-up 


The  CP  format  is  useful  for  more  complicated 
survival  analysis  situations  that  we  discuss 
in  later  chapters,  in  particular  when  age-at- 
follow-up  time  is  used  as  the  outcome  variable 
instead  of  time  of  follow-up  (Chap.  3),  when 
there  are  time-dependent  variables  (Chap.  6), 
and  when  there  are  recurrent  events  and/or 
gaps  in  follow-up  (Chap.  8). 


The  general  CP  format  is  shown  on  the  left. 
This  format  differs  from  the  previously 
described  “standard”  data  layout  in  two  ways. 
First,  the  CP  format  allows  multiple  lines  of 
data  for  the  same  individual;  that  is,  each 
individual’s  total  at-risk-follow-up  time  is  sub¬ 
divided  into  smaller  time  intervals  to  allow 
for  recurrent  events  on  the  same  individual. 
Second,  there  are  two  time  points  specified 
for  each  individual,  labeled  in  the  layout  as 
fro  and  ty  i ,  and  often  referred  to  as  START  and 
STOP  times,  respectively. 


Presentation:  V.  Basic  Data  Layout  for  Computer  21 


The  first  two  columns  in  this  format  are 
labeled  i  (for  subject  number)  and  j  (for  data- 
line  number  for  the  ith  subject).  As  in  the  stan¬ 
dard  format,  i  ranges  from  1  to  n;  also,  in 
the  CP  format,  j  ranges  from  1  to  ri7  where  rj 
denotes  the  number  of  datalines  for  the  i-th 
subject. 

The  third  column  labeled  dy  denotes  the  failure 
status  (l=failed,  0=censored)  for  the  j-th  data- 
line  on  the  i-th  subject. 

The  next  two  columns  identify  two  time  points 
required  for  each  dataline,  the  START  time 
(tyo)  and  the  STOP  time  (tyi).  These  two  col¬ 
umns  are  the  primary  distinguishing  feature  of 
the  CP  format. 


Simplest  CP  format:  1  dataline/ 
subject 


i  i  dij 

*ijO  tijl 

Nji  • 

•  Xijp 

1 

i  du 

0  t, 

xm  • 

•  xllp 

i 

i  dji 

0  ti 

Xill 

•  Xilp 

n 

1  d„i 

0  q 

Xnll  * 

••  xnlp 

The  simplest  CP  format  occurs  when  the  out¬ 
come  is  follow-up  time  since  study  entry  and 
when  there  are  no  recurrent  events  or  time- 
dependent  covariates,  as  in  our  previously 
described  Remission  Time  Dataset.  In  this  sit¬ 
uation,  there  is  one  dataline  for  each  subject 
(i.e.,  r:=l  for  all  i  so  that  the  only  value  that  j 
takes  is  1),  the  start  time  (tn0)  is  0  for  each 
subject,  and  the  stop  time  (tm)  is  the  follow¬ 
up  time  (t)  until  either  the  event  or  censorship 
occurs. 


CP  Format  for  Group  1  of  Remi¬ 
ssion  Time  Dataset 


i 

j 

“ij 

start 

stop' 

X(Group) 

1 

1 

1 

0 

6 

1 

2 

1 

1 

0 

6 

1 

3 

1 

1 

0 

6 

1 

4 

1 

1 

0 

7 

1 

5 

1 

1 

0 

10 

1 

6 

1 

1 

0 

13 

1 

7 

1 

1 

0 

16 
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As  an  example,  the  CP  format  for  Group  1  of 
the  Remission  Time  Dataset  is  shown  on  the 
left.  Note  that  the  value  of  j  is  1  throughout  the 
table,  the  start  times  are  all  zero,  and  the  stop 
times  are  the  failure  or  censored  survival  times 
for  each  subject. 
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CP  Format:  First  15  Subjects- 
Bladder  Canscer  Study 
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We  now  illustrate  the  CP  format  that  allows  for 
more  than  one  dataline  per  subject  as  well  as 
start  times  other  than  zero.  We  consider  data 
on  the  first  1 5  subjects  from  a  study  of  recur¬ 
rent  bladder  cancer  tumors  (Byar,  1980;  and 
Wei,  Lin  and  Weissfeld,  1989).  The  entire  data¬ 
set  contained  86  patients,  each  followed  for  a 
variable  amount  of  time  up  to  64  months. 
We  describe  how  to  analyze  this  dataset  in 
Chapter  8  on  Recurrent  Event  Survival  Analy¬ 
sis.  Here,  we  only  describe  how  this  data  layout 
fits  the  CP  format. 

The  event  being  analyzed  is  the  recurrence  of 
bladder  cancer  tumor  after  transurethral  sur¬ 
gical  excision.  Each  recurrence  of  new  tumors 
was  treated  by  removal  at  each  examination. 

The  exposure  variable  of  interest  is  drug  treat¬ 
ment  status  (tx,  0=placebo,  1=  treatment  with 
thiotepa).  Although  each  of  15  subjects  shown 
here  are  in  the  placebo  group  (tx=0),  several 
other  subjects  in  the  larger  dataset  are  in  the 
treatment  group  (tx=l). 

The  covariates  listed  here  are  initial  number 
of  tumors  (num)  and  initial  size  of  tumors 
(size)  in  centimeters.  Both  these  variables 
have  the  same  value  for  each  subject  (i.e., 
time-independent  variables),  although  the  gen¬ 
eral  data  layout  also  allows  for  time-dependent 
variables. 

Notice  that  several  subjects  in  this  dataset, 
namely  subjects  6,  9,  10,  12,  13,  14,  and  15 
have  two  or  more  datalines.  Subject  6,  for 
example,  has  two  datalines  (i.e.,  r6  =  2), 
whereas  subject  14  has  4  datalines  (i.e.,  r14  =  4). 
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Bladder  Canscer  Study  (cont’d) 
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CP  format  illustrated  for  other 
situations  in  later  chapters. 


See  Computer  Appendix  for  com¬ 
puter  code  in  CD  format  for  SAS, 
STATA,  and  R. 


The  first  of  the  two  lines  for  subject  6  tells  us  that 
this  subject  had  a  (first)  recurrent  bladder  cancer 
event  (i.e.,  d=l)  at  6  months  (i.e.,  stop  =6).  This 
subject  was  then  followed  for  another  4  months 
(from  6  to  10,  as  shown  on  the  second  dataline 
for  this  subject,  where  the  start  time  is  6  and  the 
stop  time  is  10.  At  10  months,  the  subject  is 
censored  (d=0);  in  other  words,  this  subject  did 
not  get  a  second  recurrent  event  at  10  months, 
after  which  no  further  information  is  available. 

Subject  14  had  three  recurrent  events,  the  first 
one  at  3  months  (i.e.,  stop  =3),  the  second  one 
at  9  months  (i.e.,  stop  =9),  and  the  third  one  at 
21  months  (i.e.,  stop  =21).  This  subject  was 
followed  for  another  2  months  (start  =21  to 
stop  =23  on  dataline  number  j=4)  without 
another  event  occurring  (d=0). 

As  mentioned  at  the  beginning  of  this  section, 
the  CP  format  is  also  applicable  when  age-at- 
follow-up  time  is  used  as  the  outcome  variable 
instead  of  time  of  follow-up  (Chapter  3),  when 
there  are  time-dependent  variables  (Chapter  6), 
and  when  there  are  gaps  in  follow-up  ( Chapter  8) . 
We  will  illustrate  these  latter  situations  within 
the  later  chapters  just  mentioned. 

In  the  Computer  Appendix,  we  describe  the 
computer  code  required  by  SAS,  STATA,  and 
R  packages  when  the  data  is  set  up  in  CP  for¬ 
mat  for  the  analysis  of  recurrent  event  survival 
data  and  when  age  is  used  as  the  time  scale 
instead  of  time-on-study. 
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We  are  now  ready  to  look  at  another  data  lay¬ 
out,  which  is  shown  at  the  left.  This  layout 
helps  provide  some  understanding  of  how  a 
survival  analysis  actually  works  and,  in  parti¬ 
cular,  how  survivor  curves  are  derived. 
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The  first  column  in  this  table  gives  ordered 
failure  times.  These  are  denoted  by  t’s  with 
subscripts  within  parentheses,  starting  with 
t( o),  then  u  and  so  on,  up  to  t(/c).  Note  that 
the  parentheses  surrounding  the  subscripts 
distinguish  ordered  failure  times  from  the  sur¬ 
vival  times  previously  given  in  the  computer 
layout. 
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To  get  ordered  failure  times  from  survival 
times,  we  must  first  remove  from  the  list  of 
unordered  survival  times  all  those  times  that 
are  censored;  we  are  thus  working  only  with 
those  times  at  which  people  failed.  We  then 
order  the  remaining  failure  times  from  smal¬ 
lest  to  largest,  and  count  ties  only  once.  The 
value  k  gives  the  number  of  distinct  times  at 
which  subjects  failed. 

For  example,  using  the  remission  data  for 
group  1 ,  we  find  that  9  of  the  2 1  persons  failed, 
including  3  persons  at  6  weeks  and  1  person 
each  at  7,  10,  13,  16,  22,  and  23  weeks.  These 
nine  failures  have  k  =  7  distinct  survival  times, 
because  three  persons  had  survival  time  6  and 
we  only  count  one  of  these  6’s  as  distinct.  The 
first  ordered  failure  time  for  this  group, 
denoted  as  f(p,  is  6;  the  second  ordered  failure 
time  f(2),  is  7,  and  so  on  up  to  the  seventh 
ordered  failure  time  of  23. 

Turning  to  group  2,  we  find  that  although 
all  21  persons  in  this  group  failed,  there  are 
several  ties.  For  example,  two  persons  had  a 
survival  time  of  1  week;  two  more  had  a  sur¬ 
vival  time  of  2  weeks;  and  so  on.  In  all,  we  find 
that  there  were  k  =  12  distinct  survival  times 
out  of  the  21  failures.  These  times  are  listed  in 
the  first  column  for  group  2. 

Note  that  for  both  groups  we  inserted  a  row  of 
data  giving  information  at  time  0.  We  will 
explain  this  insertion  when  we  get  to  the  third 
column  in  the  table. 

The  second  column  in  the  data  layout  gives 
frequency  counts,  denoted  by  nif,  of  those  per¬ 
sons  who  failed  at  each  distinct  failure  time. 
When  there  are  no  ties  at  a  certain  failure 
time,  then  nif=  1.  Notice  that  in  group  1,  there 
were  three  ties  at  6  weeks  but  no  ties  thereafter. 
In  group  2,  there  were  ties  at  1,  2,  4,  5,  8,  11, 
and  12  weeks.  In  any  case,  the  sum  of  all  the 
nifS  in  this  column  gives  the  total  number  of 
failures  in  the  group  tabulated.  This  sum  is 
9  for  group  1  and  21  for  group  2. 
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The  third  column  gives  frequency  counts, 
denoted  by  qf,  of  those  persons  censored  in 
the  time  interval  starting  with  failure  time  f(Q 
up  to  the  next  failure  time  denoted  t(f+1).  Tech¬ 
nically,  because  of  the  way  we  have  defined  this 
interval  in  the  table,  we  include  those  persons 
censored  at  the  beginning  of  the  interval. 

For  example,  the  remission  data,  for  group  1 
includes  5  nonzero  q  s:  q\  =  1,  q2  =  1,  q 3  =  2, 
q$  =  3,  q-7  =  5.  Adding  these  values  gives  us 
the  total  number  of  censored  observations  for 
group  1,  which  is  12.  Moreover,  if  we  add  the 
total  number  of  q  s  (12)  to  the  total  number  of 
m’s  (9),  we  get  the  total  number  of  subjects  in 
group  1 ,  which  is  2 1 . 

We  now  focus  on  group  1  to  look  a  little  closer 
at  the  q  s.  At  the  left,  we  list  the  unordered 
group  1  information  followed  (on  the  next 
page)  by  the  ordered  failure  time  information. 
We  will  go  back  and  forth  between  these  two 
tables  (and  pages)  as  we  discuss  the  q  s.  Notice 
that  in  the  table  here,  one  person,  listed  as  #10, 
was  censored  at  week  6.  Consequently,  in  the 
table  at  the  top  of  the  next  page,  we  have  q  1  =  1 , 
which  is  listed  on  the  second  line  corres¬ 
ponding  to  the  ordered  failure  time  /( q,  which 
equals  6. 

The  next  q  is  a  little  trickier,  it  is  derived  from 
the  person  who  was  listed  as  #  1 1  in  the  table 
here  and  was  censored  at  week  9.  Correspond¬ 
ingly,  in  the  table  at  the  top  of  the  next  page, 
we  have  q2  =  1  because  this  one  person  was 
censored  within  the  time  interval  that  starts  at 
the  second  ordered  failure  time,  7  weeks,  and 
ends  just  before  the  third  ordered  failure  time, 
10  weeks.  We  have  not  counted  person  #12 
(who  was  censored  at  week  10)  here  because 
this  person’s  censored  time  is  exactly  at  the  end 
of  the  interval.  We  count  this  person  in  the 
following  interval. 
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We  now  consider,  from  the  table  of  unordered 
failure  times,  person  #12  who  was  censored  at 
10  weeks,  and  person  #13,  who  was  censored 
at  1 1  weeks.  Turning  to  the  table  of  ordered 
failure  times,  we  see  that  these  two  times  are 
within  the  third  ordered  time  interval,  which 
starts  and  includes  the  1 0-week  point  and  ends 
just  before  the  1 3th  week.  As  for  the  remaining 
q  s,  we  will  let  you  figure  them  out  for  practice. 

One  last  point  about  the  q  information.  We 
inserted  a  row  at  the  top  of  the  data  for  each 
group  corresponding  to  time  0.  This  insertion 
allows  for  the  possibility  that  persons  may  be 
censored  after  the  start  of  the  study  but  before 
the  first  failure.  In  other  words,  it  is  possible 
that  t/o  may  be  nonzero.  For  the  two  groups  in 
this  example,  however,  no  one  was  censored 
before  the  first  failure  time. 

The  last  column  in  the  table  gives  the  “risk 
set.”  The  risk  set  is  not  a  numerical  value  or 
count  but  rather  a  collection  of  individuals. 
By  definition,  the  risk  set  R(t (p)  is  the  collection 
of  individuals  who  have  survived  at  least  to 
time  tq))  that  is,  each  person  in  R(t<f))  has  a 
survival  time  that  is  tq)  or  longer,  regardless 
of  whether  the  person  has  failed  or  is  censored. 

For  example,  we  see  that  at  the  start  of  the 
study  everyone  in  group  1  survived  at  least 
0  weeks,  so  the  risk  set  at  time  0  consists  of 
the  entire  group  of  21  persons.  The  risk  set  at 
6  weeks  for  group  1  also  consists  of  all  2 1  per¬ 
sons,  because  all  21  persons  survived  at  least  as 
long  as  6  weeks.  These  21  persons  include  the 
3  persons  who  failed  at  6  weeks,  because 
they  survived  and  were  still  at  risk  just  up  to 
this  point. 
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\0  0/ 

'  21  persons  survive  >  0  wks 

1(1)= 6 

v 

21  persons  survive  >  6  wks 

%) =  7 

1/M 

17  persons  survive  >  7  wks 

l(3)  =  10  / 

A  X 

15  persons  survive  >10  wks 

1(4)  =  © 

1  0 

<T5jpersons  survive  >  13~wks> 

1(5)=  16 

1  3 

11  persons  survive  >16  wks 

1,5,  =  22 

1  0 

7  persons  survive  >  22  wks 

1(7)=  23 

1  5 

6  persons  survive  >  23  wks 

Totals 

9  12 

How  we  work  with  censored 
data:  Use  all  information  up  to 
time  of  censorship;  don’t  throw 
away  information. 


Now  let’s  look  at  the  risk  set  at  7  weeks.  This  set 
consists  of  1 7  persons  in  group  1  that  survived 
at  least  7  weeks.  We  omit  everyone  in  the  X-ed 
area.  Of  the  original  21  persons,  we  therefore 
have  excluded  the  three  persons  who  failed  at 

6  weeks  and  the  one  person  who  was  censored 
at  6  weeks.  These  four  persons  did  not  survive 
at  least  7  weeks.  Although  the  censored  person 
may  have  survived  longer  than  7  weeks,  we 
must  exclude  him  or  her  from  the  risk  set  at 

7  weeks  because  we  have  information  on  this 
person  only  up  to  6  weeks. 

To  derive  the  other  risk  sets,  we  must  exclude 
all  persons  who  either  failed  or  were  censored 
before  the  start  of  the  time  interval  being  con¬ 
sidered.  For  example,  to  obtain  the  risk  set  at 
1 3  weeks  for  group  1 ,  we  must  exclude  the  five 
persons  who  failed  before,  but  not  including, 
13  weeks  and  the  four  persons  who  were 
censored  before,  but  not  including,  13  weeks. 
Subtracting  these  9  persons  from  21,  leaves 
12  persons  in  group  1  still  at  risk  for  getting 
the  event  at  13  weeks.  Thus,  the  risk  set  con¬ 
sists  of  these  12  persons. 

The  importance  of  the  table  of  ordered  failure 
times  is  that  we  can  work  with  censored  obser¬ 
vations  in  analyzing  survival  data.  Even  though 
censored  observations  are  incomplete,  in  that 
we  don’t  know  a  person’s  survival  time  exactly, 
we  can  still  make  use  of  the  information  we 
have  on  a  censored  person  up  to  the  time 
we  lose  track  of  him  or  her.  Rather  than  simply 
throw  away  the  information  on  a  censored 
person,  we  use  all  the  information  we  have  on 
such  a  person  up  until  time  of  censorship. 
(Nevertheless,  most  survival  analysis  techni¬ 
ques  require  a  key  assumption  that  censoring 
is  independent,  i.e.,  censored  subjects  are  not 
at  increased  risk  for  failure.  See  Chap.  9  on 
competing  risks  for  further  details.) 
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EXAMPLE  I 

k  o 

m  f 

<n 

fUt(o) 

6 

3 

l 

21  persons 

7 

1 

l 

V 

17  persons 

10 

1 

2 

V 

1 5  persons 

13 

1 

0 

V 

12  persons 

16 

1 

© 

✓ 

1 1  persons 

22 

1 

0 

7  persons 

23 

1 

5 

6  persons 

For  example,  for  the  three  persons  in  group  1 
who  were  censored  between  the  1 6th  and  22nd 
weeks,  there  are  at  least  16  weeks  of  survival 
information  on  each  that  we  don’t  want  to  lose. 
These  three  persons  are  contained  in  all  risk 
sets  up  to  the  16th  week;  that  is,  they  are  each 
at  risk  for  getting  the  event  up  to  16  weeks.  Any 
survival  probabilities  determined  before,  and 
including,  16  weeks  should  make  use  of  data 
on  these  three  persons  as  well  as  data  on  other 
persons  at  risk  during  the  first  16  weeks. 


Flaving  introduced  the  basic  terminology  and 
data  layouts  to  this  point,  we  now  consider 
some  data  analysis  issues  and  some  additional 
applications. 


VII.  Descriptive 

Measures  of  Survival 
Experience 


EXAMPLE 

Remission  times  (in  weeks)  for  two 

groups  of  leukemia  patients 

Group  1 

Group  2 

(Treatment)  n  —  21 

(Placebo)  n  —  21 

6,  6,  6,  7,  10, 

1,  1,  2,  2,  3, 

13,  16,  22,  23, 

4,  4,  5,  5, 

6+,  9+,  10+,  11+, 

00 

GO 

00 

00 

17+,  19+,  20+, 

11,  11,  12,  12, 

25+,  32+,  32+, 

34+,  35+ 

15,  17,  22,  23 

T\  (ignoring  +  ’s)  =  17.1  72  =  8.6 


hl=—=  .025 
359 


21 

*2=!82=115 


Average  hazard  rate  (h)  —  #  l'>i'llJri's 

E" 


We  first  return  to  the  remission  data,  again 
shown  in  untabulated  form.  Inspecting  the 
survival  times  given  for  each  group,  we  can 
see  that  most  of  the  treatment  group’s  times 
are  longer  than  most  of  the  placebo  group’s 
times.  If  we  ignore  the  plus  signs  denoting 
censorship  and  simply  average  all  21  survival 
times  for  each  group  we  get  an  average, 
denoted  by  T  “bar,”  of  17.1  weeks  survival  for 
the  treatment  group  and  8.6  weeks  for  the 
placebo  group.  Because  several  of  the  treat¬ 
ment  group’s  times  are  censored,  this  means 
that  group  l’s  true  average  is  even  larger  than 
what  we  have  calculated.  Thus,  it  appears  from 
the  data  (without  our  doing  any  mathematical 
analysis)  that,  regarding  survival,  the  treat¬ 
ment  is  more  effective  than  the  placebo. 

As  an  alternative  to  the  simple  averages  that 
we  have  computed  for  each  group,  another 
descriptive  measure  of  each  group  is  the 
average  hazard  rate,  denoted  as  h  "bar.”  This 
rate  is  defined  by  dividing  the  total  number 
of  failures  by  the  sum  of  the  observed  survival 
times.  For  group  1,  h  “bar”  is  9/359,  which 
equals  .025.  For  group  2,  h  “bar”  is  21/182, 
which  equals.  115. 
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As  previously  described,  the  hazard  rate  indi¬ 
cates  failure  potential  rather  than  survival 
probability.  Thus,  the  higher  the  average  haz¬ 
ard  rate,  the  lower  is  the  group’s  probability  of 
surviving. 

In  our  example,  the  average  hazard  for  the 
treatment  group  is  smaller  than  the  average 
hazard  for  the  placebo  group. 


Placebo  hazard  >  treatment  haz¬ 
ard:  suggests  that  treatment  is 
more  effective  than  placebo 


Thus,  using  average  hazard  rates,  we  again  see 
that  the  treatment  group  appears  to  be  doing 
better  overall  than  the  placebo  group;  that  is, 
the  treatment  group  is  less  prone  to  fail  than 
the  placebo  group. 


Descriptive  measures  (T  and  h)  give 
overall  comparison;  they  do  not 
give  comparison  over  time. 


The  descriptive  measures  we  have  used  so 
far — the  ordinary  average  and  the  hazard  rate 
average — provide  overall  comparisons  of  the 
treatment  group  with  the  placebo  group. 
These  measures  don’t  compare  the  two  groups 
at  different  points  in  time  of  follow-up.  Such  a 
comparison  is  provided  by  a  graph  of  survivor 
curves. 


EXAMPLE 


Here  we  present  the  estimated  survivor 
curves  for  the  treatment  and  placebo  groups. 
The  method  used  to  get  these  curves  is  called 
the  Kaplan-Meier  method,  which  is  described 
in  Chap.  2.  When  estimated,  these  curves  are 
actually  step  functions  that  allow  us  to  com¬ 
pare  the  treatment  and  placebo  groups  over 
time.  The  graph  shows  that  the  survivor  func¬ 
tion  for  the  treatment  group  consistently  lies 
above  that  for  the  placebo  group;  this  dif¬ 
ference  indicates  that  the  treatment  appears 
effective  at  all  points  of  follow-up.  Notice,  how¬ 
ever,  that  the  two  functions  are  somewhat 
closer  together  in  the  first  few  weeks  of  fol¬ 
low-up,  but  thereafter  are  quite  spread  apart. 
This  widening  gap  suggests  that  the  treatment 
is  more  effective  later  during  follow-up  than 
it  is  early  on. 
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Median  (treatment)  =  23  weeks 
Median  (placebo)  =  8  weeks 


Also  notice  from  the  graph  that  one  can  obtain 
estimates  of  the  median  survival  time,  the  time 
at  which  the  survival  probability  is. 5  for  each 
group.  Graphically,  the  median  is  obtained  by 
proceeding  horizontally  from  the  0.5  point  on 
the  Y-axis  until  the  survivor  curve  is  reached, 
as  marked  by  an  arrow,  and  then  proceeding 
vertically  downward  until  the  X-axis  is  crossed 
at  the  median  survival  time. 

For  the  treatment  group,  the  median  is  23  weeks; 
for  the  placebo  group,  the  median  is  8  weeks. 
Comparison  of  the  two  medians  reinforces  our 
previous  observation  that  the  treatment  is  more 
effective  overall  than  the  placebo. 


VIII.  Example:  Extended 
Remission  Data 


Group  1  Group  2 


t  (weeks)  log  WBC  t  (weeks)  log  WBC 


6 

2.31 

1 

2.80 

6 

4.06 

1 

5.00 

6 

3.28 

2 

4.91 

7 

4.43 

2 

4.48 

10 

2.96 

3 

4.01 

13 

2.88 

4 

4.36 

16 

3.60 

4 

2.42 

22 

2.32 

5 

3.49 

23 

2.57 

5 

3.97 

6+ 

3.20 

8 

3.52 

9+ 

2.80 

8 

3.05 

10+ 

2.70 

8 

2.32 

11  + 

2.60 

8 

3.26 

17+ 

2.16 

11 

3.49 

19+ 

2.05 

11 

2.12 

20+ 

2.01 

12 

1.50 

25+ 

1.78 

12 

3.06 

32+ 

2.20 

15 

2.30 

32+ 

2.53 

17 

2.95 

34+ 

1.47 

22 

2.73 

35+ 

1.45 

23 

1.97 

Before  proceeding  to  another  data  set,  we  con¬ 
sider  the  remission  example  data  (Freireich 
et  al.,  Blood,  1963)  in  an  extended  form.  The 
table  at  the  left  gives  the  remission  survival 
times  for  the  two  groups  with  additional  infor¬ 
mation  about  white  blood  cell  count  for  each 
person  studied.  In  particular,  each  person’s 
log  white  blood  cell  count  is  given  next  to 
that  person's  survival  time.  The  epidemiologic 
reason  for  adding  log  WBC  to  the  data  set 
is  that  this  variable  is  usually  considered  an 
important  predictor  of  survival  in  leukemia 
patients;  the  higher  the  WBC,  the  worse  the 
prognosis.  Thus,  any  comparison  of  the 
effects  of  two  treatment  groups  needs  to  con¬ 
sider  the  possible  confounding  effect  of  such 
a  variable. 
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EXAMPLE:  CONFOUNDING 


Treatment  group:  logWBC  =  1.8 
Placebo  group:  logWBC  =  4. 1 
Indicates  confounding  of  treatment 
effect  by  log  WBC 


Frequency 

distribution 


Need  to  adjust  for  imbalance  in  the 
distribution  of  log  WBC 


Although  a  full  exposition  of  the  nature  of 
confounding  is  not  intended  here,  we  provide 
a  simple  scenario  to  give  you  the  basic  idea. 
Suppose  all  of  the  subjects  in  the  treatment 
group  had  very  low  log  WBC,  with  an  average, 
for  example,  of  1.8,  whereas  all  of  the  subjects 
in  the  placebo  group  had  very  high  log  WBC, 
with  an  average  of  4.1.  We  would  have  to  con¬ 
clude  that  the  results  we’ve  seen  so  far  that 
compare  treatment  with  placebo  groups  may 
be  misleading. 

The  additional  information  on  log  WBC  would 
suggest  that  the  treatment  group  is  surviving 
longer  simply  because  of  their  low  WBC  and 
not  because  of  the  efficacy  of  the  treatment 
itself.  In  this  case,  we  would  say  that  the  treat¬ 
ment  effect  is  confounded  by  the  effect  of 
log  WBC. 

More  typically,  the  distribution  of  log  WBC 
may  be  quite  different  in  the  treatment  group 
than  in  the  control  group.  We  have  illustrated 
one  extreme  in  the  graph  at  the  left.  Even 
though  such  an  extreme  is  not  likely,  and  is 
not  true  for  the  data  given  here,  the  point  is 
that  some  attempt  needs  to  be  made  to  adjust 
for  whatever  imbalance  there  is  in  the  distribu¬ 
tion  of  log  WBC.  However,  if  high  log  WBC 
count  was  a  consequence  of  the  treatment, 
then  white  blood  cell  count  should  not  be  con¬ 
trolled  for  in  the  analysis. 


EXAMPLE:  INTERACTION  1 

High  log  WBC  Low  log  WBC 

5(0 

Treatment  S(l) 

\\  Treatment 

V~\  ' 

\7 

/  \ 

Placebo^ 

/ b 

Placebo 

t 

t 

Treatment  by  log  WBC  interaction 

Another  issue  to  consider  regarding  the  effect 
of  log  WBC  is  interaction.  What  we  mean  by 
interaction  is  that  the  effect  of  the  treatment 
may  be  different,  depending  on  the  level  of  log 
WBC.  For  example,  suppose  that  for  persons 
with  high  log  WBC,  survival  probabilities  for 
the  treatment  are  consistently  higher  over  time 
than  for  the  placebo.  This  circumstance  is  illu¬ 
strated  by  the  first  graph  at  the  left.  In  contrast, 
the  second  graph,  which  considers  only  per¬ 
sons  with  low  log  WBC,  shows  no  difference 
in  treatment  and  placebo  effect  over  time. 
In  such  a  situation,  we  would  say  that  there  is 
strong  treatment  by  log  WBC  interaction, 
and  we  would  have  to  qualify  the  effect  of  the 
treatment  as  depending  on  the  level  of  log  WBC. 


32  1 .  Introduction  to  Survival  Analysis 


Need  to  consider: 

•  interaction; 

•  confounding. 


The  example  of  interaction  we  just  gave  is  but 
one  way  interaction  can  occur;  on  the  other 
hand,  interaction  may  not  occur  at  all.  As  with 
confounding,  it  is  beyond  our  scope  to  provide  a 
thorough  discussion  of  interaction.  In  any  case, 
the  assessment  of  interaction  is  something  to 
consider  in  one’s  analysis  in  addition  to  con¬ 
founding  that  involves  explanatory  variables. 


The  problem: 

Compare  two  groups  after  adjusting 
for  confounding  and  interaction. 


Thus,  with  our  extended  data  example,  the 
basic  problem  can  be  described  as  follows:  to 
compare  the  survival  experience  of  the  two 
groups  after  adjusting  for  the  possible  con¬ 
founding  and/or  interaction  effects  of  log  WBC. 


EXAMPLE  1 

Individual  t 

I. 

X, 

# 

(weeks) 

d 

(Group)  (log  WBC) 

ft 

6 

1 

1 

2.31 

2 

6 

1 

1 

4.06 

3 

6 

1 

1 

3.28 

4 

7 

1 

1 

4.43 

5 

10 

1 

1 

2.96 

6 

13 

1 

1 

2.88 

7 

16 

1 

1 

3.60 

8 

22 

1 

1 

2.32 

9 

23 

1 

1 

2.57 

Group  1 

1  1 

10 

11 

6 

9 

0 

0 

1 

1 

3.20 

2.80 

12 

10 

0 

1 

2.70 

13 

11 

0 

1 

2.60 

14 

17 

0 

1 

2.16 

15 

19 

0 

1 

2.05 

16 

20 

0 

1 

2.01 

17 

25 

0 

1 

1.78 

18 

32 

0 

1 

2.20 

19 

32 

0 

1 

2.53 

20 

34 

0 

1 

1.47 

c 

35 

0 

1 

1.45 

(22 

1 

1 

0 

2.80 

23 

1 

1 

0 

5.00 

24 

2 

1 

0 

4.91 

25 

2 

1 

0 

4.48 

26 

3 

1 

0 

4.01 

27 

4 

1 

0 

4.36 

28 

4 

1 

0 

2.42 

29 

5 

1 

0 

3.49 

30 

5 

1 

0 

3.97 

Group  / 

2  1 

31 

32 

8 

8 

1 

1 

0 

0 

3.52 

3.05 

33 

8 

1 

0 

2.32 

34 

8 

1 

0 

3.26 

35 

11 

1 

0 

3.49 

36 

11 

1 

0 

2.12 

37 

12 

1 

0 

1.50 

38 

12 

1 

0 

3.06 

39 

15 

1 

0 

2.30 

40 

17 

1 

0 

2.95 

41 

22 

1 

0 

2.73 

li2 

23 

1 

0 

1.97 

The  problem  statement  tells  us  that  we  are  now 
considering  two  explanatory  variables  in  our 
extended  example,  whereas  we  previously  con¬ 
sidered  a  single  variable,  group  status.  The 
data  layout  for  the  computer  needs  to  reflect 
the  addition  of  the  second  variable,  log  WBC. 
The  extended  table  in  computer  layout  form  is 
given  at  the  left.  Notice  that  we  have  labeled  the 
two  explanatory  variables  X\  (for  group  status) 
and  X2  (for  log  WBC).  The  variable  is  our 
primary  study  or  exposure  variable  of  interest 
here,  and  the  variable  X2  is  an  extraneous  vari¬ 
able  that  we  are  interested  in  accounting  for 
because  of  either  confounding  or  interaction. 
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Analysis  alternatives: 

•  stratify  on  log  WBC; 

•  use  math  modeling,  e.g., 
proportional  hazards  model. 


As  implied  by  our  extended  example,  which 
considers  the  possible  confounding  or  inter¬ 
action  effect  of  log  WBC,  we  need  to  consider 
methods  for  adjusting  for  log  WBC  and/or 
assessing  its  effect  in  addition  to  assessing  the 
effect  of  treatment  group.  The  two  most  popu¬ 
lar  alternatives  for  analysis  are  the  following: 


•  to  stratify  on  log  WBC  and  compare 
survival  curves  for  different  strata;  or 

•  to  use  mathematical  modeling  procedures 
such  as  the  proportional  hazards  or  other 
survival  models;  such  methods  will  be 
described  in  subsequent  chapters. 


IX.  Multivariable  Example  We  now  consider  one  other  example.  Our 

purpose  here  is  to  describe  a  more  general 
type  of  multivariable  survival  analysis  prob¬ 
lem.  The  reader  may  see  the  analogy  of  this 
example  to  multiple  regression  or  even  logistic 
regression  data  problems. 


We  consider  a  data  set  developed  from  a 
13-year  follow  up  study  of  a  fixed  cohort  of 
persons  in  Evans  County  Georgia,  during  the 
period  1967-1980  (Schoenbach  et  al.,  Amer.  J. 
Epid.,  1986).  From  this  data  set,  we  focus  on  a 
portion  containing  n  =  1 70  white  males  who  are 
age  60  or  older  at  the  start  of  follow-up  in  1967. 

For  this  data  set,  the  outcome  variable  is  T,  time 
in  years  until  death  from  start  of  follow-up, 
so  the  event  of  interest  is  death.  Several  explan¬ 
atory  variables  are  measured,  one  of  which  is 
considered  the  primary  exposure  variable;  the 
other  variables  are  considered  as  potential  con- 
founders  and/or  interaction  variables. 

The  primary  exposure  variable  is  a  measure 
called  Social  Network  Index  (SNI).  This  is  an 
ordinal  variable  derived  from  questionnaire 
measurement  and  is  designed  to  assess  the 
extent  to  which  a  study  subject  has  social  con¬ 
tacts  of  various  types.  With  the  questionnaire, 
a  scale  is  used  with  values  ranging  from  0 
(absence  of  any  social  network)  to  5  (excellent 
social  network). 


•  Describes  general 
multivariable  survival  problem. 

•  Gives  analogy  to  regression 
problems. 


EXAMPLE 


13-year  follow-up  of  fixed  cohort  from 
Evans  County,  Georgia 

n  =  170  white  males  (60+) 

T  =  years  until  death 

Event  =  death 

Explanatory  variables: 

•  exposure  variable 

•  confounders 

•  interaction  variables 

Exposure : 

Social  Network  Index  (SNI) 


0  1  2  3  4  5 

Absence  Excellent 

of  social  social 

network  network 
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EXAMPLE:  (continued) 


Study  goal:  to  determine  whether  SNI 
is  protective  against  death, 
i.e.,  SNI  =>  S(0  A 


Explanatory  variables : 


SNI 

AGE 

SBP 

CHR 

QUET 

SOCL 


Exposure  variable 


Potential  confounders/ 
interaction  variables 


Note:  QUET 


Weight 

(height) 


x  100 


The  problem: 

To  describe  the  relationship  between 
SNI  and  time  to  death,  after 
controlling  for  AGE,  SBP,  CHR, 
QUET,  and  SOCL. 


Goals: 

•  Measure  of  effect  (adjusted) 

•  Survivor  curves  for  different  SNI 
categories  (adjusted) 

•  Decide  on  variables  to  be 
adjusted;  determine  method  of 
adjustment 


The  study’s  goal  is  to  determine  whether  one’s 
social  network,  as  measured  by  SNI,  is  protec¬ 
tive  against  death.  If  this  study  hypothesis  is 
correct,  then  the  higher  the  social  network 
score,  the  longer  will  be  one’s  survival  time. 

In  evaluating  this  problem,  several  explanatory 
variables,  in  addition  to  SNI,  are  measured  at 
the  start  of  follow-up.  These  include  AGE,  sys¬ 
tolic  blood  pressure  (SBP),  an  indicator  of  the 
presence  or  absence  of  some  chronic  disease 
(CHR),  body  size  as  measured  by  Quetelet’s 
index  (QUET  =  weight  over  height  squared 
times  100),  and  social  class  (SOCL). 

These  five  additional  variables  are  of  interest 
because  they  are  thought  to  have  their  own 
special  or  collective  influence  on  how  long  a 
person  will  survive.  Consequently,  these  vari¬ 
ables  are  viewed  as  potential  confounders  and/ 
or  interaction  variables  in  evaluating  the  effect 
of  social  network  on  time  to  death. 

We  can  now  clearly  state  the  problem  being 
addressed  by  this  study:  To  describe  the  rela¬ 
tionship  between  SNI  and  time  to  death, 
controlling  for  AGE,  SBP,  CHR,  QUET,  and 
SOCL. 

Our  goals  in  using  survival  analysis  to  solve 
this  problem  are  as  follows: 

•  to  obtain  some  measure  of  effect  that  will 
describe  the  relationship  between  SNI  and 
time  until  death,  after  adjusting  for  the 
other  variables  we  have  identified; 

•  to  develop  survival  curves  that  describe  the 
probability  of  survival  over  time  for  differ¬ 
ent  categories  of  social  networks;  in  partic¬ 
ular,  we  wish  to  compare  the  survival  of 
persons  with  excellent  networks  to  the  sur¬ 
vival  of  persons  with  poor  networks.  Such 
survival  curves  need  to  be  adjusted  for  the 
effects  of  other  variables. 

•  to  achieve  these  goals,  two  intermediary 
goals  are  to  decide  which  of  the  additional 
variables  being  considered  need  to  be 
adjusted  and  to  determine  an  appropriate 
method  of  adjustment. 
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The  computer  data  layout  for  this  problem  is 
given  below.  The  first  column  lists  the  170  indi¬ 
viduals  in  the  data  set.  The  second  column  lists 
the  survival  times,  and  the  third  column  lists 
failure  or  censored  status.  The  remainder  of 
the  columns  list  the  6  explanatory  variables  of 
interest,  starting  with  the  exposure  variable 
SNI  and  continuing  with  the  variables  to  be 
accounted  for  in  the  analysis. 

Computer  layout:  13-year  follow-up  study  (1967-1980)  of  a  fixed  cohort  of  n  =  170 

white  males  (60+)  from  Evans  County,  Georgia 
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X.  Math  Models  in 
Survival  Analysis 


General  framework 


Controlling  for  C i,  C2,  . . .  Cp. 
SNI  study: 

E  =  SNI  =>  D  =  survival  time 

Controlling  for  AGE,  SBP,  CHR, 
QUET,  and  SOCL 


It  is  beyond  the  scope  of  this  presentation  to 
provide  specific  details  of  the  survival  analysis 
of  these  data.  Nevertheless,  the  problem 
addressed  by  these  data  is  closely  analogous 
to  the  typical  multivariable  problem  addressed 
by  linear  and  logistic  regression  modeling. 
Regardless  of  which  modeling  approach  is  cho¬ 
sen,  the  typical  problem  concerns  describing 
the  relationship  between  an  exposure  variable 
(e.g.,  E )  and  an  outcome  variable  (e.g.,  D)  after 
controlling  for  the  possible  confounding  and 
interaction  effects  of  additional  variables  (e.g., 
Ci,  C2,  and  so  on  up  to  Cp).  In  our  survival 
analysis  example,  E  is  the  social  network  vari¬ 
able  SNI,  D  is  the  survival  time  variable,  and 
there  ar ep  =  5  C  variables,  namely,  AGE,  SBP, 
CHR,  QUET,  and  SOCL. 
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Model  Outcome 


Survival  analysis 

Linear  regression 
Logistic  regression 


Time  to  event 
(with  censoring) 

Continuous  (SBP) 

Dichotomous 
(CHD  yes/no) 


Nevertheless,  an  important  distinction  among 
modeling  methods  is  the  type  of  outcome  vari¬ 
able  being  used.  In  survival  analysis,  the  out¬ 
come  variable  is  “time  to  an  event,”  and  there 
may  be  censored  data.  In  linear  regression 
modeling,  the  outcome  variable  is  generally 
a  continuous  variable,  like  blood  pressure.  In 
logistic  modeling,  the  outcome  variable  is  a 
dichotomous  variable,  like  CHD  status,  yes  or 
no.  And  with  linear  or  logistic  modeling,  we 
usually  do  not  have  information  on  follow-up 
time  available. 


As  with  linear  and  logistic  modeling,  one  statis¬ 
tical  goal  of  a  survival  analysis  is  to  obtain  some 
measure  of  effect  that  describes  the  exposur¬ 
e-outcome  relationship  adjusted  for  relevant 
extraneous  variables. 


Measure  of  effect: 

Linear  regression: 

regression  coefficient  p 

Logistic  regression: 
odds  ratio  e|! 


Survival  analysis: 
hazard  ratio  e1’’ 


In  linear  regression  modeling,  the  measure  of 
effect  is  usually  some  regression  coefficient  p. 

In  logistic  modeling,  the  measure  of  effect  is  an 
odds  ratio  expressed  in  terms  of  an  exponential 
of  one  or  more  regression  coefficients  in  the 
model,  for  example,  e  to  the  p. 

In  survival  analysis,  the  measure  of  effect  typi¬ 
cally  obtained  is  called  a  hazard  ratio;  as  with 
the  logistic  model,  this  hazard  ratio  is 
expressed  in  terms  of  an  exponential  of  one  or 
more  regression  coefficients  in  the  model. 


EXAMPLE 


SNI  study:  hazard  ratio  (HR) 
describes  relationship  between  SNI 
and  T,  after  controlling  for  covariates. 


Thus,  from  the  example  of  survival  analysis 
modeling  of  the  social  network  data,  one  may 
obtain  a  hazard  ratio  that  describes  the  rela¬ 
tionship  between  SNI  and  survival  time  ( T ), 
after  controlling  for  the  appropriate  co variates. 
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Interpretation  of  HR  (like  OR): 

HR  =  1  =>■  no  relationship 

HR  =  1 0  =>■  exposed  hazard  1 0 
times  unexposed 

HR  =  1/10  =>  exposed  hazard  1/10 
times  unexposed 


The  hazard  ratio,  although  a  different  measure 
from  an  odds  ratio,  nevertheless  has  a  similar 
interpretation  of  the  strength  of  the  effect. 
A  hazard  ratio  of  1,  like  an  odds  ratio  of  1, 
means  that  there  is  no  effect;  that  is,  1  is  the 
null  value  for  the  exposure-outcome  relation¬ 
ship.  A  hazard  ratio  of  10,  on  the  other  hand, 
is  interpreted  like  an  odds  ratio  of  10;  that  is, 
the  exposed  group  has  ten  times  the  hazard  of 
the  unexposed  group.  Similarly,  a  hazard  ratio 
of  1/10  implies  that  the  exposed  group  has  one- 
tenth  the  hazard  of  the  unexposed  group. 


XI.  Censoring 
Assumptions 

Three  assumptions  about  censoring: 

Independent  (vs.  non-independent) 
censoring 

Random  (vs.  non-random) 
censoring 

Non-inf ormative  (vs.  informative) 
censoring 


There  are  three  assumptions  about  censoring 
often  considered  for  survival  data:  inde¬ 
pendent  censoring,  random  censoring,  and 
non-informative  censoring.  Although  these 
assumptions  have  similarities,  they  are  also 
somewhat  different  and  are  often  confused  in 
the  textbook  and  published  literature  as  being 
interchangeable  synonyms. 


Mathematic  definitions  have  been  Mathematical  definitions  of  independent  (vs. 
provided  elsewhere.  nonindependent),  random  (vs.  nonrandom), 

and  non-informative  (vs.  informative)  censor¬ 
ing  have  been  given  elsewhere  (Kalbfleisch 
and  Prentice,  1980;  Klein  and  Moeschberger, 
2003).  Here,  however,  we  prefer  to  provide 
more  intuitive  definitions  and  examples. 


Independent  (vs.  non-independent) 
censoring 

•  most  useful 

•  affects  validity 

Random  (vs.  non-random) 
censoring 

•  more  restrictive  than 
independent, 

i.e.,  random  =>■  indep, 
whereas  indep  ^  random. 


The  assumption  of  independent  censoring  is 
the  most  useful  of  the  three  types  for  drawing 
correct  inferences  that  compare  the  survival 
experience  of  two  or  more  groups  (e.g.,  treat¬ 
ment  vs.  placebo).  In  particular,  the  presence 
of  non-independent  censoring  typically  affects 
the  validity  of  one’s  estimated  effect.  Random 
censoring  is  a  stronger  assumption  and  more 
restrictive  than  independent  censoring. 


03  > 


38  1 .  Introduction  to  Survival  Analysis 


To  be  more  specific,  random  censoring  essen¬ 
tially  means  that  subjects  who  are  censored  at 
time  t  should  be  representative  of  all  the  study 
subjects  who  remained  at  risk  at  time  t  with 
respect  to  their  survival  experience.  In  other 
words,  the  failure  rate  for  subjects  who  are 
censored  is  assumed  to  be  equal  to  the  failure 
rate  for  subjects  who  remained  in  the  risk  set 
who  are  not  censored. 

Independent  censoring:  Independent  censoring  essentially  means 

that  within  any  subgroup  of  interest,  the  subjects 
Failure  rate  who  are  censored  at  time  t  should  be  representa- 

Subgrp  Censored  Not  censored  tive  of  all  the  subjects  in  that  subgroup  who 
hA  Cc(t)  =  hA  NCc(t)  remained  at  risk  at  time  t  with  respect  to  their 

b BCe(t)  =  ha  NCe(t)  survival  experience.  In  other  words,  censoring 

is  independent  provided  that  it  is  random 
within  any  subgroup  of  interest.  We  illustrate 
these  ideas  with  an  example. 

Suppose  that  we  are  interested  in  estimating 
the  3-year  survival  (for  some  disease)  among 
those  in  Group  A.  We  follow  100  individuals 
intially  disease  free  for  3  years.  Over  the  3-year 
period,  20  contract  disease.  We  estimate  the 
3-year  risk  of  disease  for  those  in  Group  A  to 
be  0.20  and  the  3-year  survival  to  be  0.80  (since 
80  of  100  survived). 

Now  suppose  we  wish  to  continue  the  study  for 
another  2  years  in  order  to  estimate  the  5-year 
survival  for  Group  A.  We  want  to  continue  the 
following  for  the  80  individuals  who  partici¬ 
pated  in  the  study  and  survived  for  the  first 
3  years.  However,  half  or  40  of  those  80  indivi¬ 
duals  refused  to  continue  in  the  study  and 
were  therefore  lost  to  follow-up  (censored).  Of 
the  other  40  individuals  who  remained  in  the 
study,  5  individuals  contract  the  disease.  With 
this  information,  what  is  the  estimate  of  the 
5 -year  survival  for  Group  A  and  under  what 
assumptions? 


EXAMPLE 

Group  A 

Time  #  at  risk  #  events 

#  survived 

0-3  yrs  100  20 

80 

3-yr  risk  =  20/100  =  0.20 

3-yr  survival  =  80/100  =0.80 

Time  #  at  risk  #  events 

#  survived 

0-3  100  20 

80 

40  leave  study 
3-5  40  5 

35 

5-year  survival? 

Random  Censoring: 

Failure  rate 

Censored  Not  censored 

hce(t)  =  hNCe(t) 
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EXAMPLE:  (continued) 


What  happened  to  40  individuals  who 
were  censored  at  3  years?  Don’t  know 

Assuming  indep  and  random  censoring: 

40  at  risk  at  time  5 
similar  to 

40  censored  at  time  3 
i.e., 

expect  5  events  from  40  censored  at  time  3 
since  5  events  from  40  at  risk 


Estimated  #  of  cases  over  5  years: 

20  +  5  +5 

first  3  years  next  2  years  censored  cases 

=  30  estimated  cases  from  original 
100  over  5  years 

Estimated  5-year  survival  =  70/100  =  0.70 


The  idea: 

Assume  survival  experience  of  sub¬ 
jects  censored  at  t  is  as  expected  if 
randomly  selected  from  subjects 
who  are  at  risk  at  t. 


So  far,  there  is  no  distinction 
between  independent  and  random 
censoring. 

Reason:  Only  considering  one  group 


If  we  know  what  happened  to  the  40  indivi¬ 
duals  who  were  censored  at  the  3-year  mark, 
then  we  could  sum  the  total  number  of  events 
and  the  total  number  of  individuals  who  sur¬ 
vived  (out  of  the  original  100  at  risk).  Under  an 
assumption  of  independent  and  random 
censoring,  we  assume  that  the  40  indi¬ 
viduals  who  were  censored  were  similar  to 
the  40  who  remained  at  risk  with  respect 
to  their  survival  experience.  Since  5  of  the 
40  who  remained  in  the  study  after  3  years 
contracted  disease  over  the  next  2  years,  we 
estimate  that  5  of  the  40  who  were  censored 
also  contracted  the  disease  over  the  same  time 
period,  even  though  their  disease  experience 
was  unobserved. 

So  over  the  course  of  the  5  years:  20  contracted 
disease  in  the  first  3  years,  5  were  observed  to 
get  disease  after  3  years,  and  5  of  the  censored 
individuals  were  estimated  to  have  contracted 
disease.  This  yields  20  +  5  +  5  =  30  who  are 
estimated  to  have  contract  disease  leaving  70  of 
the  original  100  who  have  survived  over  the 
5-year  period.  The  estimated  5-year  survival 
among  Group  A  is  0.70  under  the  assumptions 
of  random  and  independent  censoring. 

The  idea  behind  independent  and  random 
censoring  is  that  it  is  as  if  the  subjects  censored 
at  time  t  were  randomly  selected  to  be  censored 
from  the  group  of  subjects  who  were  in  the  risk 
set  at  time  t.  Even  though  the  censored  subjects 
were  not  randomly  selected,  their  survival 
experience  would  be  expected  to  be  the  same 
as  if  they  had  been  randomly  selected  from  the 
risk  set  at  time  t. 

In  this  example,  there  is  no  distinction  made 
between  independent  censoring  and  random 
censoring.  The  reason  is  because  we  are  only 
considering  one  group  of  individuals  (i.e.,  there 
are  no  predictor  variables  considered).  The  dis¬ 
tinction  comes  if  we  consider  more  than  one 
group  for  comparison.  We  illustrate  this  dis¬ 
tinction  by  continuing  the  example. 
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EXAMPLE:  (continued) 


Group  B 

Time  #  at  risk  #  events  #  survived 
0-3  100  40  60 

10  leave  study 

3-5  50  10  40 

Failure  risk  from  3  to  5  yrs  =10/50  = 

0.20. 


Assuming  independent  censoring: 
expect  0.20  x  10  =2  cases 
from  10  censored  at  time  3 
Estimated  #  of  cases  over  5  years: 

40  40  2 

first  3  years  next  2  years  censored  cases 
=  52  estimated  cases  from  original 
100  over  5  years 

Estimated  5-year  survival  =  48/100  = 
0.48 


Groups  A  and  B  combined 

Time  #  at  risk  #  events  #  survived 
A  B  total  A  B  total  A  B  total 

0-3  100  100  200  20  40  60  80  60  140 

40  from  A  and  10  from  B  leave  study 

3-5  40  50  90  5  10  15  35  40  75 

(  pA(censored)  =  40/80  =  0.50  or 50% 

<  pB(censored)  =  10/60  =  0.17  or  17% 
l  pA(censored)  pB(censored) 

Group  A  Group  B 

5-yr  survival  0.70  0.48 

k 

Censoring  not  random 


We  extend  the  previous  example  to  include  100 
subjects  from  Group  B  who  are  disease  free  at 
the  start  of  follow-up.  The  goal  is  to  estimate 
their  5 -year  survival  and  compare  it  to  the 
5-year  survival  for  Group  A.  Suppose  over  the 
first  3-year  period,  40  of  the  100  individuals 
contract  disease.  Then,  of  the  60  who  survive 
the  first  3  years,  10  refuse  to  continue  in  the 
study  and  are  therefore  censored.  For  the  50 
who  remain  in  the  study,  10  individuals  con¬ 
tract  disease  by  the  5th  year  (10  of  50  =  20%). 

Under  independent  censoring,  we  estimate 
that  20%  or  2  of  the  10  censored  subjects  con¬ 
tract  disease  by  the  5th  year. 

So,  over  the  course  of  the  5  years  among  the 
original  100  in  Group  B:  40  contracted  disease 
in  the  first  3  years,  10  were  observed  to  get 
disease  after  3  years,  and  2  of  the  censored 
individuals  were  estimated  to  the  contracted 
disease.  This  yields  40  +  10  +  2  =  52  who  are 
estimated  to  have  contracted  disease,  leaving 
48  of  the  original  100  who  survived  over  the 
5-year  period.  The  estimated  5-year  survival 
among  Group  B  is  0.48  under  the  assumptions 
of  independent  censoring. 

Over  all,  combining  both  groups,  there  were 
200  originally  at  risk  of  whom  60  contracted 
disease  within  the  first  3  years  (20  from  Group 
A  and  40  from  Group  A)  leaving  140  who  have 
survived  for  the  first  3  years  (80  from  Group  A 
and  60  from  Group  B).  At  the  3-year  mark,  50 
subjects  were  censored  (40  in  Group  A  and  10 
in  Group  B). 

A  much  higher  proportion  of  censoring 
occurred  in  Group  A  (i.e.,  40/80  =  0.50)  than 
in  Group  B  (i.e.,  10/60  =  0.17). 

Moreover,  subjects  in  Group  A  had  a  higher 
survival  probability  than  those  in  Group  B. 


Therefore,  the  censoring  was  not  random. 
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EXAMPLE  (continued) 


Random  censoring 
within  Group  A  and  within  Group  B 

J) 

Independent  censoring 

(i.e.,  random  censoring  conditional  on 
covariates) 

Nevertheless, 

(overall)  random  censoring  not  met 


ALTERNATIVE  EXAMPLE 


Time  #  at  risk  #  events  #  survived 
A  B  total  A  B  total  A  B  total 

0-3  100  100  200  20  40  60  80  60  140 

40  from  A  and  30  from  B  leave  study 
3-5  40  30  70  5  10  15  35  20  55 

pA(censored)  =  40/80  =  0.50  or  50% 
pB(censored)  =  30/60  =  0.50  or  50% 

pA(censored)  =  pB(censored) 

11- 

Random  censoring  (overall) 


Non-informative  censoring  depends 
on 

•  distribution  of  time-to-event 

•  distribution  of  time-to- 
censorship 


Time-to-event  random  variable  (T): 

Distribution  of  survival  times 

assuming: 

•  no  loss-to-follow-up 

•  study  continues  until  all 
subjects  get  event 

Time-to-censorship  random  vari¬ 
able  (C): 

Distribution  of  censoring  times 

assuming: 

•  study  ends  before  all  subjects 
get  event 

•  censored  subjects  do  not  get 
event  prior  to  the  end  of  study 


However,  conditional  on  each  level  of  covari¬ 
ates  (conditional  on  group  status  in  this  exam¬ 
ple),  the  censoring  was  random.  Therefore,  the 
censoring  was  independent  because  indepen¬ 
dent  censoring  is  random  censoring  condi¬ 
tional  on  each  level  of  covariates. 

Nevertheless,  the  random  censoring  assump¬ 
tion  was  not  met  overall  because  the  censored 
individuals  were  not  representative  of  all  who 
remained  in  the  risk  set  at  time  t  with  respect  to 
the  rate  of  failure. 

If  instead  in  the  previous  example,  suppose  40 
of  80  from  Group  A  and  30  of  60  from  Group  B 
were  censored  at  the  3-year  mark,  as  shown  in 
the  table  on  the  left. 

Then  the  censoring  would  have  been  random 
because  an  equal  proportion  of  Group  A  and 
Group  B  would  have  been  censored  from  the 
risk  set  and  those  censored  would  be  represen¬ 
tative  of  those  who  remained  at  risk. 


We  next  consider  the  assumption  of  non- 
informative  censoring.  Whether  censoring  is 
non-informative  or  informative  depends  on 
two  distributions:  (1)  the  distribution  of  the 
time-to-event  random  variable  and  (2)  the  distri¬ 
bution  of  time-to-censorship  random  variable. 

We  can  conceptualize  the  distribution  for  the 
time-to-event  random  variable  by  considering 
the  distribution  of  survival  times  if  there  was 
no  loss  to  follow-up  and  the  study  did  not  end 
until  all  subjects  got  the  event. 


Similarly,  we  can  conceptualize  a  time-to- 
censorship  random  variable  by  considering 
the  distribution  of  censoring  times  for  those 
subjects  who  would  not  have  gotten  the  event 
if  the  study  ended  before  all  subjects  got  the 
event. 
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Non-informative  censoring: 

no  information 

T  distribution  4$  C  distribution 

Note:  must  still  need  to  know  which 
subjects  are  censored  or  not  cen¬ 
sored. 


Non-informative  censoring  occurs  if  the  dis¬ 
tribution  of  survival  times  (T)  provides  no 
information  about  the  distribution  of  censor¬ 
ship  times  (C),  and  vice  versa.  Otherwise,  the 
censoring  is  informative.  Note,  however,  that 
the  data  must  still  identify  which  subjects  are 
or  are  not  censored. 


Non-informative 

Independent 

Random 


•  Often  all 
justifiable 
together 

•  Not  all 
equivalent 


The  assumption  of  non-informative  censoring 
is  often  justifiable  when  censoring  is  indepen¬ 
dent  and/or  random;  nevertheless,  these  three 
assumptions  are  not  equivalent. 


EXAMPLE:  Independent  and  random 
but  informative 
censoring 


Subject  A  gets  event 
(1 

Subject  B  (randomly  selected) 
gets  event,  e.g.,  family  member 
of  Subject  A  leaves  study 

Assume:  censored  subjects  represent 
subjects  at  risk  at  any  time 

Then 

•  independent  and  random 
censoring 

•  informative  censoring  since 
T  =>  C.  (i.e.,  T  distribution 
specifies  C  distribution) 


To  illustrate  how  independent  censoring  could 
be  different  from  non-informative  censoring, 
we  describe  an  artificial  example  where  the 
censoring  is  informative  but  also  random  and 
independent. 

Suppose  every  time  an  individual  gets  an  event, 
another  individual  in  the  study  is  randomly 
selected  to  leave  the  study,  e.g.,  after  an  event, 
a  family  member  decides  to  leave  the  study. 
If  those  censored  were  representative  of  those 
who  remained  in  the  risk  set,  then  the  cen¬ 
soring  would  be  random  and  independent. 
However,  the  censoring  would  also  be  infor¬ 
mative  as  the  censoring  mechanism  would 
be  related  to  the  time-to-event  distribution 
(since  events  cause  censorships).  In  fact,  if  this 
was  the  only  mechanism  in  which  individuals 
were  censored,  the  distribution  of  survival 
times  would  completely  specify  the  distribu¬ 
tion  of  censoring  times  (highly  informative). 
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To  see  how  bias  can  occur  if  the  censoring  is 
not  independent,  consider  a  drug  study  in 
which  some  individuals  are  censored  from  the 
study  due  to  the  occurrence  of  some  side 
effects.  It  may  be  that  the  unobserved  survival 
experience  among  those  who  are  censored  due 
to  a  drug  side  effect  is  not  representative  of 
those  who  remained  in  the  study.  If  those 
with  a  side  effect  are  more  vulnerable  to  the 
health  outcome,  then  we  would  likely  overesti¬ 
mate  their  survival  with  an  assumption  of  inde¬ 
pendent  censoring. 

Independent  censoring  most  Many  of  the  analytic  techniques  discussed  in 

relevant:  affects  validity  the  chapters  that  follow  Kaplan-Meier  survival 

estimation,  the  log  rank  test,  and  the  Cox 
model,  rely  on  an  assumption  of  independent 
censoring  for  valid  inference  in  the  presence  of 
right-censored  data. 


EXAMPLE:  Not  independent 
censoring 


•  Drug  side  effect  causes  censoring 

•  Censored  subjects  not 
representative  of  subjects  still  at 
risk 

•  Censored  subjects  more 
vulnerable  than  subjects  still  at 
risk 

k 

Assuming  independent  censoring 

would  overestimate  survival 


Chapters  This  presentation  is  now  complete.  We  suggest 

that  you  review  the  material  covered  here  by 
y  (  \  .  Introduction  )  reading  the  detailed  outline  that  follows.  Then 

v y  do  the  practice  exercises  and  test. 

2.  Kaplan-Meier  Survival 

Curves  and  the  Log-Rank  [n  chap.  2  we  describe  how  to  estimate  and 
* est  graph  survival  curves  using  the  Kaplan-Meier 

(KM)  method.  We  also  describe  how  to  test 
whether  two  or  more  survival  curves  are  esti¬ 


mating  a  common  curve.  The  most  popular 
such  test  is  called  the  log-rank  test. 
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Detailed  I.  What  is  survival  analysis?  (pages  4-5) 

Outline  A.  Type  of  problem  addressed:  outcome  variable  is 

time  until  an  event  occurs. 

B.  Assume  one  event  of  interest;  more  than  one 
type  of  event  implies  a  competing  risk  problem. 

C.  Terminology:  time  =  survival  time;  event  = 
failure. 

D.  Examples  of  survival  analysis: 

i.  leukemia  patients/time  in  remission 

ii.  disease-free  cohort/time  until  heart  disease 

iii.  elderly  population/time  until  death 

iv.  parolees/time  until  rearrest  (recidivism) 

v.  heart  transplants/time  until  death 
II.  Censored  data  (pages  5-8) 

A.  Definition:  don’t  know  survival  time  exactly. 

B.  Typical  reasons:  study  ends,  loss  to  follow-up, 
withdrawal  from  study. 

C.  Example  that  illustrates  (right-)  censoring. 

D.  Right-censoring:  true  survival  time  is  equal  to 
or  greater  than  observed  survival  times. 

E.  Left-censoring:  true  survival  time  is  less  than  or 
equal  to  observed  survival  time 

F.  Interval-censoring:  true  survival  time  is  within  a 
known  time  interval  (tl,  t2) 

G.  Interval-censoring  incorporates  right-  and  left- 
censoring  as  special  cases,  i.e., 
right-censoring  =>•  ti  =lower  bound,  t2=oo; 
left-censoring  =>•  ti=0,  t2=upper  bound. 

III.  Terminology  and  notation  (pages  9-15) 

A.  Notation:  T  =  survival  time  random  variable: 

t  =  specific  value  for  T 
d  =  (0-1)  variable  for  failure/censor¬ 
ship  status 

B.  Terminology:  S(t)  =  survivor  function 

h(t)  =  hazard  function 

C.  Properties  of  survivor  function: 

•  theoretically,  graph  is  smooth  curve, 
decreasing  from  S(t)  =  1  at  time  t  =  0  to 
S(t)  =  0  at  t  =  oo; 

•  in  practice,  graph  is  step  function  that  may 
not  go  all  the  way  to  zero  at  end  of  study  if 
not  everyone  studied  gets  the  event. 
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D.  Hazard  function  formula: 


hit)  =  lim 

Af-*0 


P(t  <T<t  +  At\T  >  t) 
At 


E.  Hazard  function  properties: 

•  hit)  gives  instantaneous  potential  for  event 
to  occur  given  survival  up  to  time  l; 

•  instantaneous  potential  idea  is  illustrated  by 
velocity; 

•  hazard  function  also  called  "conditional 
failure  rate”; 

•  h(t)  >  0;  has  no  upper  bound;  not  a  proba¬ 
bility;  depends  on  time  units. 

F.  Examples  of  hazard  curves: 

i.  exponential 

ii.  increasing  Weibull 

iii.  decreasing  Weibull 

iv.  log  normal 

G.  Uses  of  hazard  function: 

•  gives  insight  about  conditional  failure  rates; 

•  identifies  specific  model  form; 

•  math  model  for  survival  analysis  is  usually 
written  in  terms  of  hazard  function. 

H.  Relationship  of  S(f)  to  h(t):  if  you  know  one, 

you  can  determine  the  other: 

•  example:  hit)  =  /.  if  and  only  if  Sit)  =  e~Xt 

•  general  formulae: 


5(f)  =  exp 


h(u)du 


h{t) 


d  S{t)/dt 
.  S(t )  . 


IV.  Goals  of  survival  analysis  ( page  1 6) 

A.  Estimate  and  interpret  survivor  and/or  hazard 
functions. 

B.  Compare  survivor  and/or  hazard  functions. 

C.  Assess  the  relationship  of  explanatory  variables 
to  survival  time. 
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V.  Basic  data  layout  for  computer  (pages  16-23) 

A.  General  layout: 

#  t  d  X,  X2...XP 

1  t !  d,  I„  X12...Xlp 

2  t2  d2  X2  \  X22  ■  ■  ■  X2p 


i  ti  dj  Xj  ]  Xj2  . . .  Xj. 


7i  tn  d„  Xn  |  Xn  2 . . 


B.  Example:  Remission  time  data 

C.  Alternative  Data  Layout  for  Computer:  Couuting 
Process  (Start,  Stop)  Format 

•  U seful  for  more  complicated  survival  analysis : 

i.  Age-at-follow-up  as  time  scale  (Chapter  3) 

ii.  Time-dependent  variables  (Chapter  6) 

iii.  Recurrent  events  (Chapter  7) 

•  CP  data  layout 
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•  Simplest  CP  format:  1  dataline  subject 


i 

j 

dij 

kjo 

kjt 

|  Xiil  - 

'  Xijp 

1 

i 

dll 

!  0 

h 

1 

!  xm  • 

i 

■■  xllp 

i 

i 

dii 

!  o 

t; 

1  Xill 

1 

••  xilp 

n 

i 

dnl 

i  o 

L 

j  Xnl!  ■ 

Xnlp 

•  Example  from  Remission  Time  Dataset 

•  Example  from  Study  of  Recurrent  Bladder 
Cancer  Tumors  (Byar,  1980;  Wei,  Lin,  and 
Weisfeld,  1989) 

•  Computer  Appendix  gives  programming 
code 

VI.  Basic  data  layout  for  understanding  analysis 

(pages  23-28) 

A.  General  layout: 


Ordered  #  of  #  censored 

failure  times  failures  in  t(f+V))  Risk  set 
(ftp)  ( mf )  (qf)  R(t(f)) 


k  0)  =  o 

m0  =  0 

qo 

ho 

mi 

q  t 

R(k  p) 

k  2) 

m2 

qi 

R(k  2>) 

hk) 

mk 

qk 

R(t(k >) 

Note:  k  =  #  of  distinct  times  at  which  subjects  failed;  n  = 
#  of  subjects  (k  <  n)\  R(t </)),  the  risk  set,  is  the  set  of 
individuals  whose  survival  times  are  at  least  t or  larger. 


B.  Example:  Remission  time  data 
Group  1  (n  =  21,  9  failures,  k  =  7); 

Group  2  (n  =  21,  21  failures,  k  =12) 

C.  How  to  work  with  censored  data: 

Use  all  information  up  to  the  time  of  censor¬ 
ship;  don’t  throw  away  information. 


VII.  Descriptive  measures  of  survival  experience 

(pages  28-30) 

A.  Average  survival  time  (ignoring  censorship 
status): 


n 


n 


T  underestimates  the  true  average 
survival  time,  because  censored 
times  are  included  in  the  formula. 
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B.  Average  hazard  rate: 

A  failures 

h  = - n - 

Eo 

!=1 

C.  Descriptive  measures  7  and  h  give  overall  com¬ 
parison;  estimated  survivor  curves  give  compar¬ 
ison  over  time. 

D.  Estimated  survivor  curves  are  step  function 
graphs. 

E.  Median  survival  time:  graphically,  proceed  hor¬ 
izontally  from  0.5  on  the  E-axis  until  reaching 
graph,  then  vertically  downward  until  reaching 
the  A- axis. 

VIII.  Example:  Extended  remission  data  (pages  30-33) 

A.  Extended  data  adds  log  WBC  to  previous  remis¬ 
sion  data. 

B.  Need  to  consider  confounding  and  interaction. 

C.  Extended  data  problem:  compare  survival  expe¬ 
rience  of  two  groups,  after  adjusting  for  con¬ 
founding  and  interaction  effects  of  log  WBC. 

D.  Analysis  alternatives: 

i.  stratify  on  log  WBC  and  compare  survival 
curves  for  different  strata; 

ii.  use  math  modeling,  e.g.,  proportional 
hazards  model. 

IX.  Multivariable  example  (pages  33-35) 

A.  The  problem:  to  describe  the  relationship 
between  social  network  index  (SNI)  and  time 
until  death,  controlling  for  AGE,  systolic  blood 
pressure  (SBP),  presence  or  absence  of  chronic 
disease  (CHR),  Quetelet’s  index  (QUET  -  a 
measure  of  body  size),  and  social  class  (SOCL). 

B.  Goals: 

•  to  obtain  an  adjusted  measure  of  effect; 

•  to  obtain  adjusted  survivor  curves  for  differ¬ 
ent  SNI  categories; 

•  to  decide  on  variables  to  be  adjusted. 

C.  The  data:  13-year  follow-up  study  (1967-1980) 
of  a  fixed  cohort  of  n  =  170  white  males  (60+) 
from  Evans  County,  Georgia. 
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X.  Math  models  in  survival  analysis  (pages  35-37) 

A.  Survival  analysis  problem  is  analogous  to  typi¬ 
cal  multivariable  problem  addressed  by  linear 
and/or  logistic  regression  modeling:  describe 
relationship  of  exposure  to  outcome,  after 
accounting  for  possible  confounding  and  inter¬ 
action. 

B.  Outcome  variable  (time  to  event)  for  survival 
analysis  is  different  from  linear  (continuous) 
or  logistic  (dichotomous)  modeling. 

C.  Measure  of  effect  typically  used  in  survival  anal¬ 
ysis:  hazard  ratio  (HR). 

D.  Interpretation  of  HR:  like  OR.  SNI  study:  HR 
describes  relationship  between  SNI  and  T,  after 
controlling  for  covariates. 

XI.  Censoring  assumptions  (pages  37-43) 

A.  Three  different  assumptions  about  censoring: 

i.  Independent  (vs.  Non-independent)  censoring 
a.  most  useful-  concerns  validity  of 

estimated  effect 

ii.  Random  (vs.  Non-random)  censoring 

a.  more  restrictive  than  independent 
censoring 

iii.  Non-informative  (vs.  Informative)  censoring 
a.  typically  affects  efficiency  of  estimated 

effect 

B.  Examples. 
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Practice 

Exercises 


True  or  False  (Circle  T  or  F): 


T  F  1. 
T  F  2. 
T  F  3. 

T  F  4. 

T  F  5. 
T  F  6. 
T  F  7. 

T  F  8. 

T  F  9. 

T  F  10. 
T  F  11. 

T  F  12. 

T  F  13. 
T  F  14. 

T  F  15. 


In  a  survival  analysis,  the  outcome  variable  is 
dichotomous. 

In  a  survival  analysis,  the  event  is  usually 
described  by  a  (0,  1)  variable. 

If  the  study  ends  before  an  individual  has 
gotten  the  event,  then  his  or  her  survival  time 
is  censored. 

If,  for  a  given  individual,  the  event  occurs 
before  the  person  is  lost  to  follow-up  or  with¬ 
draws  from  the  study,  then  this  person’s  sur¬ 
vival  time  is  censored. 

S(t)  =  P(T  >t)  is  called  the  hazard  function. 
The  hazard  function  is  a  probability. 
Theoretically,  the  graph  of  a  survivor  function 
is  a  smooth  curve  that  decreases  from  S(f)  =  1 
at  t  =  0  to  S(t)  =  0  at  t  =  oo. 

The  survivor  function  at  time  t  gives  the  instan¬ 
taneous  potential  per  unit  time  for  a  failure  to 
occur,  given  survival  up  to  time  t. 

The  formula  for  a  hazard  function  involves  a 
conditional  probability  as  one  of  its  compo¬ 
nents. 

The  hazard  function  theoretically  has  no  upper 
bound. 

Mathematical  models  for  survival  analysis 
are  frequently  written  in  terms  of  a  hazard 
function. 

One  goal  of  a  survival  analysis  is  to  compare 
survivor  and/or  hazard  functions. 

Ordered  failure  times  are  censored  data. 
Censored  data  are  used  in  the  analysis  of  sur¬ 
vival  data  up  to  the  time  interval  of  censorship. 
A  typical  goal  of  a  survival  analysis  involving 
several  explanatory  variables  is  to  obtain  an 
adjusted  measure  of  effect. 
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16.  Given  the  following  survival  time  data  (in  weeks)  for 
n  =  15  subjects, 

1,  1,  1+,  1+,  1+,  2,  2,  2,  2+,  2+,  3,  3,  3+,  4+,  5+ 
where  +  denotes  censored  data,  complete  the  following 
table: 

hr>  mf  _ R(hf)) _ 

0  0  0  15  persons  survive  >  0  weeks 

1 
2 
3 


Also,  compute  the  average  survival  time  ( T )  and  the 
average  hazard  rate  (77)  using  the  raw  data  (ignoring  + 
signs  for  T). 

17.  Suppose  that  the  estimated  survivor  curve  for  the 
above  table  is  given  by  the  following  graph: 


t 


What  is  the  median  survival  time  for  this  cohort? 

Questions  18-20  consider  the  comparison  of  the  fol¬ 
lowing  two  survivor  curves: 


5(0 


18.  Which  group  has  a  better  survival  prognosis  before 
time  £*? 

19.  Which  group  has  a  better  survival  prognosis  after 
time  t*? 

20.  Which  group  has  a  longer  median  survival  time? 
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Jest  True  or  False  (Circle  T  or  F): 

T  F  1.  Survival  analysis  is  a  collection  of  statistical 
procedures  for  data  analysis  for  which  the  out¬ 
come  variable  is  time  until  an  event  occurs. 

T  F  2.  In  survival  analysis,  the  term  “event”  is  synon¬ 
ymous  with  “failure.” 

T  F  3 .  If  a  given  individual  is  lost  to  follow-up  or  with¬ 
draws  from  the  study  before  the  end  of  the 
study  without  the  event  occurring,  then  the 
survival  time  for  this  individual  is  said  to  be 
“censored.” 

T  F  4.  In  practice,  the  survivor  function  is  usually 
graphed  as  a  smooth  curve. 

T  F  5.  The  survivor  function  ranges  between  0  and  oo. 

T  F  6.  The  concept  of  instantaneous  potential  is  illu¬ 
strated  by  velocity. 

T  F  7.  A  hazard  rate  of  one  per  day  is  equivalent  to 
seven  per  week. 

T  F  8.  If  you  know  the  form  of  a  hazard  function,  then 
you  can  determine  the  corresponding  survivor 
curve,  and  vice  versa. 

T  F  9.  One  use  of  a  hazard  function  is  to  gain  insight 
about  conditional  failure  rates. 

T  F  10.  If  the  survival  curve  for  group  1  lies  completely 
above  the  survival  curve  for  group  2,  then  the 
median  survival  time  for  group  2  is  longer  than 
that  for  group  1 . 

T  F  11.  The  risk  set  at  6  weeks  is  the  set  of  individuals 
whose  survival  times  are  less  than  or  equal  to 
6  weeks. 

T  F  12.  If  the  risk  set  at  6  weeks  consists  of  22  persons, 
and  4  persons  fail  and  3  persons  are  censored 
by  the  7th  week,  then  the  risk  set  at  7  weeks 
consists  of  18  persons. 

T  F  13.  The  measure  of  effect  used  in  survival  analysis 
is  an  odds  ratio. 

T  F  14.  If  a  hazard  ratio  comparing  group  1  relative  to 
group  2  equals  10,  then  the  potential  for  failure 
is  ten  times  higher  in  group  1  than  in  group  2. 

T  F  15.  The  outcome  variable  used  in  a  survival  analy¬ 
sis  is  different  from  that  used  in  linear  or  logis¬ 
tic  modeling. 

16.  State  two  properties  of  a  hazard  function. 

17.  State  three  reasons  why  hazard  functions  are  used. 

18.  State  three  goals  of  a  survival  analysis. 
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19.  The  following  data  are  a  sample  from  the  1967-1980 
Evans  County  study.  Survival  times  (in  years)  are 
given  for  two  study  groups,  each  with  25  participants. 
Group  1  has  no  history  of  chronic  disease  (CHR  =  0), 
and  group  2  has  a  positive  history  of  chronic  disease 
(CHR  =  1): 

Group  1  (CHR  =  0):  12.3+,  5.4,  8.2,  12.2+,  11.7,  10.0, 

5.7,  9.8,  2.6,  11.0,  9.2,  12.1+,  6.6, 

2.2,  1.8,  10.2,  10.7,  11.1,  5.3,  3.5, 

9.2,  2.5,  8.7,  3.8,  3.0 

Group  2  (CHR  =  1):  5.8,  2.9,  8.4,  8.3,  9.1,  4.2,  4.1,  1.8, 
3.1,  11.4,  2.4,  1.4,  5.9,  1.6,  2.8, 
4.9,  3.5,  6.5,  9.9,  3.6,  5.2,  8.8,  7.8, 

4.7,  3.9 

For  group  1,  complete  the  following  table  involving 
ordered  failure  times: 

_ kr>  mf  gf _ R(kp) _ 

Group  1:  0.0  0  0  25  persons  survived  >  0  years 

1.8  1  0  25  persons  survived  >  1.8  years 

2.2 

2.5 

2.6 
3.0 

3.5 

3.8 

5.3 

5.4 

5.7 

6.6 

8.2 

8.7 

9.2 

9.8 
10.0 

10.2 

10.7 
11.0 
11.1 

11.7 
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Answers  to 

Practice 

Exercises 


20. 


1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 
9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 


17. 

18. 

19. 

20. 


For  the  data  of  Problem  19,  the  average  survival 
time  (T)  and  the  average  hazard  rate  (/?)  for  each 
group  are  given  as  follows: 

_ T  Ti 

Group  1:  7.5  .1165 

Group  2:  5.3  .1894 

a.  Based  on  the  above  information,  which  group  has 
a  better  survival  prognosis?  Explain  briefly. 

b.  How  would  a  comparison  of  survivor  curves  pro¬ 
vide  additional  information  to  what  is  provided  in 
the  above  table? 

F:  the  outcome  is  continuous;  time  until  an  event 
occurs. 

T 

T 

F:  the  person  fails,  i.e.,  is  not  censored. 

F:  S(t)  is  the  survivor  function. 

F:  the  hazard  is  a  rate,  not  a  probability. 

T 

F:  the  hazard  function  gives  instantaneous  potential. 

T 

T 

T 

T 

F:  ordered  failure  times  are  data  for  persons  who  are 
failures. 

T 

T 


tjf) _ mf_ 

0  0 

1  2 

2  3 

3  2 


-  33 

r  =  l5  =  2-2’ 


qi _ _ 

0  15  persons  survive  >  0  weeks 

3  15  persons  survive  >  1  weeks 

2  10  persons  survive  >  2  weeks 

3  5  persons  survive  >  3  weeks 


h=  —  =  0.22 
33 


Median  =  3  weeks 
Group  A 
Group  B 
Group  A 
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Introduction 


Abbreviated 

Outline 


We  begin  with  a  brief  review  of  the  purposes  of  survival 
analysis,  basic  notation  and  terminology,  and  the  basic 
data  layout  for  the  computer. 

We  then  describe  how  to  estimate  and  graph  survival 
curves  using  the  Kaplan-Meier  (KM)  method.  The  esti¬ 
mated  survival  probabilities  are  computed  using  a  product 
limit  formula. 

Next,  we  describe  how  to  compare  two  or  more  survival 
curves  using  the  log-rank  test  of  the  null  hypothesis  of 
a  common  survival  curve.  For  two  groups,  the  log-rank 
statistic  is  based  on  the  summed  observed  minus  expected 
score  for  a  given  group  and  its  variance  estimate.  For 
several  groups,  a  computer  should  always  be  used  because 
the  log-rank  formula  is  more  complicated  mathematically. 
The  test  statistic  is  approximately  chi-square  in  large 
samples  with  G  —  1  degrees  of  freedom,  where  G  denotes 
the  number  of  groups  being  compared. 

Several  alternatives  to  the  log-rank  test  will  be  briefly 
described.  These  tests  are  variations  of  the  log  rank  test 
that  weigh  each  observation  differently.  They  are  also  large 
sample  chi-square  tests  with  G  -  1  degrees  of  freedom. 

Finally,  we  describe  how  to  compute  confidence  intervals 
for  the  KM  curve  and  for  the  median  survival  time. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Review  (pages  58-60) 

II.  An  example  of  Kaplan-Meier  curves  (pages  61-65) 

III.  General  features  of  KM  curves  (pages  66-67) 

IV.  The  log- rank  test  for  two  groups  (pages  67-71  ) 

V.  The  log-rank  test  for  several  groups  (pages  71-73) 

VI.  Alternatives  to  the  log  rank  test  (pages  73-78) 

VII.  Confidence  intervals  for  KM  curves 
(pages  78-79) 

VIII.  Confidence  intervals  for  the  median  survival 
time  (page  80) 

IX.  Summary  (page  81) 
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Objectives 


Upon  completing  the  chapter,  the  learner  should  be  able  to: 

1 .  Compute  Kaplan-Meier  (KM)  probabilities  of  survival, 
given  survival  time  and  failure  status  information  on  a 
sample  of  subjects. 

2.  Interpret  a  graph  of  KM  curves  that  compare  two  or 
more  groups. 

3.  Draw  conclusions  as  to  whether  or  not  two  or  more 
survival  curves  are  the  same  based  on  computer  results 
that  provide  a  log-rank  test  and/or  an  alternative  test. 

4.  Decide  whether  the  log-rank  test  or  one  of  the 
alternatives  to  this  test  is  more  appropriate  for  a  given 
set  of  survival  data. 

5.  Compute  a  95%  confidence  interval  for  a  KM  survival 
probability. 

6.  Compute  a  95%  confidence  interval  for  the  median 
survival  time  obtained  from  a  KM  curve. 


58  2.  Kaplan-Meier  Survival  Curves  and  the  Log-Rank  Test 


Presentation 


This  presentation  describes  how  to  plot  and 
interpret  survival  data  using  Kaplan-Meier 
(KM)  survival  curves  and  how  to  test  whether 
or  not  two  or  more  KM  curves  are  equivalent 
using  the  log-rank  test.  We  also  describe  alter¬ 
native  tests  to  the  log-rank  test.  Furthermore, 
we  provide  formulae  for  computing  95% 
confidence  intervals  for  a  KM  curve  and  for 
the  median  survival  time. 


We  begin  by  reviewing  the  basics  of  survival 
analysis.  Generally,  survival  analysis  is  a  col¬ 
lection  of  statistical  procedures  for  the  analysis 
of  data  in  which  the  outcome  variable  of  inter¬ 
est  is  time  until  an  event  occurs.  By  event,  we 
mean  death,  disease  incidence,  relapse  from 
remission,  or  any  designated  experience  of 
interest  that  may  happen  to  an  individual. 

Time  =  survival  time  When  doing  a  survival  analysis,  we  usually 

refer  to  the  time  variable  as  survival  time.  We 
Event  =  failure  also  typically  refer  to  the  event  as  a  failure. 


I.  Review 

Start  TIME  Event 

Event:  death 
disease 
relapse 


Censoring:  Don’t  know  survival 
time  exactly 


Most  survival  analyses  consider  a  key  data  ana¬ 
lytical  problem  called  censoring.  In  essence,  cen¬ 
soring  occurs  when  we  have  some  information 
about  individual  survival  time,  but  we  don’t 
know  the  survival  time  exactly. 


■  True  survival  time 


Observed  survival  time 
Right-censored  - 


Most  survival  time  data  is  right-censored, 
because  the  true  survival  time  interval,  which 
we  don’t  really  know,  has  been  cut  off  (i.e., 
censored)  at  the  right  side  of  the  observed 
time  interval,  giving  us  an  observed  survival 
time  that  is  shorter  than  the  true  survival  time. 
We  want  to  use  the  observed  survival  time  to 
draw  implications  about  the  true  survival  time. 


NOTATION 

T  =  survival  time 


'random  variable 
t  =  specific  value  for  T 


As  notation,  we  denote  by  a  capital  T  the 
random  variable  for  a  person’s  survival  time. 
Next,  we  denote  by  a  small  letter  t  any  specific 
value  of  interest  for  the  variable  T. 
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d  =  (0,  1)  random  variable 


f  1  if  failure 
1  0  if  censored 


We  let  d  denote  a  (0,1)  random  variable 
indicating  either  censorship  or  failure.  A  per¬ 
son  who  does  not  fail,  that  is,  does  not  get  the 
event  during  the  study  period,  must  have  been 
censored  either  before  or  at  the  end  of  the 
study. 


S(t )  =  survivor  function 
=  Pr  (T>t) 


The  survivor  function,  denoted  by  S(t),  gives 
the  probability  that  the  random  variable  T 
exceeds  the  specified  time  t. 


0  t  Study  end 


Theoretically,  as  t  ranges  from  0  up  to  infinity, 
the  survivor  function  is  graphed  as  a  decreas¬ 
ing  smooth  curve,  which  begins  at  S(t)  =  1  at 
t  =  0  and  heads  downward  toward  zero  as 
t  increases  toward  infinity. 


In  practice,  using  data,  we  usually  obtain  esti¬ 
mated  survivor  curves  that  are  step  functions, 
as  illustrated  here,  rather  than  smooth  curves. 


h(t)  —  hazard  function 

=  instantaneous  potential 
given  survival  up  to  time  t 


The  hazard  function,  denoted  by  hit),  gives  the 
instantaneous  potential  per  unit  time  for 
the  event  to  occur  given  that  the  individual 
has  survived  up  to  time  t. 


Not  failing  h{t) 


h{t)  is  a  rate:  0  to  oo 


In  contrast  to  the  survivor  function,  which 
focuses  on  not  failing,  the  hazard  function 
focuses  on  failing;  in  other  words,  the  higher 
the  average  hazard,  the  worse  the  impact  on 
survival.  The  hazard  is  a  rate,  rather  than  a 
probability.  Thus,  the  values  of  the  hazard 
function  range  between  zero  and  infinity. 


Regardless  of  which  function  S(t)  or  h  ( / )  one 
prefers,  there  is  a  clearly  defined  relation¬ 
ship  between  the  two.  In  fact,  if  one  knows 
the  form  of  Sit),  one  can  derive  the  corres¬ 
ponding  hit),  and  vice  versa. 
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General  Data  Layout: 
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Alternative  (ordered)  data 
layout: 


Ordered 


failure 

times, 

f(f) 

#  of 
failures 
nif 

#  censored  in 
hf+  n)> 
9f 

Risk 

set. 

£< o)  =  0 

m0  =  0 

9o 

R(t(  o)) 

hi) 

m\ 

q  i 

R(t(  n) 

f(2) 

m2 

<12 

R(t(  2)) 

t(k) 

mk 

9k 

R(t(k )) 

The  general  data  layout  for  a  survival  analysis 
is  given  by  the  table  shown  here.  The  first  col¬ 
umn  of  the  table  identifies  the  study  subjects. 
The  second  column  gives  the  observed  survival 
time  information.  The  third  column  gives  the 
information  for  d,  the  dichotomous  variable 
that  indicates  censorship  status.  The  remain¬ 
der  of  the  information  in  the  table  gives  values 
for  explanatory  variables  of  interest. 


An  alternative  data  layout  is  shown  here.  This 
layout  is  the  basis  upon  which  Kaplan-Meier 
survival  curves  are  derived.  The  first  column 
in  the  table  gives  ordered  survival  times  from 
smallest  to  largest.  The  second  column  gives 
frequency  counts  of  failures  at  each  distinct  fail¬ 
ure  time.  The  third  column  gives  frequency 
counts,  denoted  by  qf,  of  those  persons  censored 
in  the  time  interval  starting  with  failure  time  ty) 
up  to  but  not  including  the  next  failure  time, 
denoted  by  t(f+iy  The  last  column  gives  the 
risk  set,  which  denotes  the  collection  of  indivi¬ 
duals  who  have  survived  at  least  to  time  t(f). 


Table  of  ordered  failures: 

•  Uses  all  information  up  to  time 
of  censorship; 

•  S(t)  is  derived  from  R(t). 


To  estimate  the  survival  probability  at  a  given 
time,  we  make  use  of  the  risk  set  at  that  time  to 
include  the  information  we  have  on  a  censored 
person  up  to  the  time  of  censorship,  rather 
than  simply  throw  away  all  the  information 
on  a  censored  person. 


Survival  probability: 

Use  Kaplan-Meier  (KM) 

method. 


The  actual  computation  of  such  a  survival 
probability  can  be  carried  out  using  the 
Kaplan-Meier  (KM)  method.  We  introduce  the 
KM  method  in  the  next  section  by  way  of  an 
example. 
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II.  An  Example  of 

Kaplan-Meier  Curves 


EXAMPLE 


The  data:  remission  times  (weeks)  for 
two  groups  of  leukemia  patients 


Group  1  («  =  21) 
treatment 

Group  2  (n  —  21) 
placebo 

6,  6,  6,  7,  10, 

13,  16,  22,  23, 

6+,  9+,  10+,  11+, 
17+,  19+,  20+, 
25+,  32+,  32+, 
34+,  35+, 

I,  1,  2,  2,  3, 

4,  4,  5,  5, 

8,  8,  8,  8, 

II,  11,  12,  12, 

15,  17,  22,  23 

Note:  +  denotes  censored 

#  failed 

#  censored  Total 

Group  1  9 

Group  2  21 

12  21 

0  21 

Descriptive  statistics: 

T i (ignoring  +  's)  =  17.1,  f2  =  8.6 


hi  =  .025,  h2  =  .115,  ^  =  4.6 
hi 


The  data  for  this  example  derive  from  a  study 
of  the  remission  times  in  weeks  for  two  groups 
of  leukemia  patients,  with  21  patients  in  each 
group.  Group  1  is  the  treatment  group  and 
group  2  is  the  placebo  group.  The  basic  question 
of  interest  concerns  comparing  the  survival 
experience  of  the  two  groups. 


Of  the  2 1  persons  in  group  1 ,  9  failed  during  the 
study  period  and  12  were  censored.  In  contrast, 
none  of  the  data  in  group  2  are  censored;  that 
is,  all  2 1  persons  in  the  placebo  group  went  out 
of  remission  during  the  study  period. 

In  Chapter  1 ,  we  observed  for  this  data  set  that 
group  1  appears  to  have  better  survival  progno¬ 
sis  than  group  2,  suggesting  that  the  treatment 
is  effective.  This  conclusion  was  supported  by 
descriptive  statistics  for  the  average  survival 
time  and  average  hazard  rate  shown.  Note, 
however,  that  descriptive  statistics  provide 
overall  comparisons  but  do  not  compare  the 
two  groups  at  different  times  of  follow-up. 
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1  EXAMPLE:  (continued)  1 

Ordered  failure  times: 

Group  1  (treatment) 

z(/) 

nf 

mf 

9f 

0 

21 

0 

0 

6 

21 

3 

1 

7 

17 

1 

1 

10 

15 

1 

2 

13 

12 

1 

0 

16 

11 

1 

3 

22 

7 

1 

0 

23 

6 

1 

5 

>23 

— 

— 

— 

Group  2  (placebo) 

t(f) 

nf 

mf 

<if 

0 

21 

0 

0 

1 

21 

2 

0 

2 

19 

2 

0 

3 

17 

1 

0 

4 

16 

2 

0 

5 

14 

2 

0 

8 

12 

4 

0 

11 

8 

2 

0 

12 

6 

2 

0 

15 

4 

1 

0 

17 

3 

1 

0 

22 

2 

1 

0 

23 

1 

1 

0 

Group  2: 

no  censored  subjects 

Group  2  (placebo) 

G) 

nf 

mf  qf 

S  (?</)) 

0 

21 

0  0 

1 

1 

21 

2  0 

19/21  = 

.90 

2 

19 

2  0 

17/21  = 

.81 

3 

17 

1  0 

16/21  = 

.76 

4 

16 

2  0 

14/21  = 

.67 

5 

14 

2  0 

12/21  = 

.57 

8 

12 

4  0 

8/21  = 

.38 

11 

8 

2  0 

6/21  = 

.29 

12 

6 

2  0 

4/21  = 

.19 

15 

4 

1  0 

3/21  = 

.14 

17 

3 

1  0 

2/21  = 

.10 

22 

2 

1  0 

1/21  = 

.05 

23 

1 

1  0 

0/21  = 

.00 

A  table  of  ordered  failure  times  is  shown  here 
for  each  group.  These  tables  provide  the  basic 
information  for  the  computation  of  KM  curves. 

Each  table  begins  with  a  survival  time  of  zero, 
even  though  no  subject  actually  failed  at  the 
start  of  follow-up.  The  reason  for  the  zero  is 
to  allow  for  the  possibility  that  some  subjects 
might  have  been  censored  before  the  earliest 
failure  time. 

Also,  each  table  contains  a  column  denoted  as 
rif  that  gives  the  number  of  subjects  in  the  risk 
set  at  the  start  of  the  interval.  Given  that  the 
risk  set  is  defined  as  the  collection  of  indivi¬ 
duals  who  have  survived  at  least  to  time 
it  is  assumed  that  rif  includes  those  persons 
failing  at  time  fy).  In  other  words,  rif  counts 
those  subjects  at  risk  for  failing  instanta¬ 
neously  prior  to  time  tq). 


We  now  describe  how  to  compute  the  KM 
curve  for  the  table  for  group  2.  The  computa¬ 
tions  for  group  2  are  quite  straightforward 
because  there  are  no  censored  subjects  for 
this  group. 

The  table  of  ordered  failure  times  for  group 
2  is  presented  here  again  with  the  addition  of 
another  column  that  contains  survival  proba¬ 
bility  estimates.  These  estimates  are  the  KM 
survival  probabilities  for  this  group.  We  will 
discuss  the  computations  of  these  probabilities 
shortly. 
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EXAMPLE:  (continued) 


KM  Curve  for  Group  2  (Placebo) 


S(t)  =  Pr  (r  >  t ) 
Group  2  (placebo) 


*(f)  nf  mf  qf  S  (tOT) 


0 

21 

0 

0 

1 

1 

21 

2 

0 

19/21  = 

.90 

2 

19 

2 

0 

17/21  = 

.81 

3 

17 

1 

0 

16/21  = 

.76 

4 

16 

2 

0 

14/21  = 

.67 

5 

14 

2 

0 

12/21  = 

.57 

8 

12 

4 

0 

8/21  = 

.38 

11 

8 

2 

0 

6/21  = 

.29 

12 

6 

2 

0 

4/21  = 

.19 

15 

4 

1 

0 

3/21  = 

.14 

17 

3 

1 

0 

2/21  = 

.10 

22 

2 

1 

0 

1/21  = 

.05 

23 

1 

1 

0 

0/21  = 

.00 

#  surviving  past  tlf] 

^ \T(f )) 

21 

No  censorship  in  group  2 
Alternative  formula:  KM  approach 


A  plot  of  the  KM  survival  probabilities 
corresponding  to  each  ordered  failure  time  is 
shown  here  for  group  2.  Empirical  plots  such 
as  this  one  are  typically  plotted  as  a  step  func¬ 
tion  that  starts  with  a  horizontal  line  at  a  sur¬ 
vival  probability  of  1  and  then  steps  down  to 
the  other  survival  probabilities  as  we  move 
from  one  ordered  failure  time  to  another. 

We  now  describe  how  the  survival  probabilities 
for  the  group  2  data  are  computed.  Recall  that 
a  survival  probability  gives  the  probability  that 
a  study  subject  survives  past  a  specified  time. 


Thus,  considering  the  group  2  data,  the  pro¬ 
bability  of  surviving  past  zero  is  unity,  as  it 
will  always  be  for  any  data  set. 

Next,  the  probability  of  surviving  past  the 
first  ordered  failure  time  of  1  week  is  given  by 
19/21  or  (.90)  because  2  people  failed  at  2  week, 
so  that  19  people  from  the  original  21  remain 
as  survivors  past  2  week. 

Similarly,  the  next  probability  concerns  sub¬ 
jects  surviving  past  2  weeks,  which  is  17/21 
(or.  81)  because  2  subjects  failed  at  1  week  and 
2  subjects  failed  at  2  weeks  leaving  17  out  of  the 
original  21  subjects  surviving  past  2  weeks. 

The  remaining  survival  probabilities  in  the 
table  are  computed  in  the  same  manner,  that 
is,  we  count  the  number  of  subjects  surviving 
past  the  specified  time  being  considered  and 
divide  this  number  by  2 1 ,  the  number  of  sub¬ 
jects  at  the  start  of  follow-up. 

Recall  that  no  subject  in  group  2  was  censored, 
so  the  q  column  for  group  2  consists  entirely 
of  zeros.  If  some  of  the  q  s  had  been  nonzero, 
an  alternative  formula  for  computing  survival 
probabilities  would  be  needed.  This  alterna¬ 
tive  formula  is  called  the  Kaplan-Meier  (KM) 
approach  and  can  be  illustrated  using  the 
group  2  data  even  though  all  values  of  q  are 


zero. 
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For  example,  an  alternative  way  to  calculate 
the  survival  probability  of  exceeding  4  weeks 
for  the  group  2  data  can  be  written  using  the 
KM  formula  shown  here.  This  formula  involves 
the  product  of  conditional  probability  terms. 
That  is,  each  term  in  the  product  is  the  proba¬ 
bility  of  exceeding  a  specific  ordered  failure 
time  t(f)  given  that  a  subject  survives  up  to  that 
failure  time. 

Thus,  in  the  KM  formula  for  survival  past 
4  weeks,  the  term  19/21  gives  the  probability 
of  surviving  past  the  first  ordered  failure  time, 
1  week,  given  survival  up  to  the  first  week.  Note 
that  all  21  persons  in  group  2  survived  up  to 
1  week,  but  that  2  failed  at  1  week,  leaving  19 
persons  surviving  past  1  week. 

Similarly,  the  term  16/17  gives  the  probability 
of  surviving  past  the  third  ordered  failure  time 
at  week  3,  given  survival  up  to  week  3.  There 
were  1 7  persons  who  survived  up  to  week  3  and 
1  of  these  then  failed,  leaving  1 6  survivors  past 
week  3.  Note  that  the  17  persons  in  the  deno¬ 
minator  represents  the  number  in  the  risk  set 
at  week  3. 

Notice  that  the  product  terms  in  the  KM  for¬ 
mula  for  surviving  past  4  weeks  stop  at  the  4th 
week  with  the  component  14/16.  Similarly,  the 
KM  formula  for  surviving  past  8  weeks  stops 
at  the  eighth  week. 

More  generally,  any  KM  formula  for  a  survival 
probability  is  limited  to  product  terms  up  to 
the  survival  week  being  specified.  That  is 
why  the  KM  formula  is  often  referred  to  as  a 
"product-limit”  formula. 

Next,  we  consider  the  KM  formula  for  the  data 
»  from  group  1 ,  where  there  are  several  censored 

Qt  ^  (hf))  observations. 

0  ® 

1  1  x  The  estimated  survival  probabilities  obtained 

using  the  KM  formula  are  shown  here  for 
group  1. 

The  first  survival  estimate  on  the  list  is  S(0)  —  1 , 
as  it  will  always  be,  because  this  gives  the  prob¬ 
ability  of  surviving  past  time  zero. 


Group  1  (treatment) 

t(f)  nf  mf 

0  21  0 

6  21  3 


KM  formula  =  product  limit 
formula 


Presentation:  II.  An  Example  of  Kaplan-Meier  Curves  65 


EXAMPLE:  (continued) 


Group  1  (treatment) 

t(f)  nf  mf  4f  S 


0 

21 

0 

0 

© 

6 

21 

3 

1 

lxft 

\2y 

=  .8571 

7 

17 

1 

1 

.8571  xj 

l6\ 

=  .8067 

10 

15 

f 

2 

.8067  x 

14 

15 

=  .7529 

13 

12 

1 

0 

.7529  x 

11 

n 

=  .6902 

16 

11 

1 

3 

.6902  x 

10 

IT 

=  .6275 

22 

7 

1 

0 

.6275  x 

6 

7  _ 

=  .5378 

23 

6 

1 

5 

.5378  x 

5 

=  .4482 

Fraction  at  tqy  Pr (T  >  tq)  \  T  >  t(f)) 

Not  available  at  tyy failed  prior  to 
or 

censored  prior  to  t <y) 


group  1  only 


KM  Plots  for  Remission  Data 


computer  package,  e.g.,  SAS, 

Stata, 

SPSS 

R 


The  other  survival  estimates  are  calculated  by 
multiplying  the  estimate  for  the  immediately 
preceding  failure  time  by  a  fraction.  For  exam¬ 
ple,  the  fraction  is  18/21  for  surviving  past 
week  6,  because  21  subjects  remain  up  to 
week  6  and  3  of  these  subjects  fail  to  survive 
past  week  6.  The  fraction  is  16/17  for  surviving 
past  week  7,  because  17  people  remain  up  to 
week  7  and  1  of  these  fails  to  survive  past  week  7. 
The  other  fractions  are  calculated  similarly. 

For  a  specified  failure  time  1(f),  the  fraction  may 
be  generally  expressed  as  the  conditional  prob¬ 
ability  of  surviving  past  time  tq},  given  avail¬ 
ability  (i.e.,  in  the  risk  set)  at  time  tq,.  This 
is  exactly  the  same  formula  that  we  previously 
used  to  calculate  each  product  term  in  the  pro¬ 
duct  limit  formula  used  for  the  group  2  data. 

Note  that  a  subject  might  not  be  available  at 
time  tq)  for  one  of  two  reasons:  (1)  either  the 
subject  has  failed  prior  to  /(p,  or  (2)  the  subject 
has  been  censored  prior  to  Z(p.  Group  1  has 
censored  observations,  whereas  group  2  does 
not.  Thus,  for  group  1,  censored  observations 
have  to  be  taken  into  account  when  determin¬ 
ing  the  number  available  at  tq,. 

Plots  of  the  KM  curves  for  groups  1  and  2  are 
shown  here  on  the  same  graph.  Notice  that  the 
KM  curve  for  group  1  is  consistently  higher 
than  the  KM  curve  for  group  2.  These  figures 
indicate  that  group  1,  which  is  the  treatment 
group,  has  better  survival  prognosis  than  group 
2,  the  placebo  group.  Moreover,  as  the  number 
of  weeks  increases,  the  two  curves  appear  to  get 
farther  apart,  suggesting  that  the  beneficial 
effects  of  the  treatment  over  the  placebo  are 
greater  the  longer  one  stays  in  remission. 

The  KM  plots  shown  above  can  be  easily 
obtained  from  most  computer  packages  that 
perform  survival  analysis,  including  SAS,  Stata, 
SPSS,  and  R.  All  the  user  needs  to  do  is  provide 
a  KM  computer  program  with  the  basic  data 
layout  and  then  provide  appropriate  commands 
to  obtain  plots. 


66  2.  Kaplan-Meier  Survival  Curves  and  the  Log-Rank  Test 


III.  General  Features 
of  KM  Curves 


General  KM  formula: 

5(/(f)) 

=  ^O(f-i))  x  —  f(h) 


The  general  formula  for  a  KM  survival  probabil¬ 
ity  at  failure  time  fy)  is  shown  here.  This  fonnula 
gives  the  probability  of  surviving  past  the  previ¬ 
ous  failure  time  multiplied  by  the  condi¬ 

tional  probability  of  surviving  past  time  t^, 
given  survival  to  at  least  time  t(f). 


KM  formula  =  product  limit 
formula 

/— 1  ^ 

%-d)= n^(r>^ir^^o) 

i=i 


The  above  KM  formula  can  also  be  expressed  as 
a  product  limit  if  we  substitute  for  the  survival 
probability  S  (t(f_  q),  the  product  of  all  fractions 
that  estimate  the  conditional  probabilities  for 
failure  times  h/_p  and  earlier. 


EXAMPLE  1 

5(10)  = 

.8067  x  ^  =  .7529 

= 

18  16  ^  14 

2lXl7 

5(16)  = 

.6902  x  ^ 

1  '4 

18x16x14xH  x10 

21  17X  15  X  12  X  11 

For  example,  the  probability  of  surviving 
past  10  weeks  is  given  in  the  table  for  group  1 
(page  65)  by  .8067  times  14/15,  which  equals 
.7529.  But  the  .8067  can  be  alternatively 
written  as  the  product  of  the  fractions  18/21 
and  16/17.  Thus,  the  product  limit  formula  for 
surviving  past  10  weeks  is  given  by  the  triple 
product  shown  here. 

Similarly,  the  probability  of  surviving  past 
16  weeks  can  be  written  either  as  .6902  x  10/11, 
or  equivalently  as  the  five-way  product  of  frac¬ 
tions  shown  here. 


/ 

s(f(/)) = n/5,'[r>r«ir^w] 

i=i 

=  1)) 

_ x  P'iT > t(f)\T  >  t^) 


The  general  expression  for  the  product  limit 
formula  for  the  KM  survival  estimate  is  shown 
here  together  with  the  general  KM  fonnula 
given  earlier.  Both  expressions  are  equivalent. 


Math  proof: 

Pr(A  and  B)  =  Pr(A)  x  Pr(B  |  A) 
always 


A  simple  mathematical  proof  of  the  KM  for¬ 
mula  can  be  described  in  probability  terms. 
One  of  the  basic  rules  of  probability  is  that 
the  probability  of  a  joint  event,  say  A  and  B, 
is  equal  to  the  probability  of  one  event,  say  A, 
times  the  conditional  probability  of  the  other 
event,  B,  given  A. 
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A  =  “T  >  t(f)’  — >  A  and  B  =  B 
B  =  “T  >  tlf)” 

Pr(A  and  B)  =  Pr(B)  =  (Zu^jT) 


If  we  let  A  be  the  event  that  a  subject  survives  to 
at  least  time  t ^  and  we  let  B  be  the  event  that  a 
subject  survives  past  time  hq,  then  the  joint 
event  A  and  B  simplifies  to  the  event  B,  which 
is  inclusive  of  A.  It  follows  that  the  probability 
of  A  and  B  equals  the  probability  of  surviving 
past  time  t^. 


No  failures  during  h/vq  <  T  <  /q,  Also,  because  /(q  is  the  next  failure  time  after 
Pr(A)  =  Pr( T  >  l(f-\),  there  can  be  no  failures  after  time 

and  before  time  fy).  Therefore,  the  probability 
of  A  is  equivalent  to  the  probability  of  surviving 
past  the  (f—  l)th  ordered  failure  time. 


Furthermore,  the  conditional  probability  of  B 
given  A  is  equivalent  to  the  conditional  proba¬ 
bility  in  the  KM  formula. 


Thus,  from  Pr(A  and  B)  formula,  Thus,  using  the  basic  rules  of  probability,  the 

KM  formula  can  be  derived. 

Pr(A  and  B)  =  Pr(A)  x  Pr(B  |  A) 

5  (*(/))  =  5('(/-i)) 

x  Pr(T  >  t(f)\T  >  t{f)) 


IV.  The  Log-Rank  Test 
for  Two  Groups 

Are  KM  cui'ves  statistically 
equivalent? 


We  now  describe  how  to  evaluate  whether  or 
not  KM  curves  for  two  or  more  groups  are 
statistically  equivalent.  In  this  section  we  con¬ 
sider  two  groups  only.  The  most  popular  testing 
method  is  called  the  log-rank  test. 


When  we  state  that  two  KM  curves  are  “statisti¬ 
cally  equivalent,”  we  mean  that,  based  on  a  test¬ 
ing  procedure  that  compares  the  two  curves  in 
some  “overall  sense,”  we  do  not  have  evidence  to 
indicate  that  the  true  (population)  survival 
curves  are  different. 
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•  Chi-square  test 

•  Overall  comparison  of  KM 
curves 

•  Observed  versus  expected 
counts 

•  Categories  defined  by  ordered 
failure  times 


EXAMPLE 

Remission  data:  n  = 

#  failures 

:  42 

#  in  risk  set 

'«1  f 

m2f 

«if 

n2f 

1 

0 

2 

21 

21 

2 

0 

2 

21 

19 

3 

0 

1 

21 

17 

© 

0 

2 

21 

16 

5 

0 

2 

21 

14 

6 

3 

0 

21 

12 

7 

1 

0 

17 

12 

8 

0 

4 

16 

12 

© 

1 

0 

15 

8 

11 

0 

2 

13 

8 

12 

0 

2 

12 

6 

13 

1 

0 

12 

4 

15 

0 

I 

11 

4 

16 

1 

0 

11 

3 

17 

0 

i 

10 

3 

22 

1 

i 

7 

2 

23 

1 

i 

6 

1 

Expected  cell  counts: 

o/  =  f  — ~T~ — )  x  imif  +  m2f ) 
\nif  +  n2fJ 

T  T 

Proportion  #  of  failures  over 
in  risk  set  both  groups 

e% f  =  (  — — — )  x  (mif  +  m2f) 

W  +  «2 f) 


The  log-rank  test  is  a  large-sample  chi-square 
test  that  uses  as  its  test  criterion  a  statistic  that 
provides  an  overall  comparison  of  the  KM 
curves  being  compared.  This  (log-rank)  statis¬ 
tic,  like  many  other  statistics  used  in  other 
kinds  of  chi-square  tests,  makes  use  of  observed 
versus  expected  cell  counts  over  categories  of 
outcomes.  The  categories  for  the  log-rank  sta¬ 
tistic  are  defined  by  each  of  the  ordered  failure 
times  for  the  entire  set  of  data  being  analyzed. 

As  an  example  of  the  information  required  for 
the  log-rank  test,  we  again  consider  the  com¬ 
parison  of  the  treatment  (group  1)  and  placebo 
(group  2)  subjects  in  the  remission  data  on 
42  leukemia  patients. 

Here,  for  each  ordered  failure  time,  t(f)t  in  the 
entire  set  of  data,  we  show  the  numbers  of 
subjects  (nijf)  failing  at  that  time,  separately  by 
group  (z),  followed  by  the  numbers  of  subjects 
( riif )  in  the  risk  set  at  that  time,  also  separately 
by  group. 

Thus,  for  example,  at  week  4,  no  subjects  failed 
in  group  1,  whereas  two  subjects  failed  in 
group  2.  Also,  at  week  4,  the  risk  set  for  group  1 
contains  21  persons,  whereas  the  risk  set  for 
group  2  contains  16  persons. 

Similarly,  at  week  10,  1  subject  failed  in  group  1, 
and  no  subjects  failed  at  group  2;  the  risk  sets 
for  each  group  contain  15  and  8  subjects, 
respectively. 

We  now  expand  the  previous  table  to  include 
expected  cell  counts  and  observed  minus 
expected  values  for  each  group  at  each  ordered 
failure  time.  The  formula  for  the  expected 
cell  counts  is  shown  here  for  each  group.  For 
group  1,  this  formula  computes  the  expected 
number  at  time  f  (i.e.,  e\f)  as  the  proportion  of 
the  total  subjects  in  both  groups  who  are  at  risk 
at  time  f,  that  is,  n\f!(n\f  +  n2f),  multiplied  by 
the  total  number  of  failures  at  that  time  for 
both  groups  (i.e.,  m ^  +  m2f ).  For  group  2,  e2f 
is  computed  similarly. 
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EXAMPLE  | 

Expande 

f  hf) 

:d  Table  (Remission  Data) 

#  failures  #  in  risk  set 

#  expected 

Observed-expected 

mlf 

m2f 

nlf 

n2f 

elf 

e2f 

m2f~e2f 

1 

1 

0 

2 

21 

21 

(21/42)  x 

2 

(21/42) 

X 

2 

-1.00 

1.00 

2 

2 

0 

2 

21 

19 

(21/40)  x 

2 

(19/40) 

X 

2 

-1.05 

1.05 

3 

3 

0 

1 

21 

17 

(21/38)  x 

1 

(17/38) 

X 

1 

-0.55 

0.55 

4 

4 

0 

2 

21 

16 

(21/37)  x 

2 

(16/37) 

X 

2 

-1.14 

1.14 

5 

5 

0 

2 

21 

14 

(21/35)  x 

2 

(14/35) 

X 

2 

-1.20 

1.20 

6 

6 

3 

0 

21 

12 

(21/33)  x 

3 

(12/33) 

X 

3 

1.09 

-1.09 

7 

7 

1 

0 

17 

12 

(17/29)  x 

1 

(12/29) 

X 

1 

0.41 

-0.41 

8 

8 

0 

4 

16 

12 

(16/28)  x 

4 

(12/28) 

X 

4 

-2.29 

2.29 

9 

10 

1 

0 

15 

8 

(15/23)  x 

1 

(8/23) 

X 

1 

0.35 

-0.35 

10 

11 

0 

2 

13 

8 

(13/21)  x 

2 

(8/21) 

X 

2 

-1.24 

1.24 

11 

12 

0 

2 

12 

6 

(12/18)  x 

2 

(6/18) 

X 

2 

-1.33 

1.33 

12 

13 

F 

0 

12 

4 

(12/16)  x 

1 

(4/16) 

X 

1 

0.25 

-0.25 

13 

15 

0 

i 

11 

4 

(11/15)  x 

1 

(4/15) 

X 

1 

-0.73 

0.73 

14 

16 

1 

0 

11 

3 

(11/14)  x 

1 

(3/14) 

X 

i 

0.21 

-0.21 

15 

17 

0 

1 

10 

3 

(10/13)  x 

1 

(3/13) 

X 

1 

-0.77 

0.77 

16 

22 

1 

1 

7 

2 

(7/9)  x 

2 

(2/9) 

X 

2 

-0.56 

0.56 

17 

23 

1 

1 

6 

1 

(6/7)  x 

2 

(1/7) 

X 

2 

-0.71 

0.71 

Totals 

9 

@ 

19.26 

(1074) 

-10.26 

(hoi6) 

#  of  failure  times 

I 

17 

Oi  —  E,  =  (ntjf  —  e,f) , 

/=! 

i  =  1,2 


EXAMPLE 


O]  =  -10.26 
02-E2=  10.26 


When  two  groups  are  being  compared,  the 
log-rank  test  statistic  is  formed  using  the  sum 
of  the  observed  minus  expected  counts  over  all 
failure  times  for  one  of  the  two  groups.  In  this 
example,  this  sum  is  —10.26  for  group  1  and 
10.26  for  group  2.  We  will  use  the  group  2  value 
to  carry  out  the  test,  but  as  we  can  see,  except 
for  the  minus  sign,  the  difference  is  the  same 
for  the  two  groups. 


Two  groups: 

02  —  E2  =  summed  observed  minus 
expected  score  for  group  2 


Log— rank  statistic 


(O2-E2)2 
Var (02  -  E2) 


For  the  two-group  case,  the  log-rank  statistic, 
shown  here  at  the  left,  is  computed  by  dividing 
the  square  of  the  summed  observed  minus 
expected  score  for  one  of  the  groups  —  say, 
group  2  —  by  the  variance  of  the  summed 
observed  minus  expected  score. 
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Var  (Oi-Ei) 


=  E 


»i/«2/('»i/  +  f)  («i/  +  ny  -  mif  ~  m2f) 


(«i/  +  n2fy  (nlf  +  n2f  -  1) 


1=1,  2 


//(,:  no  difference  between  survival 
curves 

Log-rank  statistic  with  1  df 
under  H0 


Computer  programs: 

Stata’s  “sts  test”: 

•  descriptive  statistics  for  KM 
curves 

•  log-rank  statistic 

•  Alternative  statistics  to 
log-rank  statistic 


EXAMPLE 


Using  Stata:  Remission  Data 

Events  Events 

Group  observed  expected 

1  9  19.25 

2  _ 21 _ 10.75 

Total  30  30.00 

Log  rank  =  chi2(2)  =  16.79 
P-Value  =  Pr  >  chi2  =  0.000 


The  expression  for  the  estimated  variance  is 
shown  here.  For  two  groups,  the  variance  for¬ 
mula  is  the  same  for  each  group.  This  variance 
formula  involves  the  number  in  the  risk  set  in 
each  group  (n;y)  and  the  number  of  failures 
in  each  group  (m,y)  at  time  f.  The  summation 
is  over  all  distinct  failure  times. 

The  null  hypothesis  being  tested  is  that  there  is 
no  overall  difference  between  the  two  survival 
curves.  Under  this  null  hypothesis,  the  log- 
rank  statistic  is  approximately  chi-square  with 
one  degree  of  freedom.  Thus,  a  P-value  for  the 
log-rank  test  is  determined  from  tables  of  the 
chi-square  distribution. 

Several  computer  programs  are  available  for 
calculating  the  log-rank  statistic.  For  example 
the  Stata  package  has  a  command  called  “sts 
test”  that  computes  descriptive  information 
about  Kaplan-Meier  curves,  the  log-rank  sta¬ 
tistic,  and  alternative  statistics  to  the  log-rank 
statistic,  to  be  described  later.  Other  packages, 
like  SAS  and  SPSS,  have  procedures  that  pro¬ 
vide  results  similar  to  those  of  Stata.  A  compar¬ 
ison  of  Stata,  SAS,  SPSS  and  R  procedures  and 
output  is  provided  in  the  Computer  Appendix  at 
the  back  of  this  text. 

For  the  remission  data,  the  edited  printout 
from  using  the  Stata  “sts  test”  procedure  is 
shown  here.  The  log-rank  statistic  is  16.79 
and  the  corresponding  P-value  is  zero  to  three 
decimal  places.  This  P-value  indicates  that  the 
null  hypothesis  should  be  rejected.  We  can 
therefore  conclude  that  the  treatment  and  pla¬ 
cebo  groups  have  significantly  different  KM 
survival  curves. 
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EXAMPLE 


02-E2=  10.26 
Var (02  -  E2)  =  6.2685 


Log  -  rank  statistic  = 


(Oi  ~  E2)2 
Var(C>2  —  E2) 


(10.26)2 

6.2685 


16.793 


Although  the  use  of  a  computer  is  the  easiest 
way  to  calculate  the  log-rank  statistic,  we  pro¬ 
vide  here  some  of  the  details  of  the  calculation. 
We  have  already  seen  from  earlier  computa¬ 
tions  that  the  value  of  02  —  E2  is  10.26.  The 
estimated  variance  of  02  -  E2  is  computed 
from  the  variance  formula  above  to  be  6.2685. 
The  log-rank  statistic  then  is  obtained  by  squar¬ 
ing  10.26  and  dividing  by  6.285,  which  yields 
16.793,  as  shown  on  the  computer  printout. 


Approximate  formula: 

#  of  groups  (n 

x2  k,  ^2  —  ~ 

Ei 


An  approximation  to  the  log-rank  statistic, 
shown  here,  can  be  calculated  using  observed 
and  expected  values  for  each  group  without 
having  to  compute  the  variance  formula.  The 
approximate  formula  is  of  the  classic  chi-square 
form  that  sums  over  each  group  being  compared 
the  square  of  the  observed  minus  expected  value 
divided  by  the  expected  value. 


EXAMPLE 


2  (— 10.26)2  (10.26)2 
19.26  10.74 

=  15.276 

Log-rank  statistic  =  16.793 


The  calculation  of  the  approximate  formula  is 
shown  here  for  the  remission  data.  The 
expected  values  are  19.26  and  10.74  for  groups 
1  and  2,  respectively.  The  chi-square  value 
obtained  is  15.276,  which  is  slightly  smaller 
than  the  log-rank  statistic  of  16.793. 


V.  The  Log-Rank  Test 
for  Several  Groups 

H0:  All  survival  curves  are  the  same. 


The  log-rank  test  can  also  be  used  to  compare 
three  or  more  survival  curves.  The  null  hypo¬ 
thesis  for  this  more  general  situation  is  that  all 
survival  curves  are  the  same. 


Log-rank  statistics  for  >  2  groups 
involves  variances  and  covariances 
of  O,  -  E,. 

Matrix  formula:  See  Appendix  at 
end  of  this  chapter. 


Although  the  same  tabular  layout  can  be  used 
to  carry  out  the  calculations  when  there  are 
more  than  two  groups,  the  test  statistic  is  more 
complicated  mathematically,  involving  both 
variances  and  covariances  of  summed  observed 
minus  expected  scores  for  each  group.  A  con¬ 
venient  mathematical  formula  can  be  given  in 
matrix  terms.  We  present  the  matrix  formula  for 
the  interested  reader  in  an  Appendix  at  the  end 
of  this  chapter. 


72  2.  Kaplan-Meier  Survival  Curves  and  the  Log-Rank  Test 


Use  computer  program  for 
calculations. 


G  (>  2)  groups: 

log-rank  statistic  ~  /2  with 
G  -  1  df 


We  will  not  describe  further  details  about  the 
calculation  of  the  log-rank  statistic,  because  a 
computer  program  can  easily  cany  out  the 
computations  from  the  basic  data  file.  Instead, 
we  illustrate  the  use  of  this  test  with  data 
involving  more  than  two  groups. 

If  the  number  of  groups  being  compared  is  G 
(>  2),  then  the  log-rank  statistic  has  approxi¬ 
mately  a  large  sample  chi-square  distribution 
with  G  —  1  degrees  of  freedom.  Therefore, 
the  decision  about  significance  is  made  using 
chi-square  tables  with  the  appropriate  degrees 
of  freedom. 


Approximation  formula: 


X2 


#ofgroups 

E 

i 


(Of  -  Ej)2 

E, 


Not  required  because  computer 
program  calculates  the  exact  log- 
rank  statistic 


The  approximate  formula  previously  described 
involving  only  observed  and  expected  values 
without  variance  or  covariance  calculations 
can  also  be  used  when  there  are  more  than 
two  groups  being  compared.  However,  practi¬ 
cally  speaking,  the  use  of  this  approximate 
formula  is  not  required  as  long  as  a  com¬ 
puter  program  is  available  to  calculate  the 
exact  log-rank  statistic. 


We  now  provide  an  example  to  illustrate  the 
use  of  the  log-rank  statistic  to  compare  more 
than  two  groups. 


EXAMPLE 


vets.dat:  survival  time  in  days, 
n  137 

Veteran’s  Administration  Lung  Cancer  Trial 

Column  1:  Treatment  (standard  =  1,  test  =  2) 
Column  2:  Cell  type  1  (large  =  1,  other  =  0) 
Column  3:  Cell  type  2  (adeno  =  1,  other  =  0) 
Column  4:  Cell  type  3  (small  =  1,  other  =  0) 
Column  5:  Cell  type  4  (squamous  —  1,  other  —  0) 
Column  6:  Survival  time  (days) 

Column  7:  (^Performance  Status 

(0  —  worst ...  100  =  best) 

Column  8:  Disease  duration  (months) 

Column  9:  Age 

Column  10:  Prior  therapy  (none  =  0,  some  —  1) 
Column  1 1 :  Status  (0  =  censored,  1  =  died) 


The  data  set  “vets.dat”  considers  survival  times 
in  days  for  137  patients  from  the  Veteran’s 
Administration  Lung  Cancer  Trial  cited  by 
Kalbfleisch  and  Prentice  in  their  text  ( The  Sta¬ 
tistical  Analysis  of  Survival  Time  Data,  John 
Wiley,  pp.  223-224,  1980).  A  complete  list  of 
the  variables  is  shown  here.  Failure  status  is 
defined  by  the  status  variable  (column  11). 

Among  the  variables  listed,  we  now  focus  on  the 
performance  status  variable  (column  7).  This 
variable  is  a  continuous  variable,  so  before  we 
can  obtain  KM  curves  and  the  log-rank  test, 
we  need  to  categorize  this  variable. 
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EXAMPLE:  (continued) 


Performance  Status  Categories 


Group  #  Categories  Size 


1 

0-59 

52 

2 

60-74 

50 

3 

75-100 

35 

KM  curves  for  performance 

status 

groups 


Events  Events 
Group  observed  expected 


1 

50 

26.30 

2 

47 

55.17 

3 

31 

46.53 

Total 

128 

128.00 

Log-rank 

=  chi2(2)  = 

:  29.18 

P-value  = 

Pr  >  chi2  = 

=  0.0000 

G  =  3  groups;  df  =  G 

-1=2 

Log-rank 

test  is  highly  significant. 

Conclude  significant  difference 
among  three  survival  curves. 


If,  for  the  performance  status  variable,  we  choose 
the  categories  0-59,  60-74,  and  75-100,  we 
obtain  three  groups  of  sizes  52,  50,  and  35, 
respectively. 


The  KM  curves  for  each  of  three  groups  are 
shown  here.  Notice  that  these  curves  appear 
to  be  quite  different.  A  test  of  significance 
of  this  difference  is  provided  by  the  log-rank 
statistic. 


An  edited  printout  of  descriptive  information 
about  the  three  KM  curves  together  with  the 
log-rank  test  results  are  shown  here.  These 
results  were  obtained  using  the  Stata  package. 


Because  three  groups  are  being  compared 
here,  G  =  3  and  the  degrees  of  freedom  for  the 
log-rank  test  is  thus  G  -  1,  or  2.  The  log-rank 
statistic  is  computed  to  be  29.181,  which  has  a 
P-value  of  zero  to  three  decimal  places.  Thus, 
the  conclusion  from  the  log-rank  test  is  that 
there  is  a  highly  significant  difference  among 
the  three  survival  curves  for  the  performance 
status  groups. 


VI.  Alternatives  to  the 
Log  Rank  Test 

Alternative  tests  supported  by  Stata 

Wilcoxen 
Tar  one -Ware 
Peto 

Flemington-Harrington 


There  are  several  alternatives  to  the  log  rank  test 
offered  by  Stata,  SAS,  SPSS,  and  R  designed 
to  test  the  hypothesis  that  two  or  more  sur¬ 
vival  curves  are  equivalent.  In  this  section  we 
describe  the  Wilcoxon,  the  Tarone-Ware,  the 
Peto,  and  the  Flemington-Harrington  test.  All 
of  these  tests  are  variations  of  the  log  rank  test 
and  are  easily  implemented  in  Stata. 
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Log  rank  uses 

°'  =  ( m‘f  ~  eif) 

f 

i  =  group  # 
f  =  fth  failure  time 


In  describing  the  differences  among  these  tests, 
recall  that  the  log  rank  test  uses  the  summed 
observed  minus  expected  score  O  -  E  in  each 
group  to  form  the  test  statistic.  This  simple  sum 
gives  the  same  weight  -  namely,  unity  -  to  each 
failure  time  when  combining  observed  minus 
expected  failures  in  each  group. 


Weighting  the  test  statistic  for  two 
groups 

Test  statistic: 

f 


The  Wilcoxon,  Tarone-Ware,  Peto,  and 
Flemington-Harrington  test  statistics  are  varia¬ 
tions  of  the  log  rank  test  statistic  and  are  derived 
by  applying  different  weights  at  the  f-th  failure 
time  (as  shown  on  the  left  for  two  groups). 


van 


i  =  1,  2 

f  =  fth  failure  time 

=  weight  at  fth  failure  time 


Wilcoxon  Test 

•  w(tf)  =  n f  (number  at  risk) 

•  Earlier  failures  receive  more 
weight 

•  Appropriate  if  treatment  effect 
is  strongest  in  earliest  phases  of 
administration 


The  Wilcoxon  test  (called  the  Breslow  test  in 
SPSS)  weights  the  observed  minus  expected 
score  at  time  tf  by  the  number  at  risk  rif,  over 
all  groups  at  time  tf.  Thus,  the  Wilcoxon  test 
places  more  emphasis  on  the  information 
at  the  beginning  of  the  survival  curve  where 
the  number  at  risk  is  large  allowing  early  fail¬ 
ures  to  receive  more  weight  than  later  failures. 
This  type  of  weighting  may  be  used  to  assess 
whether  the  effect  of  a  treatment  on  survival  is 
strongest  in  the  earlier  phases  of  administra¬ 
tion  and  tends  to  be  less  effective  over  time. 


Weights  Used  for  Various  Test 
Statistics 


Test  Statistic 

Log  rank 

1 

Wilcoxon 

nf 

Tarone-Ware 

V"f 

Peto 

Flemington- 

s(%-dF  . 

Harrington 

X 

i — i 

i 

GO: 

The  Tarone-Ware  test  statistic  also  applies  more 
weight  to  the  early  failure  times  by  weighting  the 
observed  minus  expected  score  at  time  fy)  by 
the  square  root  of  the  number  at  risk  ^fnj.  The 
Peto  test  weights  the  f-th  failure  time  by  the 
survival  estimate .?(%)  calculated  over  all  groups 
combined.  This  survival  estimate  is  similar 
but  not  exactly  equal  to  the  Kaplan-Meier  sur¬ 
vival  estimate.  The  Flemington-Harrington  test 
uses  the  Kaplan-Meier  survival  estimate  S(t)  over 
all  groups  to  calculate  its  weights  for  the  f-th 
failure  time,  Shi/vnfTl  —  S  (f(/_i))]'?.  The  weights 
for  each  of  these  test  statistics  are  summarized 
on  the  left. 
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Flemington-Harrington  Test 

w(0  =  (rlf-i))/’[1  -  $  (f(/-i))]? 

if  p  =  1  and  q  =  0,  w(t)  =  5(t(/-  i)) 
if  p  =  0  and  q  =  1 , 
w(t)  =  1  -  S(t(f_  d) 
if  p  =  0  and  q  =  0, 
w(t)  =  1  (log  rank  test) 


Comparisons  of  Test  Results: 
Remission  Data,  Testing 
Treatment  (RX) 


Test 

Chi-square 

(ldf) 

P-value 

Log  rank 

16.79 

0.0000 

Wilcoxon 

13.46 

0.0002 

Tarone- 

15.12 

0.0001 

Ware 

Peto 

14.08 

0.0002 

FH  (p  =  3, 

8.99 

0.0027 

q=  1) 

FH  (p=  1, 

12.26 

0.005 

q=  3) 

Vets  Data, 
Status 

3-Level  Performance 

Test 

Chi-square 
(2  df) 

P-value 

Log  rank 
Wilcoxon 

29.18 

46.10 

0.0000 

0.0000 

Remission  Data,  2-Level  Treatment 

Test 

Chi-square 

(ldf) 

P-value 

Log  rank 
Wilcoxon 

16.79 

13.46 

0.0000 

0.0002 

The  Flemington-Harrington  test  allows  the 
most  flexibility  in  terms  of  the  choice  of 
weights  because  the  user  provides  the  values 
of  p  and  q.  For  example,  if  p  =  1  and  q  =  0  then 
w(t)  =  ,S’(f (/-_,  ))  which  gives  more  weight  for  the 
earlier  survival  times  when  5(f (/-_,))  is  close  to 
one.  However,  if  p  =  0  and  q  =  1  then  w(t)  =  1  — 
S(t(f-  d)  in  which  case  the  later  survival  times 
receive  more  weight.  If  p  =  0  and  q  =  0  then 
w(t)  =  1,  and  the  Flemington-Harrington  test 
reduces  to  the  log  rank  test. 

On  the  left  is  a  comparison  of  test  results  for 
the  effect  of  treatment  (vs.  placebo)  using  the 
remission  data.  The  log  rank  chi-square  statis¬ 
tic  (also  displayed  previously  in  this  chapter)  is 
the  highest  among  these  tests  at  16.79.  The 
Flemington-Harrington  (FH)  test  with  p  =  3 
and  q  =  1  yielded  the  lowest  chi-square  value 
at  8.99,  although  with  this  weighting  it  is  not 
immediately  obvious  which  part  of  the  survival 
curve  is  getting  the  most  weight.  However,  all 
the  test  results  are  highly  significant  yielding  a 
similar  conclusion  to  reject  the  null  hypothesis. 


On  the  left  are  comparisons  of  the  log  rank  and 
Wilcoxon  tests  for  the  3-level  performance  sta¬ 
tus  variable  from  the  vets  dataset  discussed  in 
the  previous  section.  The  Wilcoxon  test  yields  a 
higher  chi-square  value  (46.10)  than  the  log 
rank  test  (29.18).  In  contrast,  the  log  rank  test 
for  the  effect  of  treatment  (RX)  from  the  remis¬ 
sions  data  yields  a  higher  chi-square  value 
(16.79)  than  the  Wilcoxon  test  (13.46).  How¬ 
ever,  both  the  Wilcoxon  and  log  rank  tests  are 
highly  significant  for  both  performance  status 
and  for  treatment  variables. 
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KM  curves  for  performance  status  groups 


A  comparison  of  survival  curves  gives  insight 
into  why  the  Wilcoxon  test  yields  a  higher  chi- 
square  value  than  the  log  rank  test  for  the  3- 
level  performance  status  variable.  The  3  curves 
being  compared  are  farthest  apart  in  the  early 
part  of  followup  before  becoming  closer  later. 
By  contrast,  a  comparison  of  the  2  curves  for 
treatment  shows  the  curves  diverging  over  time. 


KM  Plots  for  Remission  Data 


Choosing  a  Test 

•  Results  of  different  weightings 
usually  lead  to  similar 
conclusions 

•  The  best  choice  is  test  with  most 
power 

•  Power  depends  on  how  the  null 
is  violated 

•  There  may  be  a  clinical  reason  to 
choose  a  particular  weighting 

•  Choice  of  weighting  should  be  a 
priori 


In  general,  the  various  weightings  should  pro¬ 
vide  similar  results  and  will  usually  lead  to  the 
same  decision  as  to  whether  the  null  hypothe¬ 
sis  is  rejected.  The  choice  of  which  weighting 
of  the  test  statistic  to  use  (e.g.,  log  rank  or 
Wilcoxon)  depends  on  which  test  is  believed  to 
provide  the  greatest  statistical  power,  which 
in  turn  depends  on  how  it  is  believed  the  null 
hypothesis  is  violated. 

If  there  is  a  clinical  reason  to  believe  the  effect 
of  an  exposure  is  more  pronounced  toward  the 
beginning  (or  end)  of  the  survival  function, 
then  it  makes  sense  to  use  a  weighted  test  sta¬ 
tistic.  However,  one  should  make  an  a  priori 
decision  on  which  statistical  test  to  use 
rather  than  fish  for  a  desired  p-value.  Fishing 
for  a  desired  result  may  lead  to  bias. 
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Stratified  log  rank  test 

•  O  —  E  scores  calculated  within 
strata 

•  O  —  E  scores  then  summed 
across  strata 

•  Allows  control  of  stratified 
variable 


Stratified  log-rank  test 
->lwbc3  =  1 


1 

Events 

Events 

rx 

1  observed 

expected 

0 

1 

0 

2.91 

1 

1 

4 

1.09 

Total 

1 

4 

4.00 

1 

->lwbc3 

=  2 

1 

Events 

Events 

rx 

1 

observed 

expected 

0 

1 

5 

7.36 

1 

1 

5 

2.64 

Total 

1 

10 

10.00 

->lwbc3 

=  3 

1 

Events 

Events 

rx 

| observed 

expected 

0 

1 

4 

6.11 

1 

1 

12 

9.89 

Total 

1 

16 

16.00 

->Total 

| 

Events 

1 

Events 

expected 

rx 

| observed 

(*) 

0 

1 

9 

16.38 

1 

1 

21 

13.62 

Total 

1 

30 

30.00 

The  stratified  log  rank  test  is  another  variation 
of  the  log  rank  test.  With  this  test  the  summed 
observed  minus  expected  scores  O  —  E  are  cal¬ 
culated  within  strata  of  each  group  and  then 
summed  across  strata.  The  stratified  log  rank 
test  provides  a  method  of  testing  the  equi¬ 
valence  of  survival  curves  controlling  for 
the  stratified  variable.  An  example  of  the  stra¬ 
tified  log  rank  test  is  presented  next  using  the 
remission  data. 

On  the  left  is  Stata  output  from  performing  a 
stratified  log  rank  test  for  the  effect  of  treat¬ 
ment  (RX)  stratified  by  a  3 -level  variable 
(LWBC3)  indicating  low,  medium,  or  high  log 
white  blood  cell  count  (coded  1,  2,  and  3, 
respectively). 

Within  each  stratum  of  LWBC3,  the  expected 
number  of  events  is  calculated  for  the  treated 
group  (RX  =  0)  and  for  the  placebo  group 
(RX  =1).  The  total  expected  number  of  events 
for  the  treated  group  is  found  by  summing  the 
expected  number  of  events  over  the  three 
strata:  2.91  +  7.36  +  6.11  =  16.38.  Similarly 
the  total  expected  number  of  events  for  the  pla¬ 
cebo  group  is  calculated:  1.09  +  2.64  +  9.89  = 
13.62.  This  compares  to  9  observed  cases  from 
the  treated  group  and  21  observed  cases 
from  the  placebo  group  yielding  a  chi-square 
value  of  10.14  with  1  degree  of  freedom  (for 
2  levels  of  treatment)  and  a  corresponding 
p-value  of  0.0014. 

Recall  that  when  we  did  not  control  for  log 
white  blood  cell  count,  the  log  rank  test  for  the 
effect  of  treatment  yielded  a  chi-square  value  of 
16.79  and  a  corresponding  p-value  rounded 
to  0.0000. 
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Log  rank  unstratified 

O,  ~Ei  =  J2  ("Hr  ~  eif) 

j 

i  =  group  #,  f  =  fth  failure  time 

Log  rank  stratified 

Oj  —  Ej  =  "^2  (■ mifs  ~  eifs) 

‘  f 

i  =  group  #,  f  =  jth  failure  time, 
s  =  stratum  # 

Stratified  or  unstratified  (G  groups) 
Under  H0: 


The  only  difference  between  the  unstratified 
and  stratified  approaches  is  that  for  the  unstrat¬ 
ified  approach,  the  observed  minus  expected 
number  of  events  for  each  failure  time  is  sum¬ 
med  over  all  failure  times  for  each  group  (i). 
With  the  stratified  approach,  the  observed 
minus  expected  number  of  events  is  summed 
over  all  failure  times  for  each  group  within 
each  stratum  and  then  summed  over  all  strata. 
Either  way,  the  null  distribution  is  chi-square 
with  G  —  1  degrees  of  freedom,  where  G  repre¬ 
sents  the  number  of  groups  being  compared 
(not  the  number  of  strata). 


log  rank  statistic  ^/2  with 
G-ldf 


Can  stratify  with  other  tests 
Wilcoxon,  Tarone-Ware, 

Peto,  Flemington-Harrington 

Limitation 

Sample-size  may  be  small  within 
strata 


The  stratified  approach  can  also  be  applied  to 
any  of  the  weighted  variations  of  the  log  rank 
test  (e.g.,  Wilcoxon).  A  limitation  of  the  strati¬ 
fied  approach  is  the  reduced  sample  size  within 
each  stratum.  This  is  particularly  problematic 
with  the  remission  dataset,  which  has  a  small 
sample  size  to  begin  with. 


Alternatively 

Test  associations  using  modeling 

•  Can  simultaneously  control 
covariates 

•  Shown  in  next  chapter 


We  have  shown  how  the  stratified  log  rank  test 
can  be  used  to  test  the  effect  of  treatment  while 
controlling  for  log  white  blood  cell  count.  In 
the  next  chapter  we  show  how  modeling  can 
be  used  to  test  an  association  of  a  predictor 
variable  while  simultaneously  controlling  for 
other  covariates. 


VII.  Confidence  intervals 
for  KM  curves 


We  now  describe  how  to  calculate  (95%) 
confidence  intervals  (CIs)  for  the  estimated 
Kaplan-Meier  (KM)  curve. 


95%  Cl  for  the  KM  curve: 
Wt)  ±  1.96^Uar[Wt)] 


where  Greenwood’s  formula  for 

Var[SKM(t)]  is  given  by 


Var[SKu(X)\  —  (Skm  ft))  ^ 


YYlf 


The  95%  Cl  formula  for  estimated  KM  proba¬ 
bility  at  any  time  point  over  follow-up  has  the 
general  large  sample  form  shown  on  the  left, 
where  SKM(t)denotes  the  KM  survival  estimate 
at  time  t  and  Var[SKivi(t)]  denotes  variance  of 
Skm(0-  The  most  common  approach  used  to 
calculate  this  variance  uses  the  Greenwood’s 
formula,  also  shown  on  the  left. 


t(f)  =  f-ordered  failure  time 
mf  =  number  of  failures  at  t(f>, 
nf  =  number  in  the  risk  set  at  t(f>, 
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mf 


( weight  1 

V 


[Conditional  risk] 


nf  (nf-  mf)  nf  ^nf-  mf 


m  m 


t 

n 

m 

n(n  —  m) 

^n(n  -  m) 

S(t) 

Var[S(1 

0 

21 

0 

0 

0 

1 

0 

6 

21 

3 

0.0079 

0.0079 

0.857 

0.0058 

7 

17 

1 

0.0037 

0.0116 

0.807 

0.0076 

10 

15 

1 

0.0048 

0.0164 

0.753 

0.0093 

13 

12 

1 

0.0076 

0.0240 

0.690 

0.0114 

16 

11 

1 

0.0091 

0.0330 

0.628 

0.0130 

22 

7 

1 

0.0238 

0.0569 

0.538 

0.0164 

23 

6 

1 

0.0333 

0.0902 

0.448 

0.0181 

6  weeks:  0.857±1.96-\/(.857)2  (0.0079) 
=  0.857  ±0.149  =(.708,1.006) 


10  weeks: 


m3 

113(113  —  m3) 


1 

15(14) 


=  .0048 


The  summation  component  of  Greenwood's 
formula  essentially  gives  at  each  failure  time  t(f), 
a  weighted  (by  l/nf)  average  of  the  conditional 
risk  of  failing  at  those  failure  times  prior  to  t(p. 
Thus,  the  variance  formula  gives  the  square  of 
the  KM  coordinate  at  each  event  time  weighted 
by  the  cumulative  estimate  of  the  risk  at  time  t. 

We  illustrate  how  Greenwood's  variance  is  cal¬ 
culated  for  the  treatment  group  (Group  1)  of 
the  remission  times  data  described  earlier.  The 
layout  for  this  computation  is  shown  n  the  left. 

At  6  weeks,  the  estimate  of  the  survival  function 
is  0.857.  There  were  three  events  at  6  weeks  and 
2 1  patients  at  risk.  Therefore,  mf/nf(nf  —  mf)  = 
3/(21  x  18)  =  0.0079.  As  this  is  the  only  compo¬ 
nent  of  the  sum,  the  variance  is  then  0.0079  x 
0.8572  =  0.0058.  The  corresponding  95%  confi¬ 
dence  interval  is  shown  on  the  left,  where  the 
upper  level  should  be  modified  to  1 . 


E 

f:t(f)<t=10 


mf 

nf(nf  -  mf) 


=  0.0079  +  0.0037  +  0.0048  =  0.0164. 


Var[SKM(10)]  =  (0.753)2(.0164)  =  0.0093 
95%  Cl:  0.753  ±  1.96v/0.0093  =  (0.564,0.942) 


At  10  weeks,  the  estimate  of  the  survival  func¬ 
tion  is  0.753.  There  was  1  event  at  10  months 
and  15  patients  at  risk.  Therefore,  mf/nf(nf-mf) 
1/(15/14)  =  0.0048. 


Same  Cl  at  t  =11  and  12,  since  no 
events  occurred  at  those  times. 


There  were  two  other  risk  components  prior  to 
this  time,  0.0079  at  time  6  and  0.0037  at  time  7 
and  their  sum  is  0.01 64.  The  variance  at  1 0  weeks 
is  then  0. 0164x0. 7532,  which  equals  0.0093. 


The  95%  Cl  for  the  proportion  of  patients  at 
10  weeks  is  shown  on  the  left  to  have  the  limits 
(0.564,  0.942).  Note  that  since  the  variance  is 
only  defined  at  event  times,  the  95%  confidence 
interval  remains  the  same  at  11  and  12  weeks 
also. 


REMISSION 


Time 


On  the  left  we  show  the  KM  curves  and  their 
corresponding  95%  confidence  intervals  for 
Group  1  (treatment)  and  Group  2  (placebo) 
for  the  remission  time  data. 
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VIII.  Confidence  intervals 
for  the  median 
survival  time 

Remission  data  example: 

Group  1  median  =  8  weeks 
95%  Cl  ? 

Formula  for  95%  Cl  derived  from: 
(5km(M>  —  0.5)2 

Var|&u(M)] 

M  =  true  (unknown)  median  sur¬ 
vival  time,  i.e.,  SKm(M)  =  0.5 
Skm(M)  =  estimated  survival  prob¬ 
ability  from  KM  curve  at  the  time 
median  survival  time 
Var[SKM(M)]  uses  Greenwood’s 
formula. 

95%  Cl  for  median  survival: 

(SKM(t)  -  0.5)2  <  3MVar[SKM(t)\ 


Returning  again  to  the  remission  time  dataset, 
we  now  consider  the  calculation  of  the  95%  Cl 
for  the  median  of  the  remission  times  for  the 
placebo  group.  Recall  that  the  median  survival 
for  this  group  is  8  weeks. 

Brookmeyer  and  Crowley  (1982)  proposed  a 
simple  way  to  calculate  the  Cl  for  the  median 
survival  time  based  on  the  fact  that  the  square 
of  a  standardized  function  of  the  survival  curve 
around  the  true  (unknown)  median  value  (M) 
is  asymptotically  ^distributed.  This  relation¬ 
ship  is  shown  mathematically  on  the  left. 

Using  the  above  result  about  the  standar¬ 
dized  survival  curve,  a  general  formula  for  the 
95%  Cl  for  the  median  survival  is  provided 
by  the  inequality  expression  shown  on  the  left. 

The  times  for  which  this  inequality  holds  are 
plausible  values  of  the  true  median,  while 
the  boundaries  represent  upper  and  lower 
times  for  the  95%  Cl  for  the  median.  The 

lower  boundary  may  be  0  and  the  upper  bound¬ 
ary  may  not  always  exist. 


*» 

S(t) 

CS(t)-O.S)2 

3.84  Var(S(t)) 

Inequality 

satisfied? 

0 

1 

0.250 

- 

1 

0.90 

0.160 

0.016 

N 

2 

0.81 

0.096 

0.028 

N 

3 

0.76 

oo 

& 

© 

0.033 

N 

4 

0.67 

0..029 

0.041 

Y 

5 

0.57 

0/005 

0.045 

Y 

8 

0.38 

,0.014 

0.044 

Y 

11 

0.29 

./  0.44 

0.0$8 

N 

12 

0.1 9-' 

0.096 

0.028 

N 

15 

0.4*4 

0.130 

..4022 

N 

17 

0.10 

0.160  .... 

0.016 

N 

22 

•0.05 

0.203' 

0.008 

N 

23 

/  0.00 

0(050 

0 

N 

0.096  >  0.02$  so  the  inequality  is  not  satisfied 
0.014  >  0.o4^f  and  the  inequality  is  satisfied 


Caution  (ref  B&C,  1982):  upper 
limit  should  be  adjusted  to  reflect 
censoring,  e.g.,  SAS’s  LIFETEST 
adjusts  above  95%  Cl  from  (4,8) 
to  (4,  11). 


For  the  remission  time  data,  the  calculation  of 
the  Cl  around  the  median  of  8  weeks  is  given  in 
the  table  shown  on  the  left.  Since  the  inequality 
in  the  Cl  formula  is  satisfied  in  the  range  t=4 
weeks  to  8  weeks,  the  resulting  95%  Cl  is  (4,8). 

Brookmeyer  and  Crowley  (B&C)  caution  that 
the  upper  limit  for  these  estimates  should 
be  adjusted  to  reflect  censoring.  They  recom¬ 
mend  reporting  semiopen  intervals  that  extend 
one  event  beyond  the  event  that  satisfies  the 
inequality;  some  packages  (e.g.,  SAS,  R)  incor¬ 
porate  this  recommendation. 

In  this  example,  the  95%  Cl  for  the  median 
survival  time  obtained  from  SAS  output  is 
given  by  the  interval  (4,  11),  with  the  upper 
limit  now  extended  beyond  8  weeks  but  not 
including  the  event  at  month  1 1 . 
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IX.  Summary 

KM  curves: 

KM  curves: 


We  now  briefly  summarize  this  presentation. 
First,  we  described  how  to  estimate  and  graph 
survival  curves  using  the  Kaplan-Meier  (KM) 
method. 


t(fy  fth  ordered  failure  time 


/ 

=  ntr[T  >  t(i)\T  >  t(i)] 

i=  1 

=  S(t(f_  i)) 

x  Pr(T  >  t(f) \T  > 


To  compute  KM  curves,  we  must  form  a  data 
layout  that  orders  the  failure  times  from  smal¬ 
lest  to  largest.  For  each  ordered  failure  time, 
the  estimated  survival  probability  is  computed 
using  the  product  limit  formula  shown  here. 
Alternatively,  this  estimate  can  be  computed  as 
the  product  of  the  survival  estimate  for  the 
previous  failure  time  multiplied  by  the  condi¬ 
tional  probability  of  surviving  past  the  current 
failure  time. 


Log-rank  test: 


H0:  common  survival  curve  for 
all  groups 


Log  -  rank  statistic 


(Oi  -  E2)2 
Var(CL  —  Ei) 


log-rank  statistic  with  G  I  df 
under  H0 

G  =  #  of  groups 


When  survival  curves  are  being  compared,  the 
log-rank  test  gives  a  statistical  test  of  the  null 
hypothesis  of  a  common  survival  curve.  For 
two  groups,  the  log-rank  statistic  is  based  on 
the  summed  observed  minus  expected  scores 
for  a  given  group  and  its  variance  estimate.  For 
several  groups,  a  computer  should  always  be 
used  since  the  log-rank  formula  is  more  com¬ 
plicated  mathematically.  The  test  statistic  is 
approximately  chi-square  in  large  samples 
with  G  —  1  degrees  of  freedom,  where  G 
denotes  the  number  of  groups  being  compared. 


Greenwood’s  Variance  formula: 
Var[5KM(0]  =  (Skm(O) 


E 


mf 

nf(nf  -  mf) 


Large  sample  confidence  intervals  for  KM 
curves  can  be  computed  based  on  Greenwood’s 
formula,  shown  at  the  left,  for  the  variance  of 
an  estimated  KM  survival  probability. 


x 
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95%  Cl  for  KM: 

5km (t)  ±  1.96yVar[5KM(t)] 


The  expression  for  the  95%  confidence  interval 
is  shown  below  Greenwood’s  formula. 


95%  Cl  for  median  survival: 
(Wt)-  0.5)23.84Var[«t)] 


A  large  sample  confidence  interval  formula  for 
the  median  of  the  KM  curve  can  also  be  com¬ 
puted  using  the  inequality  formula  shown  here 
on  the  left.  The  upper  and  lower  boundaries  of 
t  for  which  this  inequality  holds  provide  the 
95%  confidence  limits. 


Chapters 


1.  Introduction 


/2. 


Kaplan-Meier  Survival  Curves 
and  the  Log-Rank  Test _ 


This  presentation  is  now  complete.  You  can 
review  this  presentation  using  the  detailed 
outline  that  follows  and  then  try  the  practice 
exercises  and  test. 


Next: 

3.  The  Cox  Proportional 
Hazards  Model  and  Its 
Characteristics 


Chapter  3  introduces  the  Cox  proportional 
hazards  (PH)  model,  which  is  the  most  popular 
mathematical  modeling  approach  for  estimat¬ 
ing  survival  curves  when  considering  several 
explanatory  variables  simultaneously. 
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Detailed 

Outline 


I.  Review  (pages  58-60) 

A.  The  outcome  variable  is  (survival)  time  until  an 
event  (failure)  occurs. 

B.  Key  problem:  censored  data,  i.e.,  don't  know 
survival  time  exactly. 

C.  Notation:  T  =  survival  time  random  variable 

t  =  specific  value  of  T 
d  =  (0,  1)  variable  for  failure/cen¬ 
sorship  status 
S(t)  =  survivor  function 
h(t)  =  hazard  function 

D.  Properties  of  survivor  function: 

i.  theoretically,  graph  is  smooth  curve, 
decreasing  from  S  (f)  =  1  at  time  t  =  0  to  S 
{t)  =  0  at  t  =  oo; 

ii.  in  practice,  graph  is  step  function. 

E.  Properties  of  h(t): 

i.  instantaneous  potential  for  failing  given 
survival  up  to  time; 

ii.  h(t)  is  a  rate;  ranges  from  0  to  oo. 

F.  Relationship  of  S  (/)  to  h(t):  if  you  know  one  you 
can  determine  the  other. 

G.  Goals  of  survival  analysis:  estimation  of  survi¬ 
vor  and  hazard  functions;  comparisons  and 
relationships  of  explanatory  variables  to  sur¬ 
vival. 

H.  Data  layouts 

i.  for  the  computer; 

ii.  for  understanding  the  analysis:  involves 

risk  sets. 


II.  An  Example  of  Kaplan-Meier  Curves  (pages  61-65) 

A.  Data  are  from  study  of  remission  times  in  weeks 
for  two  groups  of  leukemia  patients  (2 1  in  each 
group). 

B.  Group  1  (treatment  group)  has  several  censored 
observations,  whereas  group  2  has  no  censored 
observations. 

C.  Table  of  ordered  failure  times  is  provided  for 
each  group. 

D.  For  group  2  (all  noncensored),  survival  prob¬ 
abilities  are  estimated  directly  and  plotted. 
Formula  used  is 


%(/)) 


#  surviving  past  t(j) 
21 
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E.  Alternative  approach  for  group  2  is  given  by  a 

product  limit  formula. 

F.  For  group  1 ,  survival  probabilities  calculated  by 
multiplying  estimate  for  immediately  preceding 
failure  time  by  a  conditional  probability  of  sur¬ 
viving  past  current  failure  time,  i.e., 

%)  =  %- i)P''[T  >  t(f)\T  ^  ?(/)]- 


III.  General  Features  of  KM  Curves  (pages  66-67) 

A.  Two  alternative  general  formulae: 


s(f) 


npr[r 


i=  1 


>  t(j)  T  >  f(,) 


(product  limit 
formula) 


S(f)  =  Sif- 1)  Pr[r  >  f(f)  \T  ^  r(f). 


B.  Second  formula  derived  from  probability  rule: 
Pr(A  and  B)  =  Pr(A)  x  Pr(B|A) 

IV.  The  Log-Rank  Test  for  Two  Groups  (pages  67-71) 

A.  Large  sample  chi-square  test;  provides  overall 
comparison  of  KM  curves. 

B .  U ses  observed  versus  expected  counts  over  cate¬ 
gories  of  outcomes,  where  categories  are 
defined  by  ordered  failure  times  for  entire  set 
of  data. 

C.  Example  provided  using  remission  data  involv¬ 
ing  two  groups: 

i.  expanded  table  described  to  show  how 
expected  and  observed  minus  expected  cell 
counts  are  computed. 

ii.  for  zth  group  at  time  /'  where  i  =  1  or  2: 
observed  counts  =  m,y, 

expected  counts  =  e,y,  where 

expected  counts  =  (proportion  in  risk  set)  x 

(#  failures  over  both  groups), 


D.  Log-rank  statistic  for  two  groups: 

{Oi  -  e,02~ 

Var(0,  -  Ej) 1 


where  i  =  1.2 
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°i  -  Ei  =  ^2  {mif  -  eif),  and 
/ 

Var(0,  -  Ei) 

_  -^2  nVn2f(mif  +  mZ f)  («i/  +  «2/  -  mif  -  mif) 
f  («i/  +  n2f)  («i/  +  n2f  -  l) 

i  =  l,  2 

E.  H0:  no  difference  between  survival  curves. 

F.  Log-rank  statistic  ~  y2  with  1  df  under  H0. 

G.  Approximate  formula: 

X 2  =  - — - — ,  where  G  =  2  =  #  of  groups 

Ei 

H.  Remission  data  example:  Log-rank  statistic  = 
16.793,  whereas  X2  =  15.276. 

V.  The  Log-Rank  Test  for  Several  Groups 

(pages  71-73) 

A.  Involves  variances  and  covariances;  matrix  for¬ 
mula  in  Appendix. 

B.  Use  computer  for  calculations. 

C.  Under  H0,  log-rank  statistic  with  G  -  1  df, 
where  G  =  #  of  groups. 

D.  Example  provided  using  vets.dat  with  interval 
variable  “performance  status”;  this  variable  is 
categorized  into  G  =  3  groups,  so  df  for  log- 
rank  test  is  G  -  1  =  2,  log-rank  statistic  is 
29.181  (P  =  0.0). 

VI.  Alternatives  to  the  Log-Rank  Test  (pages  73-78) 

A.  Alternative  tests  supported  by  Stata:  Wilcoxen, 
Tarone-Ware,  Peto,  and  Flemington-Harring- 
ton. 

B.  Alternative  tests  differ  by  applying  different 
weights  at  the  j-th  failure  time. 

C.  The  choice  of  alternative  depends  on  the  reason 
for  the  belief  that  the  effect  is  more  pronounced 
towards  the  beginning  (or  end)  of  the  survival 
function. 

D.  The  stratified -log-rank  test  is  a  variation  of  the 
log-rank  test  that  controls  for  one  or  more  stra¬ 
tified  variables. 
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VII.  Confidence  Intervals  for  KM  Curves 

(pages  78-79) 

A.  General  form  of  95%  Cl: 

WO  ±  \.96^Var[SKMm 

B.  Var[SKM(t)]uses  Greenwood’s  formula: 
Var[SKMm  =  (SKM(t))2  V  — T 

/fe  L */(«/  -  mf)\ 

C.  Example  using  Remission  Time  Data 

VIII.  Confidence  Intervals  for  the  Median  Survival 
Time  (page  80) 

A.  General  form  of  95%  Cl  where  values  of  t  satis¬ 
fying  the  following  inequality  provide  confi¬ 
dence  limits: 

{SKM(t)  -  0.5)2<  3MVar[SKM(t)] 

B.  Example  using  Remission  Time  Data 

IX.  Summary  (page  81) 
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Practice 

Exercises 


1.  The  following  data  are  a  sample  from  the  1967-1980 
Evans  County  study.  Survival  times  (in  years)  are  given 
for  two  study  groups,  each  with  25  participants.  Group  1 
has  no  history  of  chronic  disease  (CHR  =  0),  and  group  2 
has  a  positive  history  of  chronic  disease  (CHR  =1): 

Group  1  (CHR  =  0):  12.3+,  5.4,  8.2,  12.2+,  11.7,  10.0, 

5.7,  9.8,  2.6,  11.0,  9.2,  12. 1+, 

6.6,  2.2,  1.8,  10.2,  10.7,  11.1, 

5.3,  3.5,  9.2,  2.5,  8.7,  3.8,  3.0 

Group  2  (CHR  =  1):  5.8,  2.9,  8.4,  8.3,  9.1,  4.2,  4.1,  1.8, 
3.1,  11.4,  2.4,  1.4,  5.9,  1.6,  2.8, 

4.9,  3.5,  6.5,  9.9,  3.6,  5.2,  8.8, 

7.8,  4.7,  3.9 

a.  Fill  in  the  missing  information  in  the  following 
table  of  ordered  failure  times  for  groups  1  and  2: 


Group  1  Group  2 


kf) 

nf 

ntf 

<7f 

S(%) 

kf) 

nf 

nif 

(If 

S(k») 

0.0 

25 

0 

0 

1.00 

0.0 

25 

0 

0 

1.00 

1.8 

25 

1 

0 

.96 

1.4 

25 

1 

0 

.96 

2.2 

24 

1 

0 

.92 

1.6 

24 

1 

0 

.92 

2.5 

23 

1 

0 

.88 

1.8 

23 

1 

0 

.88 

2.6 

22 

1 

0 

.84 

2.4 

22 

1 

0 

.84 

3.0 

21 

1 

0 

.80 

2.8 

21 

1 

0 

.80 

3.5 

20 

( 

) 

2.9 

20 

1 

0 

.76 

3.8 

19 

1 

0 

.72 

3.1 

19 

1 

0 

.72 

5.3 

18 

1 

0 

.68 

3.5 

18 

1 

0 

.68 

5.4 

17 

1 

0 

.64 

3.6 

17 

1 

0 

.64 

5.7 

16 

1 

0 

.60 

3.9 

6.6 

15 

1 

0 

.56 

4.1 

8.2 

14 

1 

0 

.52 

4.2 

8.7 

13 

1 

0 

.48 

4.7 

13 

1 

0 

.48 

9.2 

( 

4.9 

12 

1 

0 

.44 

9.8 

10 

1 

0 

.36 

5.2 

11 

1 

0 

.40 

10.0 

9 

1 

0 

.32 

5.8 

10 

1 

0 

.36 

10.2 

8 

1 

0 

.28 

5.9 

9 

1 

0 

.32 

10.7 

7 

1 

0 

.24 

6.5 

8 

1 

0 

.28 

11.0 

6 

1 

0 

.20 

7.8 

7 

1 

0 

.24 

11.1 

5 

1 

0 

.16 

8.3 

6 

1 

0 

.20 

11.7 

4 

( 

_ ) 

8.4 

5 

1 

0 

.16 

8.8 

4 

1 

0 

.12 

9.i 

' 

9.9 

11.4 

1 

1 

0 

.00 

b.  Based  on  your  results  in  part  a,  plot  the  KM  curves 
for  groups  1  and  2  on  the  same  graph.  Comment  on 
how  these  curves  compare  with  each  other. 
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c.  Fill  in  the  following  expanded  table  of  ordered  failure 
times  to  allow  for  the  computation  of  expected  and 
observed  minus  expected  values  at  each  ordered  fail¬ 
ure  time.  Note  that  your  new  table  here  should  com¬ 
bine  both  groups  of  ordered  failure  times  into  one 
listing  and  should  have  the  following  format: 


Totals  22  25 
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d.  Use  the  results  in  part  c  to  compute  the  log-rank 
statistic.  Use  this  statistic  to  carry  out  the  log-rank 
test  for  these  data.  What  is  your  null  hypothesis  and 
how  is  the  test  statistic  distributed  under  this  null 
hypothesis?  What  are  your  conclusions  from  the 
test? 

2.  The  following  data  set  called  "anderson.dat”  consists 
of  remission  survival  times  on  42  leukemia  patients, 
half  of  whom  get  a  certain  new  treatment  therapy  and 
the  other  half  of  whom  get  a  standard  treatment 
therapy.  The  exposure  variable  of  interest  is  treatment 
status  ( Rx  =  0  if  new  treatment,  Rx  1  if  standard 
treatment).  Two  other  variables  for  control  as  potential 
confounders  are  log  white  blood  cell  count  (i.e., 
logwbc)  and  sex.  Failure  status  is  defined  by  the  relapse 
variable  (0  if  censored,  1  if  failure).  The  data  set  is  listed 
as  follows: 


Subj 

Survt 

Relapse 

Sex 

log  WBC 

Rx 

1 

35 

0 

1 

1.45 

0 

2 

34 

0 

1 

1.47 

0 

3 

32 

0 

1 

2.20 

0 

4 

32 

0 

1 

2.53 

0 

5 

25 

0 

1 

1.78 

0 

6 

23 

1 

1 

2.57 

0 

7 

22 

1 

1 

2.32 

0 

8 

20 

0 

1 

2.01 

0 

9 

19 

0 

0 

2.05 

0 

10 

17 

0 

0 

2.16 

0 

11 

16 

1 

1 

3.60 

0 

12 

13 

1 

0 

2.88 

0 

13 

11 

0 

0 

2.60 

0 

14 

10 

0 

0 

2.70 

0 

15 

10 

1 

0 

2.96 

0 

16 

9 

0 

0 

2.80 

0 

17 

7 

1 

0 

4.43 

0 

18 

6 

0 

0 

3.20 

0 

19 

6 

1 

0 

2.31 

0 

20 

6 

1 

1 

4.06 

0 

21 

6 

1 

0 

3.28 

0 

22 

23 

1 

1 

1.97 

1 

23 

22 

1 

0 

2.73 

1 

24 

17 

1 

0 

2.95 

1 

25 

15 

1 

0 

2.30 

1 

26 

12 

1 

0 

1.50 

1 

27 

12 

1 

0 

3.06 

1 

28 

11 

1 

0 

3.49 

1 

( Continued  on  next  page) 
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Subj 

Survt 

Relapse 

Sex 

log  WBC 

Rx 

29 

11 

1 

0 

2.12 

1 

30 

8 

1 

0 

3.52 

1 

31 

8 

1 

0 

3.05 

1 

32 

8 

1 

0 

2.32 

1 

33 

8 

1 

1 

3.26 

1 

34 

5 

1 

1 

3.49 

1 

35 

5 

1 

0 

3.97 

1 

36 

4 

1 

1 

4.36 

1 

37 

4 

1 

1 

2.42 

1 

38 

3 

1 

1 

4.01 

1 

39 

2 

1 

1 

4.91 

1 

40 

2 

1 

1 

4.48 

1 

41 

1 

1 

1 

2.80 

1 

42 

1 

1 

1 

5.00 

1 

a.  Suppose  we  wish  to  describe  KM  curves  for  the 
variable  logwbc.  Because  logwbc  is  continuous, 
we  need  to  categorize  this  variable  before  we  com¬ 
pute  KM  curves.  Suppose  we  categorize  logwbc 
into  three  categories  -  low,  medium,  and  high  -  as 
follows: 

low  (0-2.30),  n  =  11; 
medium  (2.31-3.00),  n  =  14; 
high  (>3.00),  n  =  17. 

Based  on  this  categorization,  compute  and  graph 
KM  curves  for  each  of  the  three  categories  of 
logwbc.  (You  may  use  a  computer  program  to  assist 
you  or  you  can  form  three  tables  of  ordered  failure 
times  and  compute  KM  probabilities  directly.) 

b.  Compare  the  three  KM  plots  you  obtained  in  part  a. 
How  are  they  different? 

c.  Below  is  an  edited  printout  of  the  log-rank  test 
comparing  the  three  groups. 


Events  Events 
Group  observed  expected 

1  4  13.06 

2  10  10.72 

3  _ 16 _ 6.21 

Total  30  30.00 


Log-rank  =  chi2(2)  =  26.39 
P-value  =  Pr  >  chi2  =  0.0000 


What  do  you  conclude  about  whether  or  not  the 
three  survival  curves  are  the  same? 
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Test 


To  answer  the  questions  below,  you  will  need  to  use  a 
computer  program  (from  SAS,  Stata,  SPSS,  R  or  any 
other  package  you  are  familiar  with)  that  computes  and 
plots  KM  curves  and  computes  the  log-rank  test.  Freely 
downloadable  files  can  be  obtained  from  weblink  http:// 
www.  sph.  emory .  edu/ dkleinb/ surv3  .htm . 

1.  For  the  vets.dat  data  set  described  in  the  presentation: 

a.  Obtain  KM  plots  for  the  two  categories  of  the  vari¬ 
able  cell  type  1(1=  large,  0  =  other).  Comment  on 
how  the  two  curves  compare  with  each  other.  Cany 
out  the  log-rank,  and  draw  conclusions  from  the 
test(s). 

b.  Obtain  KM  plots  for  the  four  categories  of  cell  type  - 
large,  adeno,  small,  and  squamous.  Note  that  you 
will  need  to  recode  the  data  to  define  a  single  vari¬ 
able  which  numerically  distinguishes  the  four  cate¬ 
gories  (e.g.,  1  =  large,  2  =  adeno,  etc.).  As  in  part  a, 
compare  the  four  KM  curves.  Also,  carry  out  the 
log-rank  for  the  equality  of  the  four  curves  and 
draw  conclusions. 

2.  The  following  questions  consider  a  data  set  from  a 
study  by  Caplehorn  et  al.  (“Methadone  Dosage  and 
Retention  of  Patients  in  Maintenance  Treatment,” 
Med.  J.  Aust.,  1991).  These  data  comprise  the  times  in 
days  spent  by  heroin  addicts  from  entry  to  departure 
from  one  of  two  methadone  clinics.  There  are  two  fur¬ 
ther  covariates,  namely,  prison  record  and  methadone 
dose,  believed  to  affect  the  survival  times.  The  data  set 
name  is  addicts.dat.  A  listing  of  the  variables  is  given 
below: 

Column  1:  Subject  ID 
Column  2:  Clinic  (1  or  2) 

Column  3:  Survival  status  (0  =  censored,  1  =  departed 
from  clinic) 

Column  4:  Survival  time  in  days 

Column  5:  Prison  record  (0  =  none,  1  =  any) 

Column  6:  Methadone  dose  (mg/day) 
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a.  Compute  and  plot  the  KM  plots  for  the  two  categories 
of  the  “clinic”  variable  and  comment  on  the  extent  to 
which  they  differ. 

b.  A  printout  of  the  log-rank  and  Wilcoxon  tests  (using 
Stata)  is  provided  below.  What  are  your  conclusions 
from  this  printout? 


Log-rank  =  chi2(l)  =  27.89 
P-value  =  Pr  >  chi2  =  0.0000 
Wilcoxon  =  chi2(l)  =  11.63 
P-value  =  Pr  >  chi2  =  0.0007 


c.  Compute  and  evaluate  KM  curves  and  the  log-rank 
test  for  comparing  suitably  chosen  categories  of  the 
variable  "Methadone  dose.”  Explain  how  you  deter¬ 
mined  the  categories  for  this  variable. 
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1.  a. 


Group  1  Group  2 


kf) 

nf 

tHf 

<?/■ 

S(%) 

kf) 

nf 

nif 

(If 

sikn) 

0.0 

25 

0 

0 

1.00 

0.0 

25 

0 

0 

1.00 

1.8 

25 

1 

0 

.96 

1.4 

25 

1 

0 

.96 

2.2 

24 

1 

0 

.92 

1.6 

24 

1 

0 

.92 

2.5 

23 

1 

0 

.88 

1.8 

23 

1 

0 

.88 

2.6 

22 

1 

0 

.84 

2.4 

22 

1 

0 

.84 

3.0 

21 

1 

0 

.80 

2.8 

21 

1 

0 

.80 

3.5 

20 

(1 

0 

■76) 

2.9 

20 

1 

0 

.76 

3.8 

19 

1 

0 

.72 

3.1 

19 

1 

0 

.72 

5.3 

18 

1 

0 

.68 

3.5 

18 

1 

0 

.68 

5.4 

17 

1 

0 

.64 

3.6 

17 

1 

0 

.64 

5.7 

16 

1 

0 

.60 

3.9 

16 

1 

0 

.60' 

6.6 

15 

1 

0 

.56 

4.1 

15 

1 

0 

.56 

8.2 

14 

1 

0 

.52 

4.2 

14 

1 

0 

■52, 

8.7 

13 

1 

0 

.48 

4.7 

13 

1 

0 

.48 

9.2 

ax: 

2 

0 

“ 040) 

4.9 

12 

1 

0 

.44 

9.8 

10 

1 

0 

.36 

5.2 

11 

1 

0 

.40 

10.0 

9 

1 

0 

.32 

5.8 

10 

1 

0 

.36 

10.2 

8 

1 

0 

.28 

5.9 

9 

1 

0 

.32 

10.7 

7 

1 

0 

.24 

6.5 

8 

1 

0 

.28 

11.0 

6 

1 

0 

.20 

7.8 

7 

1 

0 

.24 

11.1 

5 

1 

0 

.16 

8.3 

6 

1 

0 

.20 

11.7 

4 

cn 

3 

■  12) 

8.4 

5 

1 

0 

.16 

8.8 

4 

1 

0 

.12 

9.1 

3 

1 

0 

.08' 

9.9 

,2 

1 

0 

11.4 

1 

1 

0 

.00 

b.  KM  curves  for  CHR  data: 


Group  1  appears  to  have  consistently  better  survival 
prognosis  than  group  2.  However,  the  KM  curves 
are  very  close  during  the  first  4  years,  but  are  quite 
separate  after  4  years,  although  they  appear  to 
come  close  again  around  12  years, 
c.  Using  the  expanded  table  format,  the  following 
information  is  obtained: 
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t(f> 

mif  m2f\ 

n if  n2f\ 

eif 

e2f 

mif  —  e 

if  m2f  -  - 

e2f 

1.4 

0 

1 

25 

25 

.500 

.500 

-.500 

.500 

1.6 

0 

1 

25 

24 

.510 

.490 

-.510 

.510 

1.8 

1 

1 

25 

23 

1.042 

.958 

-.042 

.042 

2.2 

1 

0 

24 

22 

.522 

.478 

.478 

-.478 

2.4. 

0 

1 

23 

22 

.511 

.489 

-.511 

.511 

2.5. 

1 

0 

23 

21 

.523 

.477 

.477 

- .477 

2.6 

1 

0 

22 

21 

.516 

.484 

.484 

-.484 

2.8 

0 

1 

21 

21 

.500 

.500 

-.500 

.500 

2.9 

0 

1 

21 

20 

.512 

.488 

-.512 

.512 

3.0 

1 

0 

21 

19 

.525 

.475 

.475 

-.475 

3.1 

0 

1 

20 

19 

.513 

.487 

-.513 

.513 

3.5 

1 

1 

20 

18 

1.053 

.947 

-.053 

.053 

3.6 

0 

1 

19 

17 

.528 

.472 

-.528 

.528 

3.8 

1 

0 

19 

16 

.543 

.457 

.457 

-.457 

3.9 

0 

1 

18 

16 

.529 

.471 

-.529 

.529 

4.1 

0 

1 

18 

15 

.545 

.455 

-.545 

.545 

4.2 

0 

1 

18 

14 

.563 

.437 

-.563 

.563 

4.7 

0 

1 

18 

13 

.581 

.419 

-.581 

.581 

4.9 

0 

1 

18 

12 

.600 

.400 

-.600 

.600 

5.2 

0 

1 

18 

11 

.621 

.379 

-.621 

.621 

5.3 

1 

0 

18 

10 

.643 

.357 

.357 

-.357 

5.4 

1 

0 

17 

10 

.630 

.370 

.370 

-.370 

5.7 

1 

0 

16 

10 

.615 

.385 

.385 

-.385 

5.8 

0 

1 

15 

10 

.600 

.400 

-.600 

.600 

5.9 

0 

1 

15 

9 

.625 

.375 

-.625 

.625 

6.5 

0 

1 

15 

8 

.652 

.348 

-.652 

.652 

6.6 

1 

0 

15 

7 

.682 

.318 

.318 

-.318 

7.8 

0 

1 

14 

7 

.667 

.333 

-.667 

.667 

8.2 

1 

0 

14 

6 

.700 

.300 

.300 

-.300 

8.3 

0 

1 

13 

6 

.684 

.316 

-.684 

.684 

8.4 

0 

1 

13 

5 

.722 

.278 

-.722 

.722 

8.7 

1 

0 

13 

4 

.765 

.235 

.335 

-.335 

8.8 

0 

1 

12 

4 

.750 

.250 

-.750 

.750 

9.1 

0 

1 

12 

3 

.800 

.200 

-.800 

.800 

9.2 

2 

0 

12 

2 

1.714 

.286 

.286 

-.286 

9.8 

1 

0 

10 

2 

.833 

.167 

.167 

-.167 

9.9 

0 

1 

9 

2 

.818 

.182 

-.818 

.818 

10.0 

1 

0 

9 

1 

.900 

.100 

.100 

-.100 

10.2 

1 

0 

8 

1 

.888 

.112 

.112 

-.112 

10.7 

1 

0 

7 

1 

.875 

.125 

.125 

-.125 

11.0 

1 

0 

6 

1 

.857 

.143 

.143 

-.143 

11.1 

1 

0 

5 

1 

.833 

.167 

.167 

-.167 

11.4 

0 

1 

4 

1 

.800 

.200 

-.800 

.800 

11.7 

1 

0 

4 

0 

1.000 

.000 

.000 

.000 

30.79  16.21  (-8.790  8.790) 


Totals  22  25 
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d.  The  log-rank  statistic  can  be  computed  from  the 
totals  of  the  expanded  table  using  the  formulae: 


log-rank  statistic 
Var (Of  -  E,) 


(Of  -  Ej)2 

Var(0,  -  E,) 


nifn2f(mif  +  mv)  (nv  +  nv  ~  mV  -  nh-f) 
i  (n  if  +  n2f ) 2  (n  1/  +  n2f  -  1 ) 


The  variance  turns  out  to  be  9.658,  so  that  the  log- 
rank  statistic  is  (8.79)2/9.658  =  7.993. 

Using  Stata,  the  results  for  the  log-rank  test  are 
given  as  follows: 


Events  Events 
Group  observed  expected 

1  22  30.79 

2  25  16.21 

Total  47  47.00 


Log-rank  =  chi2(l)  =  7.99 
P-value  =  Pr  >  chi2  =  0.0047 


The  log-rank  test  gives  highly  significant  results. 
This  indicates  that  there  is  a  significant  difference 
in  survival  between  the  two  groups. 

2.  a.  For  the  Anderson  dataset,  the  KM  plots  for  the  three 
categories  of  log  WBC  are  shown  below: 


Weeks 


b.  The  KM  curves  are  quite  different  with  group  1 
having  consistently  better  survival  prognosis  than 
group  2,  and  group  2  having  consistently  better 
survival  prognosis  than  group  3.  Note  also  that  the 
difference  between  group  1  and  2  is  about  the  same 
over  time,  whereas  group  2  appears  to  diverge  from 
group  3  as  time  increases. 

c.  The  log-rank  statistic  (26.391)  is  highly  significant 
with  P-values  equal  to  zero  to  three  decimal  places. 
These  results  indicate  that  there  is  some  overall 
difference  between  the  three  curves. 
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Appendix: 
Matrix 
Formula 
for  the 
Log-Rank 
Statistic  for 
Several 
Croups 


For  z  =  1,  2,. . G  andf  =  1,  2,. . .,  k,  where  G  =  #  of  groups 
and  k  =  #  of  distinct  failure  times, 

rijf  =  #  at  risk  in  z'th  group  at  fth  ordered  failure  time 

m,y  =  observed  #  of  failures  in  z'th  group  at  fth  ordered 
failure  time 

e,y  =  expected  #  of  failures  in  z'th  group  at  fth  ordered  failure 
time 


f  +  «2// 


nf  =  n‘f 

i=  1 
G 

mf  = 


i=  1 


<9,  =  (m‘f  ~  e>f) 

f=  i 

Var(0,  -  £,)  =  £  f: ^  “  ”»>' 


/=1 


(«/  -  !) 


Cov(0,  -  E„  O,  -  £,)  =  £  f 

/TT  v  W  - !)  , 

d  =  (Cf  —  Zq,  O2  —  £2?  •  •  •  5  Gg-i  —  £c-i)' 

v  =  (M) 


where  v„-  =  Var  (O,  -  £,)  and  va  =  Cov  (O,  -  Eh  Oi  -  Et)  for  i 
=  1,  2,...,  G  -  l;/  =  1,  2,...,  G  -  1. 


Then,  the  log-rank  statistic  is  given  by  the  matrix  product 
formula: 

Log-rank  statistic  =  d'V  1  d 


which  has  approximately  a  chi-square  distribution  with 
G  —  1  degrees  of  freedom  under  the  null  hypothesis  that 
all  G  groups  have  a  common  survival  curve. 
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Introduction  We  begin  by  discussing  some  computer  results  using  the 

Cox  PH  model,  without  actually  specifying  the  model; 
the  purpose  here  is  to  show  the  similarity  between  the 
Cox  model  and  standard  linear  regression  or  logistic 
regression. 

We  then  introduce  the  Cox  model  and  describe  why  it  is  so 
popular.  In  addition,  we  describe  its  basic  properties, 
including  the  meaning  of  the  proportional  hazards 
assumption  and  the  Cox  likelihood.  We  also  describe  how 
and  why  we  might  consider  using  “age  as  the  time  scale” 
instead  of  “time-on  follow-up”  as  the  outcome  variable. 

Abbreviated  The  outline  below  gives  the  user  a  preview  of  the  material 

Outline  to  be  covered  by  the  presentation.  A  detailed  outline  for 

review  purposes  follows  the  presentation. 

I.  A  computer  example  using  the  Cox  PH  model 

(pages  100-108) 

II.  The  formula  for  the  Cox  PH  model 

(pages  108-110) 

III.  Why  the  Cox  PH  model  is  popular  ( pages  110-112) 

IV.  ML  estimation  of  the  Cox  PH  model 

(pages  1 12-114) 

V.  Computing  the  hazard  ratio  ( pages  114-117) 

VI.  Interval  estimation:  interaction  ( pages  117-119) 

VII.  Adjusted  survival  curves  using  the  Cox  PH  model 

(pages  120-123) 

VIII.  The  meaning  of  the  PH  assumption 

(pages  123-127) 

IX.  The  Cox  likelihood  ( pages  127-131) 

X.  Using  age  as  the  time  scale  ( pages  131-142) 

XI.  Summary  (pages  143-144) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  the  general  form  of  the  Cox  PH 
model. 

2.  State  the  specific  form  of  a  Cox  PH  model  appropriate 
for  the  analysis,  given  a  survival  analysis  scenario 
involving  one  or  more  explanatory  variables. 

3.  State  or  recognize  the  form  and  properties  of  the 
baseline  hazard  function  in  the  Cox  PH  model. 

4.  Give  three  reasons  for  the  popularity  of  the  Cox  PH 
model. 

5.  State  the  formula  for  a  designated  hazard  ratio  of 
interest  given  a  scenario  describing  a  survival  analysis 
using  a  Cox  PH  model,  when 

a.  there  are  confounders  but  no  interaction  terms  in 
the  model; 

b.  there  are  both  confounders  and  interaction  terms 
in  the  model. 

6.  State  or  recognize  the  meaning  of  the  PH  assumption. 

7.  Determine  and  explain  whether  the  PH  assumption  is 
satisfied  when  the  graphs  of  the  hazard  functions  for 
two  groups  cross  each  other  over  time. 

8.  State  or  recognize  what  is  an  adjusted  survival  curve. 

9.  Compare  and/or  interpret  two  or  more  adjusted 
survival  curves. 

10.  Given  a  computer  printout  involving  one  or  more 
fitted  Cox  PH  models, 

a.  compute  or  identify  any  hazard  ratio(s)  of  interest; 

b.  cany  out  and  interpret  a  designated  test  of 
hypothesis; 

c.  carry  out,  identify  or  interpret  a  confidence 
interval  for  a  designated  hazard  ratio; 

d.  evaluate  interaction  and  confounding  involving 
one  or  more  co variates. 

1 1 .  Give  an  example  of  how  the  Cox  PH  likelihood  is 
formed. 

12.  Given  left  truncated  survival  data,  describe  how  and 
when  you  would  consider  using  “age  as  the  time  scale” 
instead  of  “time-on  follow-up”  as  the  outcome 
variable. 

13.  Given  left-truncated  survival  data,  state  the  hazard 
function  formula  that  uses  “age  as  the  time  scale”  as 
the  outcome  variable. 

14.  Illustrate  the  difference  between  an  “open  cohort”  and 
a  “closed  cohort”. 
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Presentation 


This  presentation  describes  the  Cox  proportional 
hazards  (PH)  model,  a  popular  mathematical 
model  used  for  analyzing  survival  data.  Here, 
we  focus  on  the  model  form,  why  the  model  is 
popular,  maximum  likelihood  (ML)  estimation 
of  the  model  parameters,  the  formula  for  the 
hazard  ratio,  how  to  obtain  adjusted  survival 
curves,  and  the  meaning  of  the  PH  assumption. 


I.  A  Computer  Example 
Using  the  Cox  PH 
Model 


EXAMPLE 


Leukemia  Remission  Data 


Group  1(«  =  21)  Group  2(n  =  21) 


t  (weeks)  log  WBC 

t  (weeks)  log  WBC 

6 

2.31 

1 

2.80 

6 

4.06 

1 

5.00 

6 

3.28 

2 

4.91 

7 

4.43 

2 

4.48 

10 

2.96 

3 

4.01 

13 

2.88 

4 

4.36 

16 

3.60 

4 

2.42 

22 

2.32 

5 

3.49 

23 

2.57 

5 

3.97 

6+ 

3.20 

8 

3.52 

9+ 

2.80 

8 

3.05 

10+ 

2.70 

8 

2.32 

11+ 

2.60 

8 

3.26 

17+ 

2.16 

11 

3.49 

19+ 

2.05 

11 

2.12 

20+ 

2.01 

12 

1.50 

25+ 

1.78 

12 

3.06 

32+ 

2.20 

15 

2.30 

32+ 

2.53 

17 

2.95 

34+ 

1.47 

22 

2.73 

35+ 

1.45 

23 

1.97 

+  denotes  censored  observation 


We  introduce  the  Cox  PH  model  using  com¬ 
puter  output  from  the  analysis  of  remission 
time  data  (Freireich  et  al.,  Blood,  1963),  which 
we  previously  discussed  in  Chapters  1  and  2. 
The  data  set  is  listed  here  at  the  left. 

These  data  involve  two  groups  of  leukemia 
patients,  with  21  patients  in  each  group. 
Group  1  is  the  treatment  group,  and  group  2  is 
the  placebo  group.  The  data  set  also  contains 
the  variable  log  WBC,  which  is  a  well-known 
prognostic  indicator  of  survival  for  leukemia 
patients. 

For  this  example,  the  basic  question  of  interest 
concerns  comparing  the  survival  experience  of 
the  two  groups  adjusting  for  the  possible  con¬ 
founding  and/or  interaction  effects  of  log  WBC. 
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We  are  thus  considering  a  problem  involving 
two  explanatory  variables  as  predictors  of  sur¬ 
vival  time  T,  where  T  denotes  “weeks  until 
going  out  of  remission.”  We  label  the  explana¬ 
tory  variables  X,  (for  group  status)  and  X2  (for 
log  WBC).  The  variable  Xt  is  the  primary  study 
or  exposure  variable  of  interest.  The  variable  X2 
is  an  extraneous  variable  that  we  are  including 
as  a  possible  confounder  or  effect  modifier. 

Note  that  if  we  want  to  evaluate  the  possible 
interaction  effect  of  log  WBC  on  group  status, 
we  would  also  need  to  consider  a  third  variable, 
that  is,  the  product  of  X1  and  X2. 

For  this  dataset,  the  computer  results  from  fit¬ 
ting  three  different  Cox  proportional  hazards 
models  are  presented  below.  The  computer  pack¬ 
age  used  is  Stata.  This  is  one  of  several  packages 
that  have  procedures  for  carrying  out  a  survival 
analysis  using  the  Cox  model.  The  information 
printed  out  by  different  packages  will  not  have 
exactly  the  same  format,  but  they  will  provide 
similar  information.  A  comparison  of  output 
using  Stata,  SAS,  SPSS,  and  R  procedures  on 
Edited  Output  From  Stata:  the  same  dataset  is  provided  in  the  computer 

appendix  at  the  back  of  this  text. 

Model  1: 


Coef. 

Std.  Err.  z 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Rx  1.509 

0.410  3.68 

0.000 

4.523 

2.027 

10.094 

No.  of  subjects 

=  42 

Log  likelihood  =  —86.380 

Prob  >  chi2  = 

0.0001 

Model  2: 

Coef. 

Std.  Err.  z 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Rx 

1.294 

0.422  3.07 

0.002 

3.648 

1.595 

8.343 

log  WBC 

1.604 

0.329  4.87 

0.000 

4.975 

2.609 

9.486 

No.  of  subjects 

=  42 

Log  likelihood  —  —72.280 

Prob  >  chi2  — 

0.0000 

Model  3: 

Coef. 

Std.  Err.  z 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Rx 

2.355 

1.681  1.40 

0.161 

10.537 

0.391 

284.201 

log  WBC 

1.803 

0.447  4.04 

0.000 

6.067 

2.528 

14.561 

Rx  x  log  WBC 

-0.342 

0.520  -0.66 

0.510 

0.710 

0.256 

1.967 

No.  of  subjects 

=  42 

Log  likelihood  =  —72.066 

Prob  >  chi2  = 

-  0.0000 

EXAMPLE:  (continued) 


T  =  weeks  until  going  out  of  remission 

Xi  =  group  status  =  E 

X2  =  log  WBC  (confounding?) 


Interaction? 

X3  =  X\  x  X2  —  group  status  x  log 
WBC 


Computer  results  for  three  Cox  PH 
models  using  the  Stata  package 

Other  computer  packages  provide 
similar  information. 

Computer  Appendix:  uses  Stata,  SAS, 
and  SPSS  on  the  same  dataset. 
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We  now  describe  how  to  use  the  computer 
printout  to  evaluate  the  possible  effect  of  treat¬ 
ment  status  on  remission  time  adjusted  for  the 
potential  confounding  and  interaction  effects 
of  the  covariate  log  WBC.  For  now,  we  focus 
only  on  five  columns  of  information  provided 
in  the  printout,  as  presented  at  the  left  for  all 
three  models. 

For  each  model,  the  first  column  identifies  the 
variables  that  have  been  included  in  the  model. 
The  second  column  gives  estimates  of  regres¬ 
sion  coefficients  corresponding  to  each  vari¬ 
able  in  the  model.  The  third  column  gives 
standard  errors  of  the  estimated  regression 
coefficients.  The  fourth  column  gives  p-values 
for  testing  the  significance  of  each  coefficient. 
The  fifth  column,  labeled  as  Haz.  Ratio,  gives 
ec°ef  £or  eacj1  variat>le  in  each  model. 

As  we  discuss  later  in  this  chapter,  eCoef  gives  an 
estimated  hazard  ratio  (HR)  for  the  effect  of 
each  variable  adjusted  for  the  other  variables  in 
a  model  (e.g.  Models  1  and  2)  without  product 
terms.  With  product  terms  such  as  Rx  x  log 
WBC  in  Model  3,  the  hazard  ratio  formula  is 
more  complicated,  as  we  also  discuss  later. 

Except  for  the  Haz.  Ratio  column,  these  com¬ 
puter  results  are  typical  of  output  found  in  stan¬ 
dard  linear  regression  printouts.  As  the  printout 
suggests,  we  can  analyze  the  results  from  a  Cox 
model  in  a  manner  similar  to  the  way  we  would 
analyze  a  linear  regression  model. 

We  now  distinguish  among  the  output  for  the 
three  models  shown  here.  All  three  models  are 
using  the  same  remission  time  data  on  42  sub¬ 
jects.  The  outcome  variable  for  each  model  is 
the  same:  time  in  weeks  until  a  subject  goes 
out  of  remission.  However,  the  independent 
variables  are  different  for  each  model.  Model  1 
contains  only  the  treatment  status  variable, 
indicating  whether  a  subject  is  in  the  treat¬ 
ment  or  placebo  group.  Model  2  contains  two 
variables,  treatment  status  and  log  WBC.  And 
model  3  contains  an  interaction  term  defined  as 
the  product  of  treatment  status  and  log  WBC. 


EXAMPLE:  (continued) 


Same  dataset  for  each  model 
n  =  42  subjects 

T  =  time  (weeks)  until  out  of  remission 

Model  1:  Rx  only 

Model  2:  Rx  and  log  WBC 

Model  3:  Rx,  log  WBC,  and 
Rx  x  log  WBC 


EDITED  OUTPUT  FROM  STATA 

Model  1: 

Coef. 

Std.  Err. 

p>|z| 

Haz.  Ratio 

Rx  1.509 

0.410 

0.000 

4.523 

No.  of  subjects  =  42  Log  likelihood  =  -86.380 

Hazard  ratios 


Model  2: 


Coef. 

Std.  Err. 

p>|z| 

Haz.  Ratio 

Rx  1.294 

0.422 

0.002 

3.648 

log  WBC  1.604 

0.329 

0.000 

4.975 

No.  of  subjects  = 

42  Log  likelihood  = 

-72.280 

Model  3: 

Coef.  Std.  Err.  p  >  |z|  Haz.  Ratio 


Rx  2.355  1.681 

0.161 

10.537 

log  WBC  1.803  0.447 

0.000 

6.067 

Rx  x  log  WBC  -0.342  0.520 

0.510 

0.710 

No.  of  subjects  =  42  Log  likelihood  =  - 

-72.066 

Models  1  and  2:  ecoef  =  HR 
Model  3:  HR  formula  more 
complicated 
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EDITED  OUTPUT  ML  ESTIMATION 

Model  3: 

Coef. 

Std. 

Err. 

P>|z| 

Haz. 

Ratio 

Rx 

2.355 

1.681 

0.161 

10.537 

log  WBC 

1.803 

0.447 

0.000 

6.067 

Rx  x  log  WBC 

-0.342 

0.520 

(0.510) 

0.710 

No.  of  subjects 

=  42  |Log  likelihood  =  — 72.066| 

We  now  focus  on  the  output  for  model  3.  The 
method  of  estimation  used  to  obtain  the  coeffi¬ 
cients  for  this  model,  as  well  as  the  other  two 
models,  is  maximum  likelihood  (ML)  estima¬ 
tion.  Note  that  a  p-value  of  0.5 1 0  is  obtained  for 
the  coefficient  of  the  product  term  for  the  inter¬ 
action  of  treatment  with  log  WBC.  This  p-value 
indicates  that  there  is  no  significant  inter¬ 
action  effect,  so  that  we  can  drop  the  product 
term  from  the  model  and  consider  the  other 
two  models  instead. 


EXAMPLE:  (continued) 


P  =  0.510  :  °'^'2  =  -0.66  =  Z  Wald 

statistic  _0-5-0 
LR  statistic:  uses  Log  likelihood  = 
-72.066 

— 2  In  L  (log  likelihood  statistic)  = 
-2  x  (-72.066)  =  144.132 


The  p-value  of  0.510  that  we  have  just  des¬ 
cribed  is  obtained  by  dividing  the  coefficient 
-0.342  of  the  product  term  by  its  standard 
error  of  0.520,  which  gives  —0.66,  and  then 
assuming  that  this  quantity  is  approximately 
a  standard  normal  or  Z  variable.  This  Z  statistic 
is  known  as  a  Wald  statistic,  which  is  one  of 
two  test  statistics  typically  used  with  ML  esti¬ 
mates.  The  other  test  statistic,  called  the  likeli¬ 
hood  ratio,  or  Li?  statistic,  makes  use  of  the  log 
likelihood  statistic.  The  log  likelihood  statistic 
is  obtained  by  multiplying  the  "Log  likelihood” 
in  the  Stata  output  by  —2  to  get  -2  In  L. 


I  Edited  Output 

Model  2: 

Coef.  Std.  Err. 

p  >  M 

Haz.  Ratio 

Rx  1.294  0.422 

0.002 

3.648 

log  WBC  1.604  0.329 

0.000 

4.975 

No.  of  subjects  =  42  |Log  likelihood  = 

-72.280| 

-2  In  L  =  — 2x  (-72.280)  =  144.550 

We  now  look  at  the  output  for  model  2,  which 
contains  two  variables.  The  treatment  status 
variable  (Rx)  represents  the  exposure  variable 
of  primary  interest.  The  log  WBC  variable  is 
being  considered  as  a  confounder.  Our  goal  is 
to  describe  the  effect  of  treatment  status 
adjusted  for  log  WBC.  Note  that  for  Model  2, 
—2  In  L  equals  144.550. 


EXAMPLE:  (continued) 


LR  (interaction  in  model  3) 

—  2  In  Lmodel  2  (  2  In  Lmodel  3) 

In  general: 

LR  =  -2  In  Lr  -  (-2  In  LF) 


To  use  the  likelihood  ratio  (LR)  statistic  to 
test  the  significance  of  the  interaction  term, 
we  need  to  compute  the  difference  between 
the  log  likelihood  statistic  of  the  reduced 
model  which  does  not  contain  the  interaction 
term  (model  2)  and  the  log  likelihood  statistic 
of  the  full  model  containing  the  interaction 
term  (model  3).  In  general,  the  LR  statistic 
can  be  written  in  the  form  —2  In  Lw  minus 
—2  In  Lf,  where  R  denotes  the  reduced  model 
and  F  denotes  the  full  model. 
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To  obtain  the  LR  statistic  in  this  example,  we 
compute  144.550  minus  144. 132  to  obtain  0.428. 
Under  the  null  hypothesis  of  no  interaction 
effect,  the  test  statistic  has  a  chi-square  distribu¬ 
tion  with  p  degrees  of  freedom,  where  p  denotes 
the  number  of  predictors  being  assessed.  The 
p-value  for  this  test  is  between  0.40  and  0.50, 
which  indicates  no  significant  interaction. 
Although  the  p-values  for  the  Wald  test  (0.510) 
and  the  LR  test  are  not  exactly  the  same,  both 
p-values  lead  to  the  same  conclusion. 

In  general,  the  LR  and  Wald  statistics  may  not 
give  exactly  the  same  answer.  Statisticians 
have  shown  that  of  the  two  test  procedures, 
the  LR  statistic  has  better  statistical  properties, 
so  when  in  doubt,  you  should  use  the  LR  test. 

We  now  focus  on  how  to  assess  the  effect  of 
treatment  status  adjusting  for  log  WBC  using 
the  model  2  output,  again  shown  here. 


There  are  three  statistical  objectives  typically 
considered.  One  is  to  test  for  the  significance 
of  the  treatment  status  variable,  adjusted  for 
log  WBC.  Another  is  to  obtain  a  point  estimate 
of  the  effect  of  treatment  status,  adjusted  for 
log  WBC.  And  a  third  is  to  obtain  a  confidence 
interval  for  this  effect.  We  can  accomplish 
these  three  objectives  using  the  output  pro¬ 
vided,  without  having  to  explicitly  describe 
the  formula  for  the  Cox  model  being  used. 

To  test  for  the  significance  of  the  treatment 
effect,  the  p-value  provided  in  the  table  for  the 
Wald  statistic  is  0.002,  which  is  highly  signifi¬ 
cant.  Alternatively,  a  likelihood  ratio  (LR)  test 
could  be  performed  by  comparing  the  log  like¬ 
lihood  statistic  (144.559)  for  model  2  with  the 
log  likelihood  statistic  for  a  model  which  does 
not  contain  the  treatment  variable.  This  latter 
model,  which  should  contain  only  the  log  WBC 
variable,  is  not  provided  here,  so  we  will  not 
report  on  it  other  than  to  note  that  the  LR  test 
is  also  very  significant.  Thus,  these  test  results 
show  that  using  model  2,  the  treatment  effect 
is  significant,  after  adjusting  for  log  WBC. 


EXAMPLE:  (continued) 


Test  for  treatment  effect: 

Wald  statistic:  P  =  0.002  (highly 

significant) 

LR  statistic:  compare 
—2  log  L  from  model  2  with 
—2  log  L  from  model  without  Rx 
variable 

^Printout  not  provided  here 

Conclusion:  treatment  effect  is 
significant,  after  adjusting  for  log  WBC 


OUTPUT  ! 

Model  2: 

Coef. 

Std.  Err.  p  >  |z| 

Haz.  Ratio 

Rx  (1.294) 

0.422  (0.002) 

(3.648) 

log  WBC  1.604 

0.329  0.000 

4.975 

No.  of  subjects  = 

42  |Log  likelihood  —  - 

-72.280| 

Three  statistical  objectives. 

1 .  test  for  significance  of  effect 

2.  point  estimate  of  effect 

3.  confidence  interval  for  effect 


LR  ^  Wald 

When  in  doubt,  use  the  LR  test. 


EXAMPLE:  (continued) 


LR  (interaction  in  model  3) 

=  2  In  Cmotjei  2  (  2  In  Cmo(je]  3) 

=  (-2  x  -72.280)  -  (-2  x  -72.066) 
=  144.550  -  144.132  =  0.428 

{LR  is  x2  with  1  d.f.  under  H0:  no 
interaction.) 

0.40  <  P  <  0.50,  not  significant 
Wald  test  P  =  0.510 
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EXAMPLE:  (continued) 


Point  estimate: 


HR  =  3.648 


7 


Coefficient  of  treatment  variable 


A  point  estimate  of  the  effect  of  the  treatment  is 
provided  in  the  HR  column  by  the  value  3.648. 
This  value  gives  the  estimated  hazard  ratio 
(HR)  for  the  effect  of  the  treatment;  in  particu¬ 
lar,  we  see  that  the  hazard  for  the  placebo 
group  is  3.6  times  the  hazard  for  the  treatment 
group.  Note  that  the  value  3.648  is  calculated 
as  e  to  the  coefficient  of  the  treatment  variable; 
that  is,  e  to  the  1.294  equals  3.648. 


To  describe  the  confidence  interval  for  the  effect 
of  treatment  status,  we  consider  the  output  for 
the  extended  table  for  model  2  given  earlier. 


1  Output  I 

Model  2: 

Coef. 

Std.  Err. 

z  P>|z| 

Haz.  Ratio 

|[95%  Conf.  Interval]  | 

Rx 

|l,294 

0.422| 

3.07  0.002 

3.648 

11.595  8.3431 

Log  WBC 

1.604 

0.329 

4.87  0.000 

4.975 

2.609  9.486 

No.  of  subjects 

=  42 

Log  likelihood  =  —72.280 

Prob  >  chi2 

=  0.0000 

EXAMPLE:  (continued) 


95%  confidence  interval  for  the  HR: 
(1.595,  8.343) 


95%  Cl  for  HR  =  eft 

exp  [Pj  ±  1.96Sp  ]  =^±1.96(0.422) 


From  the  table,  we  see  that  a  95%  confidence 
interval  for  the  treatment  effect  is  given  by 
the  range  of  values  1.595-8.343.  This  is  a  confi¬ 
dence  interval  for  the  hazard  ratio  (HR),  which 
suiTounds  the  point  estimate  of  3.648  previ¬ 
ously  described.  Notice  that  this  confidence 
interval  is  fairly  wide,  indicating  that  the  point 
estimate  is  somewhat  unreliable.  As  expected 
from  the  low  p-value  of  0.002,  the  confidence 
interval  for  HR  does  not  contain  the  null  value 
of  1. 

The  calculation  of  the  confidence  interval  for 
HR  is  carried  out  as  follows: 

1 .  Compute  a  95%  confidence  interval  for  the 
regression  coefficient  of  the  Rx  variable 
(Pi).  The  large  sample  formula  is  1.294  plus 
or  minus  1.96  times  the  standard  error 
0.422,  where  1.96  is  the  97.5  percentile  of 
the  standard  normal  or  Z  distribution. 

2.  Exponentiate  the  two  limits  obtained  for 
the  confidence  interval  for  the  regression 
coefficient  of  Rx. 


106  3.  The  Cox  Proportional  Hazards  Model  and  Its  Characteristics 

Stata:  provides  Cl  directly  The  Stata  output  provides  the  required  confi¬ 

dence  interval  directly,  so  that  the  user  does 
Other  packages:  provide  (3  and  not  have  to  carry  out  the  computations 

required  by  the  large  sample  formula.  Other 
computer  packages  may  not  provide  the  confi¬ 
dence  interval  directly,  but,  rather,  may  pro¬ 
vide  only  the  estimated  regression  coefficients 
and  their  standard  errors. 

To  this  point,  we  have  made  use  of  information 
from  outputs  for  models  2  and  3,  but  have  not 
yet  considered  the  model  1  output,  which  is 
shown  again  here.  Note  that  model  1  contains 
only  the  treatment  status  variable,  whereas 
model  2,  shown  below,  contains  log  WBC  in 
addition  to  treatment  status.  Model  1  is  some¬ 
times  called  the  “crude”  model  because  it 
ignores  the  effect  of  potential  covariates  of 
interest,  like  log  WBC. 


Model  1  can  be  used  in  comparison  with  model 
2  to  evaluate  the  potential  confounding  effect 
of  the  variable  log  WBC.  In  particular,  notice 
that  the  value  in  the  HR  column  for  the  treat¬ 
ment  status  variable  is  4.523  for  model  1,  but 
only  3.648  for  model  2.  Thus,  the  crude  model 
yields  an  estimated  hazard  ratio  that  is  some¬ 
what  higher  than  the  corresponding  estimate 
obtained  when  we  adjust  for  log  WBC.  If  we 
decide  that  the  crude  and  adjusted  estimates 
are  meaningfully  different,  we  then  say  that 
there  is  confounding  due  to  log  WBC. 

Once  we  decide  that  confounding  is  present, 
we  then  must  control  for  the  confounder,  in 
this  case,  log  WBC,  in  order  to  obtain  a  valid 
estimate  of  the  effect.  Thus,  we  prefer  model  2, 
which  controls  for  log  WBC,  to  model  1,  which 
does  not. 

Note  that  if  we  had  decided  that  there  is  no 
"meaningful”  confounding,  then  we  would  not 
need  to  control  for  log  WBC  to  get  a  valid  answer. 
Nevertheless,  we  might  wish  to  control  for  log 
WBC  anyhow,  to  obtain  a  more  precise  estimate 
of  the  hazard  ratio.  That  is,  if  the  confidence 
interval  for  the  HR  is  narrower  when  using 
model  2  than  when  using  model  1,  we  would 
prefer  model  2  to  model  1  for  precision  gain. 


EDITED  OUTPUT 


Model  1: 

Coef.  Std.  Err.  p  >  |z|  Haz.  Ratio 
Rx  1.509  0.410  0.000  [43231 

No.  of  subjects  =  42  Log  likelihood  =  —86.380 

Model  2: 

Haz. 

Coef.  Std.  Err.  p  >  |z|  Ratio 

Rx  1.294  0.422  0.002  133481 

log  WBC  1.604  0.329  0.000  4.975 

No.  of  subjects  =  42  Log  likelihood  =  —72.280 


EXAMPLE:  (continued) 


HR  for  model  1  (4.523)  is  higher  than 
HR  for  model  2  (3.648) 

Confounding:  crude  versus  adjusted 
HR  are  meaningfully  different. 


Confounding  due  to  log  WBC  =>  must 
control  for  log  WBC,  i.e.,  prefer  model 
2  to  model  1 . 


If  no  confounding,  then  consider 
precision:  e.g.,  if  95%  Cl  is  narrower 
for  model  2  than  model  1,  we  prefer 
model  2. 
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EDITED  OUTPUT:  Confidence 
Intervals 


[95%  Conf.  Interval] 

Rx  model  1 

2.027  10.094 

width  =  8.067 

width  =  6.748 

Rx  model  2 

1.595  8.343 

log  WBC 

2.609  9.486 

The  confidence  intervals  for  Rx  in  each  model 
are  shown  here  at  the  left.  The  interval  for  Rx  in 
model  1  has  width  equal  to  10.094  minus  2.027, 
or  8.067;  for  model  2,  the  width  is  8.343  minus 
1.595,  or  6.748.  Therefore,  model  2  gives  a 
more  precise  estimate  of  the  hazard  ratio  than 
does  model  1 . 


EXAMPLE:  (continued) 


Model  2  is  best  model. 

HR  =  3.648  statistically  significant 
95%  Cl  for  HR:  (1.6,  8.3) 


Our  analysis  of  the  output  for  the  three  models 
has  led  us  to  conclude  that  model  2  is  the  best 
of  the  three  models  and  that,  using  model  2,  we 
get  a  statistically  significant  hazard  ratio  of 
3.648  for  the  effect  of  the  treatment,  with  a 
95%  confidence  interval  ranging  between  1.6 
and  8.3. 


Cox  model  formulae  not  specified 

Analysis  strategy  and  methods  for 
Cox  model  analogous  to  those  for 
logistic  and  classical  linear  models. 


EXAMPLE:  (continued) 


Survival  Curves  Adjusted  for  log  WBC 
(Model  2) 

k>) 


Note  that  we  were  able  to  carry  out  this  analy¬ 
sis  without  actually  specifying  the  formulae  for 
the  Cox  PH  models  being  fit.  Also,  the  strategy 
and  methods  used  with  the  output  provided 
have  been  completely  analogous  to  the  strategy 
and  methods  one  uses  when  fitting  logistic 
regression  models  (see  Kleinbaum  and  Klein, 
Logistic  Regression,  Chapters  6  and  7,  2010), 
and  very  similar  to  carrying  out  a  classical  lin¬ 
ear  regression  analysis  (see  Kleinbaum  et  al., 
Applied  Regression  Analysis,  4th  ed.,  Chapter  16, 
2008). 

In  addition  to  the  above  analysis  of  this  data,  we 
can  also  obtain  survival  curves  for  each  treat¬ 
ment  group,  adjusted  for  the  effects  of  log 
WBC  and  based  on  the  model  2  output.  Such 
curves,  sketched  here  at  the  left,  give  additional 
information  to  that  provided  by  estimates  and 
tests  about  the  hazard  ratio.  In  particular,  these 
curves  describe  how  the  treatment  groups  com¬ 
pare  over  the  time  period  of  the  study. 


For  these  data,  the  survival  curves  show  that 
the  treatment  group  consistently  has  higher 
survival  probabilities  than  the  placebo  group 
after  adjusting  for  log  WBC.  Moreover,  the 
difference  between  the  two  groups  appears  to 
widen  over  time. 
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Adjusted  survival 
curves 

KM  curves 

Adjusted  for 

No  covariates 

covariates 

Use  fitted  Cox 

No  Cox  model 

model 

fitted 

Note  that  adjusted  survival  curves  are  mathe¬ 
matically  different  from  Kaplan-Meier  (KM) 
curves.  KM  curves  do  not  adjust  for  covariates 
and,  therefore,  are  not  computed  using  results 
from  a  fitted  Cox  PH  model. 

Nevertheless,  for  these  data,  the  plotted  KM 
curves  (which  were  described  in  Chapter  2)  are 
similar  in  appearance  to  the  adjusted  survival 
curves. 


Remainder: 

•  Cox  model  formula 

•  basic  characteristics  of  Cox 
model 

•  meaning  of  PH  assumption 


In  the  remainder  of  this  presentation,  we  des¬ 
cribe  the  Cox  PH  formula  and  its  basic  charac¬ 
teristics,  including  the  meaning  of  the  PH 
assumption  and  the  Cox  likelihood. 


II.  The  Formula  for  the 
Cox  PH  Model 


E  px 

h(t,X)  =  ho(t)e‘=l 


x  =  (xux2,...xp) 

explanatory/predictor  variables 


The  Cox  PH  model  is  usually  written  in  terms 
of  the  hazard  model  formula  shown  here  at 
the  left.  This  model  gives  an  expression  for 
the  hazard  at  time  t  for  an  individual  with  a 
given  specification  of  a  set  of  explanatory  vari¬ 
ables  denoted  by  the  bold  X.  That  is,  the  bold  X 
represents  a  collection  (sometimes  called  a 
“vector”)  of  predictor  variables  that  is  being 
modeled  to  predict  an  individual’s  hazard. 


ho(t)  x  ei=1 


Baseline 

Exponential 

hazard 

Involves  t 

Involves  X’s  but  not 

but  not  X’s 

t  (X’s  are  time- 

independent) 

The  Cox  model  formula  says  that  the  hazard  at 
time  t  is  the  product  of  two  quantities.  The  first 
of  these,  h0(t),  is  called  the  baseline  hazard 
function.  The  second  quantity  is  the  exponen¬ 
tial  expression  e  to  the  linear  sum  of  (TX,,  where 
the  sum  is  over  the  p  explanatory  X  variables. 

An  important  feature  of  this  formula,  which  con¬ 
cerns  the  proportional  hazards  (PH)  assump¬ 
tion,  is  that  the  baseline  hazard  is  a  function 
of  t,  but  does  not  involve  the  Xs.  In  contrast, 
the  exponential  expression  shown  here,  involves 
the  X’s,  but  does  not  involve  t.  The  X’s  here  are 
called  time-independent  X’s. 
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X’s  involving  t:  time-dependent 

Requires  extended  Cox  model 
(no  PH) 


Time-dependent  variables: 
Chapter  6 


Time-independent  variable: 
Values  for  a  given  individual 
do  not  change  over  time;  e.g., 
SEX  and  SMK 


\ 


Assumed  not  to  change  once 
measured 


AGE  and  WGT  values  do  not 
change  much,  or  effect  on  survival 
depends  on  one  measurement. 


Xi=  X2  =  ■■■=  Xp=  0 

h(t,X)  =  ho(t)  ei=i 
=  h0(t)  e° 

=  h0(t) 

Baseline  hazard 


No  Xs  in  model:  h(t,X)  =  h0  ( t ). 


h0(t)  is  unspecified. 

Cox  model:  semiparametric 


It  is  possible,  nevertheless,  to  consider  Xs  which 
do  involve  t.  Such  Xs  are  called  time-dependent 
variables.  If  time-dependent  variables  are  con¬ 
sidered,  the  Cox  model  form  may  still  be  used, 
but  such  a  model  no  longer  satisfies  the  PH 
assumption,  and  is  called  the  extended  Cox 
model. 

The  use  of  time-dependent  variables  is  discussed 
in  Chapter  6.  For  the  remainder  of  this  presenta¬ 
tion,  we  will  consider  time-independent  Xs  only. 

A  time-independent  variable  is  defined  to  be 
any  variable  whose  value  for  a  given  individual 
does  not  change  over  time.  Examples  are  SEX 
and  smoking  status  (SMK).  Note,  however, 
that  a  person’s  smoking  status  may  actually 
change  over  time,  but  for  purposes  of  the  anal¬ 
ysis,  the  SMK  variable  is  assumed  not  to 
change  once  it  is  measured,  so  that  only  one 
value  per  individual  is  used. 

Also  note  that  although  variables  like  AGE  and 
weight  (WGT)  change  over  time,  it  may  be 
appropriate  to  treat  such  variables  as  time- 
independent  in  the  analysis  if  their  values  do 
not  change  much  over  time  or  if  the  effect  of 
such  variables  on  survival  risk  depends  essen¬ 
tially  on  the  value  at  only  one  measurement. 

The  Cox  model  formula  has  the  property  that  if 
all  the  Xs  are  equal  to  zero,  the  formula 
reduces  to  the  baseline  hazard  function.  That 
is,  the  exponential  part  of  the  formula  becomes 
e  to  the  zero,  which  is  1 .  This  property  of  the 
Cox  model  is  the  reason  why  h0(t)  is  called  the 
baseline  function. 

Or,  from  a  slightly  different  perspective,  the 
Cox  model  reduces  to  the  baseline  hazard 
when  no  X’s  are  in  the  model.  Thus,  h0(t)  may 
be  considered  as  a  starting  or  “baseline”  version 
of  the  hazard  function,  prior  to  considering  any 
of  the  Xs. 

Another  important  property  of  the  Cox  model 
is  that  the  baseline  hazard,  h0{t),  is  an  unspeci¬ 
fied  function.  It  is  this  property  that  makes  the 
Cox  model  a  semiparametric  model. 
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1  Example:  Parametric  Model  1 

Weibull: 
h  (t,  X)  =  Xp tp- 

-1 

where  X  =  exp 

E  PA, 

and  h0  (f)  =  p tp  1 

In  contrast,  a  parametric  model  is  one  whose 
functional  form  is  completely  specified,  except 
for  the  values  of  the  unknown  parameters.  For 
example,  the  Weibull  hazard  model  is  a  para¬ 
metric  model  and  has  the  form  shown  here, 
where  the  unknown  parameters  are  A,  p,  and 
the  Pi’s.  Note  that  for  the  Weibull  model,  h0  ( t ) 
is  given  by  Aptp_1  (see  Chapter  7). 


Semiparametric  property 

a- 

Popularity  of  the  Cox  model 


One  of  the  reasons  why  the  Cox  model  is  so 
popular  is  that  it  is  semiparametric.  We  discuss 
this  and  other  reasons  in  the  next  section  (III) 
concerning  why  the  Cox  model  is  so  widely 
used. 


HI.  Why  the  Cox  PH 
Model  Is  Popular 

Cox  PH  model  is  “robust”:  Will 
closely  approximate  correct 
parametric  model 


If  correct  model  is: 

Cox  model  will 
Weibull  =>■  approximate 
Weibull 
Cox  model  will 

Exponential  =>■  approximate 
exponential 


A  key  reason  for  the  popularity  of  the  Cox  model 
is  that,  even  though  the  baseline  hazard  is  not 
specified,  reasonably  good  estimates  of  regres¬ 
sion  coefficients,  hazard  ratios  of  interest,  and 
adjusted  survival  curves  can  be  obtained  for  a 
wide  variety  of  data  situations.  Another  way 
of  saying  this  is  that  the  Cox  PH  model  is  a 
"robust”  model,  so  that  the  results  from  using 
the  Cox  model  will  closely  approximate  the 
results  for  the  correct  parametric  model. 

For  example,  if  the  correct  parametric  model  is 
Weibull,  then  use  of  the  Cox  model  typically 
will  give  results  comparable  to  those  obtained 
using  a  Weibull  model.  Or,  if  the  correct  model 
is  exponential,  then  the  Cox  model  results  will 
closely  approximate  the  results  from  fitting  an 
exponential  model. 


Prefer  parametric  model  if  sure  of  We  would  prefer  to  use  a  parametric  model  if  we 
correct  model,  e.g.,  use  goodness-  were  sure  of  the  correct  model.  Although  there 
of-fit  test  (Lee,  1982).  are  various  methods  for  assessing  goodness  of  fit 

of  a  parametric  model  (for  example,  see  Lee, 
Statistical  Methods  for  Survival  Data  Analysis, 
1982),  we  may  not  be  completely  certain  that  a 
given  parametric  model  is  appropriate. 
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When  in  doubt,  the  Cox  model  is  a  Thus,  when  in  doubt,  as  is  typically  the  case, 
“safe"  choice.  the  Cox  model  will  give  reliable  enough  results 

so  that  it  is  a  "safe”  choice  of  model,  and  the 
user  does  not  need  to  worry  about  whether  the 
wrong  parametric  model  is  chosen. 

In  addition  to  the  general  “robustness”  of  the 
Cox  model,  the  specific  form  of  the  model  is 
attractive  for  several  reasons. 


E  PA 

h(t,  X)  =  ho(t)  x  ei=1 


Baseline 

Exponential 

hazard 

ii 

0  <  h(t,  X)  <  oo  always 

As  described  previously,  the  specific  form  of 
the  Cox  model  gives  the  hazard  function  as  a 
product  of  a  baseline  hazard  involving  t  and  an 
exponential  expression  involving  the  X’s  with¬ 
out  t.  The  exponential  part  of  this  product  is 
appealing  because  it  ensures  that  the  fitted 
model  will  always  give  estimated  hazards  that 
are  non-negative. 


h0(t)  x 


Linear 


11 

Might  be  <  0 


We  want  such  nonnegative  estimates  because, 
by  definition,  the  values  of  any  hazard  function 
must  range  between  zero  and  plus  infinity,  that 
is,  a  hazard  is  always  nonnegative.  If,  instead  of 
an  exponential  expression,  the  X  part  of  the 
model  were,  for  example,  linear  in  the  Xs,  we 
might  obtain  negative  hazard  estimates,  which 
are  not  allowed. 


Even  though  h0(t)  is  unspecified, 
we  can  estimate  the  p’s. 

Measure  of  effect:  hazard  ratio 
(HR)  involves  only  P’s,  without 
estimating  h0(t). 


Another  appealing  property  of  the  Cox  model  is 
that,  even  though  the  baseline  hazard  part  of 
the  model  is  unspecified,  it  is  still  possible  to 
estimate  the  p's  in  the  exponential  part  of  the 
model.  As  we  will  show  later,  all  we  need  are 
estimates  of  the  P’s  to  assess  the  effect  of 
explanatory  variables  of  interest.  The  measure 
of  effect,  which  is  called  a  hazard  ratio,  is  cal¬ 
culated  without  having  to  estimate  the  baseline 
hazard  function. 


Can  estimate  h(t,X)  and  S(t,  X)  for 
Cox  model  using  a  minimum  of 
assumptions. 


Note  that  the  hazard  function  h(t,X)  and  its 
corresponding  survival  curves  S(t,X)  can  be 
estimated  for  the  Cox  model  even  though  the 
baseline  hazard  function  is  not  specified.  Thus, 
with  the  Cox  model,  using  a  minimum  of 
assumptions,  we  can  obtain  the  primary  infor¬ 
mation  desired  from  a  survival  analysis, 
namely,  a  hazard  ratio  and  a  survival  curve. 
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Cox  model  preferred  to  logistic  model. 


11- 

Uses  survival 
times  and 
censoring 


11- 

Uses  (0,1)  outcome; 
ignores  survival  times 
and  censoring 


One  last  point  about  the  popularity  of  the  Cox 
model  is  that  it  is  preferred  over  the  logistic 
model  when  survival  time  information  is  avail¬ 
able  and  there  is  censoring.  That  is,  the  Cox 
model  uses  more  information,  the  survival 
times,  than  the  logistic  model,  which  considers 
a  (0,  1)  outcome  and  ignores  survival  times  and 
censoring. 


IV.  ML  Estimation  of  the 
Cox  PH  Model 


h  (f,X)  =  h0(t)  e-i 


We  now  describe  how  estimates  are  obtained 
for  the  parameters  of  the  Cox  model.  The 
parameters  are  the  p’s  in  the  general  Cox 
model  formula  shown  here.  The  corresponding 
estimates  of  these  parameters  are  called  maxi¬ 
mum  likelihood  (ML)  estimates  and  are 
denoted  as  p,. 


ML  estimates:  p, 


Coef.  ‘ 

^Std.Err.  p  >  |z| 

Haz.  Ratio 

Rx 

1.294 

0.422 

0.002 

3.648 

log  WBC 

1.604 

0.329 

0.000 

4.975 

No.  of  subjects  =  42  Log  likelihood  =  -72.280 


Estimated  model: 

h  (t,  X)  =  ho{t)  eL294  Rx  +  L604  lo«  WBC 


As  an  example  of  ML  estimates,  we  consider 
once  again  the  computer  output  for  one  of  the 
models  (model  2)  fitted  previously  from  remis¬ 
sion  data  on  42  leukemia  patients. 

The  Cox  model  for  this  example  involves  two 
parameters,  one  being  the  coefficient  of  the 
treatment  variable  (denoted  here  as  Rx)  and 
the  other  being  the  coefficient  of  the  log  WBC 
variable.  The  expression  for  this  model  is 
shown  at  the  left,  which  contains  the  estimated 
coefficients  1 .294  for  Rx  and  1.604  for  log  white 
blood  cell  count. 


ML  estimates:  maximize  likelihood 
function  L 

L  =  joint  probability  of  observed 
data  =  L(P) 


As  with  logistic  regression,  the  ML  estimates  of 
the  Cox  model  parameters  are  derived  by  max¬ 
imizing  a  likelihood  function,  usually  denoted 
as  L.  The  likelihood  function  is  a  mathematical 
expression  which  describes  the  joint  probabil¬ 
ity  of  obtaining  the  data  actually  observed  on 
the  subjects  in  the  study  as  a  function  of  the 
unknown  parameters  (the  p’s)  in  the  model 
being  considered.  L  is  sometimes  written  nota- 
tionally  as  L(P)  where  P  denotes  the  collection 
of  unknown  parameters. 


The  expression  for  the  likelihood  is  developed 
at  the  end  of  the  chapter.  However,  we  give  a 
brief  overview  below. 
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L  is  a  partial  likelihood: 

•  considers  probabilities  only  for 
subjects  who  fail 

•  does  not  consider  probabilities 
for  subjects  who  are  censored 


The  formula  for  the  Cox  model  likelihood  func¬ 
tion  is  actually  called  a  “partial”  likelihood  func¬ 
tion  rather  than  a  (complete)  likelihood  function. 
The  term  “partial”  likelihood  is  used  because  the 
likelihood  formula  considers  probabilities  only 
for  those  subjects  who  fail,  and  does  not  explic¬ 
itly  consider  probabilities  for  those  subjects  who 
are  censored.  Thus  the  likelihood  for  the  Cox 
model  does  not  consider  probabilities  for  all  sub¬ 
jects,  and  so  it  is  called  a  “partial”  likelihood. 


Number  of  failure  times 


k 

L  =  L\  x  L2  x  L3  x  ■  •  ■  x  Li  =  J|tj 

7=1 

where 

Lf  =  portion  of  L  for  the  ;th  failure 
time  given  the  risk  set  R  ( t(f )) 


In  particular,  the  partial  likelihood  can  be  writ¬ 
ten  as  the  product  of  several  likelihoods,  one 
for  each  of,  say,  k  failure  times.  Thus,  at  thef-th 
failure  time,  Lf  denotes  the  likelihood  of  failing 
at  this  time,  given  survival  up  to  this  time.  Note 
that  the  set  of  individuals  at  risk  at  the  ;th 
failure  time  is  called  the  "risk  set,”  R(t(f)):  and 
this  set  will  change  -  actually  get  smaller  in  size 
-  as  the  failure  time  increases. 


Information  on  censored  subjects 
used  prior  to  censorship. 


tyuses 


in 

J _ 

hf) 


Censored  later 


Thus,  although  the  partial  likelihood  focuses 
on  subjects  who  fail,  survival  time  information 
prior  to  censorship  is  used  for  those  subjects 
who  are  censored.  That  is,  a  person  who  is 
censored  after  the  f-lb  failure  time  is  part  of 
the  risk  set  used  to  compute  Lf  even  though 
this  person  is  censored  later. 


Steps  for  obtaining  ML  estimates: 

•  form  L  from  model 

•  maximize  In  L  by  solving 


i  =  1, . . .  ,p(#  of  parameters) 


Once  the  likelihood  function  is  formed  for  a 
given  model,  the  next  step  for  the  computer  is 
to  maximize  this  function.  This  is  generally 
done  by  maximizing  the  natural  log  of  L, 
which  is  computationally  easier. 


Solution  by  iteration: 

•  guess  at  solution 

•  modify  guess  in  successive  steps 

•  stop  when  solution  is  obtained 


The  maximization  process  is  carried  out  by  tak¬ 
ing  partial  derivatives  of  log  of  L  with  respect  to 
each  parameter  in  the  model,  and  then  solving  a 
system  of  equations  as  shown  here.  This  solu¬ 
tion  is  carried  out  using  iteration.  That  is,  the 
solution  is  obtained  in  a  stepwise  manner,  which 
starts  with  a  guessed  value  for  the  solution,  and 
then  successively  modifies  the  guessed  value 
until  a  solution  is  finally  obtained. 
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Statistical  inferences  for  hazard 
ratios:  (See  SectionI,  pages  100-107) 


Test  hypotheses 

Confidence  intervals 

Wald  test 

LR  test 

Large  sample  95%  Cl 

HR  =  e p  for  a  (0,  1)  exposure 

variable  (no  interaction) 


Once  the  ML  estimates  are  obtained,  we  are 
usually  interested  in  carrying  out  statistical 
inferences  about  hazard  ratios  defined  in 
terms  of  these  estimates.  We  illustrated  previ¬ 
ously  how  to  test  hypotheses  and  form  confi¬ 
dence  intervals  for  the  hazard  ratio  in  Section  I 
above.  There,  we  described  how  to  compute 
a  Wald  test  and  a  likelihood  ratio  (LR)  test. 
We  also  illustrated  how  to  calculate  a  large 
sample  95%  confidence  interval  for  a  hazard 
ratio.  The  estimated  hazard  ratio  (HR)  was 
computed  by  exponentiating  the  coefficient  of 
a  (0,1)  exposure  variable  of  interest.  Note  that 
the  model  contained  no  interaction  terms 
involving  exposure. 


V.  Computing  the  Hazard 
Ratio 


1 1 1\  — 

h(t,X) 

where 

and 

x  =  (xux2,---,x„) 

denote  the  set  of  Xs  for  two 
individuals 


In  general,  a  hazard  ratio  (HR)  is  defined  as  the 
hazard  for  one  individual  divided  by  the  hazard 
for  a  different  individual.  The  two  individuals 
being  compared  can  be  distinguished  by  their 
values  for  the  set  of  predictors,  that  is,  the  X’s. 

We  can  write  the  hazard  ratio  as  the  estimate 
of  h(t,X.*)  divided  by  the  estimate  of  h(t,X), 
where  X*  denotes  the  set  of  predictors  for  one 
individual,  and  X  denotes  the  set  of  predictors 
for  the  other  individual. 


Note  that,  as  with  an  odds  ratio,  it  is  easier  to 
interpret  an  HR  that  exceeds  the  null  value  of  1 
than  an  HR  that  is  less  than  1 .  Thus,  the  X’s  are 
typically  coded  so  that  group  with  the  larger 
hazard  corresponds  to  X*,  and  the  group  with 
the  smaller  hazard  corresponds  to  X.  As  an 
example,  for  the  remission  data  described  pre¬ 
viously,  the  placebo  group  is  coded  as  X\  =  1, 
and  the  treatment  group  is  coded  as  X{  =  0. 

X*  =  (X\,  X),  -  ■  ■  ,X*),  where  Xj  =  1 
denotes  placebo  group. 

X  =  (Xu  X2,  . . .,  Xp),  where  Xt  =  0 
denotes  treatment  group. 


To  interpret  HR,  want  HR  >  1,  i.e., 
h  ( t ,  X*)  >  h  0 t ,  X). 

Typical  coding:  X*:  group  with 
larger  h 
X:  group  with 
smaller  h 


EXAMPLE:  Remission  Data 
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h(t,X*) 


h(t, 


£ft*,* 

e>=' 

..  £ftA 


We  now  obtain  an  expression  for  the  //R  formula 
in  terms  of  the  regression  coefficients  by  substi¬ 
tuting  the  Cox  model  formula  into  the  numerator 
and  denominator  of  the  hazard  ratio  expression. 
This  substitution  is  shown  here.  Notice  that  the 
only  difference  in  the  numerator  and  denomina¬ 
tor  are  theX*’s  versus  the  As.  Notice  also  that  the 
baseline  hazards  will  cancel  out. 


HR  = 


£  ft*,* 
ho(t)  <?'=■ 

,  Eft*. 

h0(t)e‘=i 


£  ft  (*[-*■) 


ei= i 


HR  =  exp 


1=1 


Using  algebra  involving  exponentials,  the  haz¬ 
ard  ratio  formula  simplifies  to  the  exponential 
expression  shown  here.  Thus,  the  hazard  ratio 
is  computed  by  exponentiating  the  sum  of  each 
P i  “hat”  times  the  difference  between  X*  and  A,. 

An  alternative  way  to  write  this  formula,  using 
exponential  notation,  is  shown  here.  We  will 
now  illustrate  the  use  of  this  general  formula 
through  a  few  examples. 


EXAMPLE 


X  =  (Xi,  X2,. . Xp)  =  (XU,  where  W 
denotes  (0,  1)  exposure  status  (p  =  1) 
X\  =  l,Xi  =  0 

/S  =  exp[p,(Xj  -X!)] 

=  exp  [Pi(l  —  0)]  =eft 

Model  1: 

Coef.  Std.  Err.  P  >  |z|  Haz.  Ratio 
Rx  [1.509)  0.410  0.000  (4.523) 


Suppose,  for  example,  there  is  only  one  X  vari¬ 
able  of  interest,  Xt,  which  denotes  (0,1)  expo¬ 
sure  status,  so  that  p  =  1.  Then,  the  hazard 
ratio  comparing  exposed  to  unexposed  persons 
is  obtained  by  letting  X\  =  1  and  A,  =  0  in  the 
hazard  ratio  formula.  The  estimated  hazard 
ratio  then  becomes  e  to  the  quantity  Pi  “hat” 
times  1  minus  0,  which  simplifies  to  e  to  the 
p!  “hat.” 

Recall  the  remission  data  printout  for  Model  1 , 
which  contains  only  the  Rx  variable,  again 
shown  here.  Then  the  estimated  hazard  ratio 
is  obtained  by  exponentiating  the  coefficient 
1.509,  which  gives  the  value  4.523  shown  in 
the  HR  column  of  the  output. 


EXAMPLE  2  1 

Model  2: 

Std. 

Haz. 

Coef. 

Err. 

P  >  M 

Ratio 

Rx 

1 .294' 

0.422 

0.002 

3.648 

log  WBC 

[l.604j 

0.329 

0.000 

4.975 

X*  =  (1,  log  WBC),  X 

=  (0,  log  WBC) 

HR  for  effect  of  Rx  adjusted  for  log 

WBC: 

As  a  second  example,  consider  the  output  for 
Model  2,  which  contains  two  variables,  the  Rx 
variable  and  log  WBC.  Then  to  obtain  the  haz¬ 
ard  ratio  for  the  effect  of  the  Rx  variable 
adjusted  for  the  log  WBC  variable,  we  let  the 
vectors  X*  and  X  be  defined  as  X*  =  (1,  log 
WBC)  and  X  =  (0,  log  WBC).  Here  we  assume 
that  log  WBC  is  the  same  for  X*  and  X  though 
unspecified. 
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EXAMPLE  2:  (continued) 


HR  =  exp  [iT  (X)  X,)  I  |i,  (X*  -  X2)] 

=  exp[l. 294(1  —  0) 

+  i.604(log  WBC  —  log  WBC)] 
=  exp[l. 294(1)  +  1.604(0)] 


The  estimated  hazard  ratio  is  then  obtained  by 
exponentiating  the  sum  of  two  quantities,  one 
involving  the  coefficient  1.294  of  the  Rx  vari¬ 
able,  and  the  other  involving  the  coefficient 
1.604  of  the  log  WBC  variable.  Since  the  log 
WBC  value  is  fixed,  however,  this  portion  of 
the  exponential  is  zero,  so  that  the  resulting 
estimate  is  simply  e  to  the  1.294. 


General  rule:  If  JXTi  is  a  (0,1) 
exposure  variable,  then 

HR  =  e (=  effect  of  exposure 
adjusted  for  other  Tfs) 
provided  no  other  JUs  are  product 
terms  involving  exposure. 


This  second  example  illustrates  the  general  rule 
that  the  hazard  ratio  for  the  effect  of  a  (0,1) 
exposure  variable  which  adjusts  for  other  vari¬ 
ables  is  obtained  by  exponentiating  the  esti¬ 
mated  coefficient  of  the  exposure  variable.  This 
rule  has  the  proviso  that  the  model  does  not 
contain  any  product  terms  involving  exposure. 


EXAMPLE  3 

Model  3: 

Std. 

Haz. 

Coef. 

Err. 

P  >  M 

Ratio 

Rx 

2.355 

1.681 

0.161 

10.537 

log  WBC 

1.803 

0.447 

0.000 

6.067 

Rx  x  log 

-0.342 

0.520 

0.510 

0.710 

WBC 

Want  HR  for  effect  of  Rx  adjusted  for 
log  WBC. 

Placebo  subject: 

X*  =(Xj  =  1,  X\  =  log  WBC, 

Xj  =  1  x  log  WBC) 

Treated  subject: 

X  =  (Xx  =  0,  X2  =  log  WBC, 

X3  =  0  x  log  WBC) 


HR  =  exp 


E  &■(*?-*.-) 


HR  =  exp[2. 355(1  -  0) 

+  1.803(log  WBC  -  log  WBC) 
+  (— 0.342)(1  x  log  WBC 
-0  x  log  WBC)] 

(=  exp[2.355  -  0.342  log  WBC]) 


We  now  give  a  third  example  which  illustrates 
how  to  compute  a  hazard  ratio  when  the  model 
does  contain  product  terms.  We  consider  the 
printout  for  Model  3  of  the  remission  data 
shown  here. 

To  obtain  the  hazard  ratio  for  the  effect  of  Rx 
adjusted  for  log  WBC  using  Model  3,  we  con¬ 
sider  X*  and  X  vectors  which  have  three  com¬ 
ponents,  one  for  each  variable  in  the  model.  The 
X*  vector,  which  denotes  a  placebo  subject, 
has  components  X(  =  I .  X*2=  log  WBC  and 
lx  log  WBC.  The  X  vector,  which  denotes  a 
treated  subject,  has  components  Xt  =  0,  X2  = 
log  WBC  and  W  =  0  x  log  WBC.  Note  again  that, 
as  with  the  previous  example,  the  value  for  log 
WBC  is  treated  as  fixed,  though  unspecified. 

Using  the  general  formula  for  the  hazard  ratio, 
we  must  now  compute  the  exponential  of  the  sum 
of  three  quantities,  corresponding  to  the  three 
variables  in  the  model.  Substituting  the  values 
from  the  printout  and  the  values  of  the  vectors 
X*  and  X  into  this  formula,  we  obtain  the  expo¬ 
nential  expression  shown  here.  Using  algebra, 
this  expression  simplifies  to  the  exponential  of 
2.355  minus  0.342  times  log  WBC. 
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EXAMPLE:  (continued) 


log  WBC  =  2: 

HR  =  exp[2.355  -  0.342(2)] 

=  eim  =  5.32 
log  WBC  =  4: 

HR  =  exp[2.355  -  0.342(4)] 
=  e°'9S1  =  2.68 


In  order  to  get  a  numerical  value  for  the  hazard 
ratio,  we  must  specify  a  value  for  log  WBC.  For 
instance,  if  log  WBC  =  2,  the  estimated  hazard 
ratio  becomes  5.32,  whereas  if  log  WBC  =  4, 
the  estimated  hazard  ratio  becomes  2.68.  Thus, 
we  get  different  hazard  ratio  values  for  dif¬ 
ferent  values  of  log  WBC,  which  should  make 
sense  since  log  WBC  is  an  effect  modifier  in 
Model  3. 


General  rule  for  (0,  1)  exposure 
variables  when  there  are  product 
terms: 


HR  =  exp 


p +Yisjwj 


where 

(3  =  coefficient  of  E 
Sj  =  coefficient  of  E  x  Wj 

(HR  does  not  contain  coefficients  of 
non-product  terms) 


The  example  we  have  just  described  using 
Model  3  illustrates  a  general  rule  which  states 
that  the  hazard  ratio  for  the  effect  of  a  (0,1) 
exposure  variable  in  a  model  which  contains 
product  terms  involving  this  exposure  with 
other  X’s  can  be  written  as  shown  here.  Note 
that  p  "hat”  denotes  the  coefficient  of  the  expo¬ 
sure  variable  and  the  8  "hats”  are  coefficients  of 
product  terms  in  the  model  of  the  form  E  x  Wy. 
Also  note  that  this  formula  does  not  contain 
coefficients  of  nonproduct  terms  other  than 
those  involving  E. 


EXAMPLE 


Model  3: 


<5i  =  coefficient  of  Rx  x  log  WBC 

HR  (Model  3)  =  exp  |^p  +  <5j  log  WBcJ 
=  exp[2.355  -  0.342  log  WBC] 


For  Model  3,  P  "hat"  is  the  coefficient  of  the  Rx 
variable,  and  there  is  only  one  8  “hat”  in  the 
sum,  which  is  the  coefficient  of  the  product 
term  Rx  x  log  WBC.  Thus,  there  is  only  one  W, 
namely  W|  =  log  WBC.  The  hazard  ratio  for¬ 
mula  for  the  effect  of  exposure  is  then  given  by 
exponentiating  p  "hat”  plus  8  "hat”  times  log 
WBC.  Substituting  the  estimates  from  the 
printout  into  this  formula  yields  the  expression 
obtained  previously,  namely  the  exponential  of 
2.355  minus  0.342  times  log  WBC. 


VI.  Interval  Estimation: 
Interaction 


Model  2: 

h(t,  X)  =  h0  (tjexp^Rx  +  P2log  WBC] 


We  have  previously  illustrated  in  Model  2  of  the 
Remission  Time  Data  how  to  obtain  a  95% 
interval  estimate  of  the  HR  when  there  is  only 
one  regression  coefficient  of  interest,  e.g.,  the 
HR  is  of  the  form  expfPi]. 


HR  =  exp[pi] 
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Large  sample  95%  confidence 
interval: 


exp 


Pj  ±  1.96y  Vd/'Pj 


where 

\  =  V^Pi 

No  interaction:  simple  formula 
Interaction:  complex  formula 


The  procedure  typically  used  to  obtain  a  large 
sample  95%  confidence  interval  (Cl)  for  the 
parameter  is  to  compute  the  exponential  of  the 
estimate  of  the  parameter  plus  or  minus  a  per¬ 
centage  point  of  the  normal  distribution  times 
the  estimated  standard  error  of  the  estimate. 
Note  that  the  square  root  of  the  estimated  vari¬ 
ance  is  the  standard  error. 

This  computation  is  relatively  simple  when 
there  are  no  interaction  effects  in  the  model. 
However,  when  there  is  interaction,  the  compu¬ 
tational  formula  for  the  estimated  standard 
error  is  more  complex. 


Model  3: 

h(t,  X)  =  h0  (tfexpfPjRx  +  p2log  WBC 
+  p3  (Rx  x  log  WBC)] 

HR  =  exp[p!  +  p3log  WBC] 


Suppose  we  focus  on  Model  3,  shown  here 
on  the  left,  and,  again,  we  assume  that  Rx  is  a 
(0,  1)  exposure  variable  of  interest.  Then  the 
formula  for  the  HR  for  the  effect  of  Rx 
controlling  for  the  variable  log  WBC  is  given 
on  the  left  underneath  the  model  formula. 


Interaction:  variance  calculation 
difficult 

No  interaction:  variance  directly 
from  printout 


The  difficult  part  in  computing  the  Cl  for  a  HR 
involving  interaction  effects  is  the  calculation 
for  the  estimated  variance.  When  there  is  no 
interaction,  so  that  the  parameter  of  interest  is 
a  single  regression  coefficient,  this  variance  is 
obtained  directly  from  the  listing  of  estimated 
coefficients  and  corresponding  standard 
errors. 


HR  =  exp  [t] , 

where  i  =  p:  +  p3log  WBC 


95%  Cl  for  HR  =  exp[f] 
exp|T±  1.96yVarf)] 


For  Model  3,  we  can  alternatively  write  the 
estimated  HR  formula  as  exp[f],  where  l  is 
the  linear  function  pj  +  p3log  WBC  and  l  is 
the  estimate  of  this  linear  function  using  the 
ML  estimates. 

To  obtain  a  95%  Cl  for  exp[£]  we  must  expo¬ 
nentiate  the  Cl  for  i.  The  formula  is  shown  on 
the  left. 


General  Formula: 
can  consider  any  t,  e.g., 

t  =  pj  +  (hWi  +  +  . . .  6kWk, 

where  X|  =  (0,  1)  exposure  variable 
and  Pi  =  coeff  of  X1( 

8j  =  coeff  ofXjxWj,  j=l,. . .,  k 


This  Cl  formula,  though  motivated  by  our 
example  using  Model  3,  is  actually  the  general 
formula  for  the  95%  Cl  for  any  HR  of  interest 
from  a  Cox  PH  model.  In  general,  for  a  model 
with  a  (0,  1)  exposure  variable  Xj  and  inter¬ 
action  terms  XixWi,  . . .,  X|  xW|(,  the  linear 
function  may  take  any  form  of  interest,  as 
shown  in  the  left. 
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Var(l)  =  Var(  ^  +  dxWx  +■■■  5kWk) 
where  the  estimates  p, ,  <%  are 

correlated,  so  one  must  use 

Var(P]),  Cov(P! ,  S,)  and  Cov(5i,5j) 


Computer  packages  SAS  and 
STATA  compute  Var  £  as  part  of 
the  program  options  (see  Computer 
Appendix). 


General  formula  for  Var  t. 

Var (£)  =  Var(p,)  +  E  W?Var(Si) 
j 

+  2  Y'WjCdvfajj) 
j 

WjWkCdv(5j,  5k) 

j  k 

•  Variances  and  covariances  pro¬ 
vided  in  the  computer  output 

•  User  specifies  W's  values  of 
interest. 


When  the  HR  involves  interaction  effects,  the 
estimated  variance  considers  a  linear  sum  of 
estimated  regression  coefficients.  Because  the 
coefficients  in  the  linear  sum  are  estimated 
from  the  same  data  set,  these  coefficients  are 
correlated  with  one  another.  Consequently,  the 
calculation  of  the  estimated  variance  must  con¬ 
sider  both  the  variances  and  the  covariances  of 
the  estimated  coefficients,  which  makes  com¬ 
putations  somewhat  cumbersome. 

Nevertheless,  most  computer  packages  that 
have  procedures  for  fitting  survival  analysis 
models  like  the  Cox  model  provide  for  comput¬ 
ing  the  estimated  variance  of  linear  functions 
like  t  as  part  of  the  program  options.  See  the 
Computer  Appendix  for  details  on  the  use  of 
the  “contrast”  option  in  SAS  and  the  “lincom” 
option  in  STATA. 

For  the  interested  reader,  we  provide  here  the 
general  formula  for  the  estimated  variance  of 
the  linear  function  i. 


In  applying  this  formula,  the  user  obtains  the 
estimated  variances  and  covariances  from  the 
variancecovariance  output.  The  user  must 
specify  values  of  interest  for  the  effect  modi¬ 
fiers  defined  by  the  Ws  in  the  model. 


Model  3: 

e  =  pj  +  p3iog  wbc 
Var(i)  =Var(p,)  +  (log  frRC)2Var(p3) 
+  2(logfrRC)2Cov(p1,p3) 


Applying  this  variance  formula  to  Model  3,  we 
obtain  the  variance  expression  shown  on  the  left. 
Since  log  WBC  is  the  only  effect  modifier  here, 
the  user  would  need  to  specify  log  WBC  values  of 
interest,  e.g.,  log  WBC  =2  and  log  WBC=4. 


95%  Cl  for  Rx  in  Model  3 
(SAS  edited  output): 

log  WBC  HR  S.E.  Conf  Limits 

2 "  5.3151  3.8410  1.2894  21.9101 

4  2.6809  1.6520  0.8013  8.9700 


Using  the  "contrast”  option  in  SAS’s  PHREG  pro¬ 
cedure,  we  show  on  the  left  computed  95%  Cl’s 
for  the  Rx  variable  for  two  choices  of  log  WBC. 
When  log  WBC  is  2,  the  estimated  HR  is  5.32  with 
a  95%  Cl  given  by  the  limits  ( 1 .29, 2 1 .91 ),  whereas 
when  log  WBC  is  4,  the  estimated  HR  is  2.68  with 
a  95%  Cl  given  by  the  limits  (0.80,  8.97). 


Cl  results  suggest  log  WBC  by  Rx 
interaction  but  conflict  with  non¬ 
significant  interaction  test  result. 
Note:  small  study  size  (n=42) 


These  results  suggest  the  interaction  of  log  WBC 
with  Rx,  and  they  conflict  with  the  previously 
reported  nonsignificance  of  the  test  for  interac¬ 
tion  in  Model  3,  which  might  primarily  be  attrib¬ 
uted  to  the  small  sample  size  (n=42)  of  this  study. 
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VII.  Adjusted  Survival 
Curves  Using  the  Cox 
PH  model 


Two  primary  quantities: 


The  two  primary  quantities  desired  from  a 
survival  analvsis  point  of  viem  are  estimated 
hazard  ratios  and  estimated  survival  curves. 
Having  just  described  how  to  compute  hazard 
ratios,  we  now  turn  to  estimation  of  survival 
curves  using  the  Cox  model. 


1 .  estimated  hazard  ratios 

2.  estimated  survival  curves 


No  model:  use  KM  cunzes 


Recall  that  if  no  model  is  used  to  fit  survival 
data,  a  survival  curve  can  be  estimated  using  a 
Kaplan-Meier  method.  Such  KM  curves  are 
plotted  as  step  functions  as  shown  here  for 
the  remission  data  example. 


Cox  model:  adjusted  survival 
curves  (also  step  fuuctions). 


Cox  model  hazard  function: 


h(t,X)  =  ho(t)e' 


Cox  model  survival  function: 

P 

^  ^  p  x- 

s(t,x)  =  [Sowr1 


Estimated  survival  function: 

P 

x>*. 

S(t,X)  =  [So(t)]e" 


When  a  Cox  model  is  used  to  fit  survival  data, 
survival  curves  can  be  obtained  that  adjust  for 
the  explanatory  variables  used  as  predictors. 
These  are  called  adjusted  survival  curves, 
and,  like  KM  curves,  these  are  also  plotted  as 
step  functions. 

The  hazard  function  formula  for  the  Cox  PH 
model,  shown  here  again,  can  be  converted  to 
a  corresponding  survival  function  formula  as 
shown  below.  This  survival  function  formula 
is  the  basis  for  determining  adjusted  survival 
curves.  Note  that  this  formula  says  that  the 
survival  function  at  time  t  for  a  subject  with 
vector  X  as  predictors  is  given  by  a  baseline 
survival  function  S0(f)  raised  to  a  power  equal 
to  the  exponential  of  the  sum  of  (!,  times  X,-. 

The  expression  for  the  estimated  survival  func¬ 
tion  can  then  be  written  with  the  usual  “hat” 
notation  as  shown  here. 


S0(t)  and  (!,  are  provided  by  the  The  estimates  of  S0(t)  and  /(  are  provided  by 
computer  program.  The  X,  must  the  computer  program  that  fits  the  Cox  model, 
be  specified  by  the  investigator.  The  X’s,  however,  must  first  be  specified  by  the 

investigator  before  the  computer  program  can 
compute  the  estimated  survival  curve. 
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For  example,  if  we  consider  model  2  for  the 
remission  data,  the  fitted  model  written  in  terms 
of  both  the  hazard  function  and  corresponding 
survival  function  is  given  here. 

We  can  obtain  a  specific  survival  curve  by 
specifying  values  for  the  vector  X,  whose  com¬ 
ponent  variables  are  Rx  and  log  WBC. 

For  instance,  if  Rx  =  1  and  log  WBC  =  2.93,  the 
estimated  survival  curve  is  obtained  by  sub¬ 
stituting  these  values  in  the  formula  as  shown 
here,  and  carrying  out  the  algebra  to  obtain  the 
expression  circled.  Note  that  the  value  2.93  is 
the  overall  mean  log  WBC  for  the  entire  dataset 
of  42  subjects. 

Also,  if  Rx  =  0  and  log  WBC  =  2.93,  the  esti¬ 
mated  survival  curve  is  obtained  as  shown  here. 

Each  of  the  circled  expressions  gives  adjusted 
survival  curves,  where  the  adjustment  is  for 
the  values  specified  for  the  Xs.  Note  that  for 
each  expression,  a  survival  probability  can  be 
obtained  for  any  value  of  t. 

The  two  formulae  just  obtained,  again  shown 
here,  allow  us  to  compare  survival  curves  for 
different  treatment  groups  adjusted  for  the 
covariate  log  WBC.  Both  curves  describe  esti¬ 
mated  survival  probabilities  over  time  assum¬ 
ing  the  same  value  of  log  WBC,  in  this  case,  the 
value  2.93. 

Typically,  when  computing  adjusted  survival 
curves,  the  value  chosen  for  a  covariate  being 
adjusted  is  an  average  value  like  an  arithmetic 
mean  or  a  median.  In  fact,  most  computer  pro¬ 
grams  for  the  Cox  model  automatically  use  the 
mean  value  over  all  subjects  for  each  covariate 
being  adjusted. 

In  our  example,  the  mean  log  WBC  for  all 
42  subjects  in  the  remission  data  set  is  2.93. 
That  is  why  we  chose  this  value  for  log  WBC 
in  the  formulae  for  the  adjusted  survival  curve. 


Typically,  use  X  =  A  or  Xmedian 
Computer  uses  X 


EXAMPLE:  (continued) 


Remission  data  ( n  =  42): 
log  WBC  =  2.93 


EXAMPLE:  Model  2  Remission  Data 


h  (f,X)  =  h0  (t)e1M4  **  +  1604  log  WBC 

£  _  |-^  ^^jexp(  1.294  Rx  +  1.604  log  WBC) 


Specify  values  for  X  =  (Rx,  log  WBC) 


Rx=  1,  log  WBC  =  2.93: 

S(t,  X)  =  [So(0]exp(^+felog  ^ 

-iexp(  1.294(0.5) +1.604(2.93)) 

oWJ 

=  [s0(CP(535)= 


Rv  =  0,  log  WBC  =  2.93: 

S(f,  X)  =  [SoM]exr(L294(o)+L604A93)) 
=  [SoM]exp(4'70)= 


Adjusted  Survival  Curves 

Rx  =  1,  log  WBC  =  2.93: 

S  (t,X)  =  [So  Ml400'9 
Rx  =  0,  log  WBC  =  2.93: 
S  (CX)  =  [So  h)]109-9 
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General  formulae  for  adjusted 
survival  curves  comparing  two 
groups: 


Exposed  subjects: 


exp 

5(t,X,)  =  [So(0] 


p,(i)+E 


Unexposed  subjects: 


exp 

S{t,X o)  =  [5o(0] 


Pi(o)+X)  M* 

¥i 


More  generally,  if  we  want  to  compare  survival 
curves  for  two  levels  of  an  exposure  variable, 
and  we  want  to  adjust  for  several  covariates,  we 
can  write  the  formula  for  each  curve  as  shown 
here.  Note  that  we  are  assuming  that  the  expo¬ 
sure  variable  is  variable  X\,  whose  estimated 
coefficient  is  Pi  “hat,”  and  the  value  of  Xi  is  1 
for  exposed  and  0  for  unexposed  subjects. 


General  formula  for  adjusted 
survival  curve  for  all 
covariates  in  the  model: 


S(t,X)  =  [So(0] 


exp[]T  Mi] 


Also,  if  we  want  to  obtain  an  adjusted  survival 
curve  which  adjusts  for  all  covariates  in  the 
model,  the  general  formula  which  uses  the 
mean  value  for  each  covariate  is  given  as 
shown  here.  This  formula  will  give  a  single 
adjusted  survival  curve  rather  than  different 
curves  for  each  exposure  group. 


EXAMPLE 


Single  survival  curve  for  Cox  model 
containing  Rx  and  log  WBC: 

Rx  =  0.50 


log  WBC  =  2.93 


S(f,  X)  =  [^W]exp(M.v+W°g  WBC) 
[A  /  xiexp(  1.294(0.5) +1.604(2.93)) 

=  |>oWJ 

=  [So«fP(5'35)  = 


To  illustrate  this  formula,  suppose  we  again 
consider  the  remission  data,  and  we  wish  to 
obtain  a  single  survival  curve  that  adjusts  for 
both  Rx  and  log  WBC  in  the  fitted  Cox  model 
containing  these  two  variables.  Using  the  mean 
value  of  each  covariate,  we  find  that  the  mean 
value  for  Rx  is  0.5  and  the  mean  value  for  log 
WBC  is  2.93,  as  before. 

To  obtain  the  single  survival  curve  that  adjusts 
for  Rx  and  log  WBC,  we  then  substitute  the 
mean  values  in  the  formula  for  the  adjusted 
survival  curve  for  the  model  fitted.  The  formula 
and  the  resulting  expression  for  the  adjusted 
survival  curve  are  shown  here.  (Note  that  for 
the  remission  data,  where  it  is  of  interest  to 
compare  two  exposure  groups,  the  use  of  a 
single  survival  curve  is  not  appropriate.) 


Compute  survival  probability  by 
specifying  value  for  t  in 
S{t,  X)  =  [So  «]210'6 

Computer  uses  t’s  which  are  failure 
times. 


From  this  expression  for  the  survival  curve,  a 
survival  probability  can  be  computed  for  any 
value  of  t  that  is  specified.  When  graphing  this 
survival  curve  using  a  computer  package,  the 
values  of  t  that  are  chosen  are  the  failure  times 
of  all  persons  in  the  study  who  got  the  event. 
This  process  is  automatically  carried  out  by  the 
computer  without  having  the  user  specify  each 
failure  time. 
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EXAMPLE  I 

Adjusted  Survival  Curves  for 

Treatment  and  Placebo  Groups 

5(0 

1.0 

(Remission  data) 

kl 

0.8 

-  TCl_ 

0.6 

^Treatment  (itr  =  0) 

0.4 

0.2 

0 

Placebo 
_(Rx  =  1) 

[50(O]4009 

&»]m9 

8 

16  24 

Time 

The  graph  of  adjusted  survival  curves  obtained 
from  fitting  a  Cox  model  is  usually  plotted  as  a 
step  function.  For  example,  we  show  here  the 
step  functions  for  the  two  adjusted  survival 
curves  obtained  by  specifying  either  1  or  0  for 
treatment  status  and  letting  log  WBC  be  the 
mean  value  2.93. 


Next  section:  PH  assumption 

•  explain  meaning 

•  when  PH  not  satisfied 


We  now  turn  to  the  concept  of  the  proportional 
hazard  (PH)  assumption.  In  the  next  section, 
we  explain  the  meaning  of  this  assumption  and 
we  give  an  example  of  when  this  assumption  is 
not  satisfied. 


Later  presentations: 

•  how  to  evaluate  PH 

•  analysis  when  PH  not  met 


In  later  presentations,  we  expand  on  this  subject, 
describing  how  to  evaluate  statistically  whether 
the  assumption  is  met  and  how  to  carry  out  the 
analysis  when  the  assumption  is  not  met. 


VIII.  The  Meaning  of  the 
PH  Assumption 


PH:  HR  is  constant  over  time,  i.e.,  h 
(f,X*)  =  constant  x  h  (f,X) 


HR  = 


h(t,X*) 
h  (f,  X) 


ho(t)  exp 

[£&*?' 

h0(t)  exp 

=  exp 


EMv-v) 


i=i 


where  X*  =  (x\,  X*2l  . . .  ,X*\  and 
X  =  (X|,  X2,  ■ . Xp) 
denote  the  set  of  X’s  for  two 
individuals. 


The  PH  assumption  requires  that  the  HR  is  con¬ 
stant  over  time,  or  equivalently,  that  the  hazard 
for  one  individual  is  proportional  to  the  hazard 
for  any  other  individual,  where  the  proportion¬ 
ality  constant  is  independent  of  time. 

To  understand  the  PH  assumption,  we  need 
to  reconsider  the  formula  for  the  HR  that 
compares  two  different  specifications  X*  and 
X  for  the  explanatory  variables  used  in  the 
Cox  model.  We  derived  this  formula  previously 
in  Section  V,  and  we  show  this  derivation  again 
here.  Notice  that  the  baseline  hazard  function 
h0(t)  appears  in  both  the  numerator  and 
denominator  of  the  hazard  ratio  and  cancels 
out  of  the  formula. 
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h  (t,  X) 


=  exp 


EM*;-*) 


i=l 


does  not  involve  t. 


Lety 

r 

0  =  exp 


Constant 


E  ft- « 


then 

h  (t,  X) 


HR  (X*  versus  X) 


t 


The  final  expression  for  the  hazard  ratio 
therefore  involves  the  estimated  coefficients 
P;  “hat”  and  the  values  of  X*  and  X  for  each 
variable.  However,  because  the  baseline  hazard 
has  canceled  out,  the  final  expression  does  not 
involve  time  t. 

Thus,  once  the  model  is  fitted  and  the  values 
for  X*  and  X  are  specified,  the  value  of  the 
exponential  expression  for  the  estimated  haz¬ 
ard  ratio  is  a  constant,  which  does  not  depend 
on  time.  If  we  denote  this  constant  by  0  “hat,” 
then  we  can  write  the  hazard  ratio  as  shown 
here.  This  is  a  mathematical  expression  which 
states  the  proportional  hazards  assumption. 

Graphically,  this  expression  says  that  the  esti¬ 
mated  hazard  ratio  comparing  any  two  indivi¬ 
duals  plots  as  a  constant  over  time. 


h(t,X*)  =  Oh  (t,  X) 

/ 


Proportionality  constant 
(not  dependent  on  time) 


Another  way  to  write  the  proportional  hazards 
assumption  mathematically  expresses  the  haz¬ 
ard  function  for  individual  X*  as  0  “hat”  times 
the  hazard  function  for  individual  X,  as  shown 
here.  This  expression  says  that  the  hazard 
function  for  one  individual  is  proportional  to 
the  hazard  function  for  another  individual, 
where  the  proportionality  constant  is  0  “hat,” 
which  does  not  depend  on  time. 


EXAMPLE:  Remission  Data 


ii(t,  X)  =  fc0(f)e1'294*c+l'604  log  WBC 

h(t,Rx  =  1,  log  WBC  =  2.93) 
h(t,  Rx  =  0,  log  WBC  =  2.93) 
=  exp[1.294]  =  3.65  Constant 


Placebo 

h  (t,  Rx  =  1,  log  WBC  =  2.93) 

=  3.65  h  ( t ,  Rx  =  0,  log  WBC  =  2.93) 


Treatment 

3.65  =  proportionality  constant 


To  illustrate  the  proportional  hazard  assump¬ 
tion,  we  again  consider  the  Cox  model  for  the 
remission  data  involving  the  two  variables  Rx 
and  log  WBC.  For  this  model,  the  estimated 
hazard  ratio  that  compares  placebo  (Rx  =  1) 
with  treated  (Rx  =  0)  subjects  controlling  for 
log  WBC  is  given  by  e  to  the  1.294,  which  is 
3.65,  a  constant. 

Thus,  the  hazard  for  placebo  group  (Rx  =  1)  is 
3.65  times  the  hazard  for  the  treatment  group 
(Rx  =  0),  and  the  value,  3.65,  is  the  same 
regardless  of  time.  In  other  words,  using  the 
above  model,  the  hazard  for  the  placebo  group 
is  proportional  to  the  hazard  for  the  treatment 
group,  and  the  proportionality  constant  is  3.65. 
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EXAMPLE:  PH  Not  Satisfied 


{0  if  surgery 
1  if  no  surgery 


Surgery 

Radiation  with 
no  surgery 


h(fX)=h0  (t)e^ 


Is  the  above  Cox  PH  model 
appropriate? 

Note: 

Serious  High  risk  for  death  early 
surgery  &  J 


kt.X ) 

Hazards  cross 
E  -  0  (surgery) 


E  -  1  (no  surgery) 


?(days) 


g  =  i) 
£  =  0) 


<  1 


but 


5  days: 


h(t  =  5,  E=  1) 
h(t  =  5,  E  =  0) 


>  1 


To  further  illustrate  the  concept  of  proportional 
hazards,  we  now  provide  an  example  of  a  situ¬ 
ation  for  which  the  proportional  hazards 
assumption  is  not  satisfied. 

For  our  example,  we  consider  a  study  in  which 
cancer  patients  are  randomized  to  either  surgery 
or  radiation  therapy  without  surgery.  Thus,  we 
have  a  (0,1)  exposure  variable  denoting  surgery 
status,  with  0  if  a  patient  receives  surgery  and  1  if 
not.  Suppose  further  that  this  exposure  variable 
is  the  only  variable  of  interest,  so  that  a  Cox  PH 
model  for  the  analysis  of  this  data,  as  shown 
here,  will  contain  only  the  one  variable  E,  denot¬ 
ing  exposure. 

Now  the  question  we  consider  here  is  whether 
the  above  Cox  model  containing  the  variable  E 
is  an  appropriate  model  to  use  for  this  situa¬ 
tion.  To  answer  this  question  we  note  that 
when  a  patient  undergoes  serious  surgery,  as 
when  removing  a  cancerous  tumor,  there  is 
usually  a  high  risk  for  complications  from  sur¬ 
gery  or  perhaps  even  death  early  in  the  recov¬ 
ery  process,  and  once  the  patient  gets  past  this 
early  critical  period,  the  benefits  of  surgery, 
if  any,  can  then  be  observed. 

Thus,  in  a  study  that  compares  surgery  to  no 
surgery,  we  might  expect  to  see  hazard  func¬ 
tions  for  each  group  that  appear  as  shown  here. 
Notice  that  these  two  functions  cross  at  about 
3  days,  and  that  prior  to  3  days,  the  hazard  for 
the  surgery  group  is  higher  than  the  hazard  for 
the  no  surgery  group,  whereas  after  3  days,  the 
hazard  for  the  surgery  group  is  lower  than  the 
hazard  for  the  no  surgery  group. 

Looking  at  the  above  graph  more  closely,  we 
can  see  that  at  2  days,  when  t  =  2,  the  hazard 
ratio  of  non-surgery  {E  =  1 )  to  surgery  (E  =  0) 
patients  yields  a  value  less  than  1 .  In  contrast, 
at  t  =  5  days,  the  hazard  ratio  of  nonsurgery  to 
surgery  yields  a  value  greater  than  1 . 
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EXAMPLE:  (continued) 


Given  the  above  description,  HR  is 

not  constant  over  time. 


Cox  PH  model  inappropriate  because 
PH  model  assumes  constant  HR: 

Ht,X)  =  h0(t)epE 

—  h(t,  E  =  1)  a 

HR  =  T- - =  e* 

h(t,  E  =  0) 


Thus,  if  the  above  description  of  the  hazard 
functions  for  each  group  is  accurate,  the  haz¬ 
ard  ratios  are  not  constant  over  time.  That  is, 
the  hazard  ratio  is  some  number  less  than  1 
before  3  days  and  greater  than  1  after  3  days. 

It  is  therefore  inappropriate  to  use  a  Cox  PH 
model  for  this  situation,  because  the  PH  model 
assumes  a  constant  hazard  ratio  across  time, 
whereas  our  situation  yields  a  hazard  ratio  that 
varies  with  time. 

In  fact,  if  we  use  a  Cox  PH  model,  shown  here 
again,  the  estimated  hazard  ratio  comparing 
exposed  to  unexposed  patients  at  any  time  is 
given  by  the  constant  value  e  to  the  p  “hat,” 
which  does  not  vary  over  time. 


General  rule: 

If  the  hazards  cross,  then  a  Cox  PH 
model  is  not  appropriate. 


This  example  illustrates  the  general  rule  that  if 
the  hazards  cross,  then  the  PH  assumption 
cannot  be  met,  so  that  a  Cox  PH  model  is  inap¬ 
propriate. 


Analysis  when  Cox  PH  model  not 
appropriate?  See  Chapters  5  and  6. 


It  is  natural  to  ask  at  this  point,  if  the  Cox  PH 
model  is  inappropriate,  how  should  we  cany 
out  the  analysis?  The  answer  to  this  question  is 
discussed  in  Chapters  5  and  6.  However,  we 
will  give  a  brier  reply  with  regard  to  the  surgery 
study  example  just  described. 


EXAMPLE:  (continued) 


Surgery  study  analysis  options: 

•  stratify  by  exposure  (use  KM 
curves) 

•  start  analysis  at  3  days;  use  Cox 
PH  model 

•  fit  PH  model  for  <  3  days  and  for 
>3  days;  get  HR  (<  3  days)  and 
HR  (>  3  days) 

•  include  time-dependent  variable 
(e.g.,  E  x  f);  use  extended  Cox 
model 


Actually  for  the  surgery  study  there  are  several 

oplions  available  for  the  analysis.  These  include: 

•  analyze  by  stratifying  on  the  exposure 
variable;  that  is,  do  not  fit  any  model,  and, 
instead  oblain  Kaplan-Meier  curves  for 
each  exposure  group  separately; 

•  start  the  analysis  at  three  days,  and  use  a 
Cox  PH  model  on  three-day  survivors; 

•  fit  Cox  model  for  less  than  three  days  and  a 
different  Cox  model  lor  greater  than  three 
days  to  get  two  different  hazard  ratio 
estimates,  one  for  each  of  these  two  lime 
periods; 

•  fit  a  modified  Cox  model  that  includes  a 
time-dependent  variable  which  measures 
the  interaction  of  exposure  with  time.  This 
model  is  called  an  extended  Cox  model. 
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Different  options  may  lead  to  dif-  Further  discussion  of  these  options  is  given  in 
ferent  conclusions.  subsequent  chapters.  We  point  out  here,  that 

different  options  may  lead  to  different  con¬ 
clusions,  so  that  the  investigator  may  have  to 
weigh  the  relative  merits  of  each  option  in  light 
of  the  data  actually  obtained  before  deciding 
on  any  particular  option  as  best. 


Hazards 

cross 

but 


=>•  PH  not  met 


?  =>■  PH  met 


See  Chapter  4:  Evaluating  PH 
Assumption 


One  final  comment  before  concluding  this 
section:  although  we  have  shown  that  when 
the  hazards  cross,  the  PH  assumption  is  not 
met,  we  have  not  shown  how  to  decide  when 
the  PH  assumption  is  met.  This  is  the  subject 
of  Chapter  4  entitled,  “Evaluating  the  PH 
Assumption.” 


IX.  The  Cox  Likelihood 


Likelihood 

•  Typically  based  on  outcome 
distribution 

•  Outcome  distribution  not 
specified  for  Cox  model 

•  Cox  likelihood  based  on  order 
of  events  rather  than  their 
distribution 

O  Called  partial  likelihood 

Illustration 

Scenario: 

•  Gary,  Larry,  Barry  have  lottery 
tickets 

•  Winning  tickets  chosen  at 
times  ti,  t2,  -  -  - 

•  Each  person  ultimately  chosen 

•  Can  be  chosen  only  once 

Question: 

What  is  the  probability  that  the 

order  chosen  is  as  follows? 

1 .  Barry 

2.  Gary 

3.  Larry 


Typically,  the  formulation  of  a  likelihood 
function  is  based  on  the  distribution  of  the 
outcome.  However,  one  of  the  key  features  of 
the  Cox  model  is  that  there  is  not  an  assumed 
distribution  for  the  outcome  variable  (i.e.,  the 
time  to  event).  Therefore,  in  contrast  to  a  para¬ 
metric  model,  a  full  likelihood  based  on  the 
outcome  distribution  cannot  be  formulated 
for  the  Cox  PH  model.  Instead,  the  construc¬ 
tion  of  the  Cox  likelihood  is  based  on  the 
observed  order  of  events  rather  than  the 
joint  distribution  of  events.  Thus  the  Cox  likeli¬ 
hood  is  called  a  “partial”  likelihood. 


To  illustrate  the  idea  underlying  the  formula¬ 
tion  of  the  Cox  model,  consider  the  following 
scenario.  Suppose  Gary,  Larry,  and  Barry  are 
each  given  a  lottery  ticket.  Winning  tickets  are 
chosen  at  times  tj  (j  =  1,2,  . . .).  Assume  each 
person  is  ultimately  chosen  and  once  a  person 
is  chosen  he  cannot  be  chosen  again  (i.e.,  he  is 
out  of  the  risk  set).  What  is  the  probability  that 
the  order  each  person  is  chosen  is  first  Barry, 
then  Gary,  and  finally  Larry? 
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Answer: 

Probability  =  -x-x-  =  - 
^3216 

/  T  \ 

Barry  Gary  Larry 


Scenario: 

Barry  -  4  tickets 
Gary  -  1  ticket 
Larry  -  2  tickets 


The  probability  that  Barry’s  ticket  is  chosen 
before  Gary’s  and  Larry’s  is  one  out  of  three. 
Once  Barry’s  ticket  is  chosen,  it  cannot  be 
chosen  again.  The  probability  that  Gary’s  ticket 
is  then  chosen  before  Larry’s  is  one  out  of 
two.  Once  Barry’s  and  Gary’s  tickets  are  chosen, 
they  cannot  be  chosen  again  which  means  that 
Larry’s  ticket  must  be  chosen  last.  This  yields  a 
probability  of  1/6  for  this  given  order  of  events 
(see  left). 

Now  consider  a  modification  of  the  previous 
scenario.  Suppose  Barry  has  4  tickets,  Gary 
has  1  ticket,  and  Larry  has  2  tickets;  now  what 
is  the  probability  that  the  order  each  person  is 
chosen  is  first  Barry,  then  Gary,  and  finally 
Larry? 


Question: 

What  is  the  probability  that  the 
order  chosen  is  as  follows? 

1 .  Barry 

2.  Gary 

3.  Larry 


Barry,  Gary,  and  Larry  have  7  tickets  in  all  and 
Barry  owns  4  of  them  so  Barry’s  probability  of 
being  chosen  first  is  4  out  of  7.  After  Barry  is 
chosen,  Gary  has  1  of  the  3  remaining  tickets 
and  after  Barry  and  Gary  are  chosen,  Larry 
owns  the  remaining  2  tickets.  This  yields  a 
probability  of  4/21  for  this  order  (see  left). 


Answer: 

,  ,  4  12  4 

Probability  =  -x-x-  =  — 


For  this  scenario 

Subject’s  number  of  tickets 
affects  probability 

For  Cox  model 


For  this  scenario,  the  probability  of  a  particu¬ 
lar  order  is  affected  by  the  number  of  tickets 
held  by  each  subject.  For  a  Cox  model,  the 
likelihood  of  the  observed  order  of  events  is 
affected  by  the  pattern  of  covariates  of  each 
subject. 


Subject’s  pattern  of  covariates 
affects  likelihood  of  ordered 
events 
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ID 

TIME 

STATUS 

SMOKE 

Barry 

2 

1 

1 

Gary 

3 

1 

0 

Harry 

5 

0 

0 

Larry 

8 

1 

1 

SURVT  =  Survival  time  (in  years) 
STATUS  =  1  for  event,  0  for  censor¬ 
ship 

SMOKE  =  i  for  a  smoker,  0  for  a 
nonsmoker 


Cox  PH  model 

h(t)  =  h0(t)e^,SMOKE 


ID 

Hazard 

Barry 

h0(t)e^ 

Gary 

h0(t)e° 

Harry 

O 

'-+• 

c 

Larry 

hQ(t)e?1 

Individual  hazards  (Cox  likeli¬ 
hood)  analogous  to  number  of  tick¬ 
ets  (lottery  scenario)  For  example, 
smokers  analogous  to  persons  with 
extra  lottery  tickets 

Cox  Likelihood 

[ _ ho(t)e^ _ ' 

li0(t)ePi  +  hQ(t)e°  +  h0(t)e°  +  h0(t)e^ 

’ _ ho(t)e° _ ' 

h0(t)e°  +  h0(t)e°  +  h0(t)e 

' hQ(t)e 

[h0(t)eVi 

Likelihood  is  product  of  3  terms 
L=L\  xL2xL3 


h0(t)e^ 

ko(t)efo  +  ho(t)e°  +  ho(t)e° 
ho(t)e° 

+  h0{t)eV\ 

ho(t)e°  +  h0(t)e°  +  h0(t)e^\ 

ho(t)e^' 

h0(t)e P>_ 

To  illustrate  this  connection,  consider  the 
dataset  shown  on  the  left.  The  data  indicate 
that  Barry  got  the  event  at  TIME  =  2  years. 
Gary  got  the  event  at  3  years,  Harry  was  censored 
at  5  years,  and  Larry  got  the  event  at  8  years. 
Furthermore,  Barry  and  Larry  were  smokers 
whereas  Gary  and  Harry  were  nonsmokers. 


Consider  the  Cox  proportional  hazards  model 
with  one  predictor,  SMOKE.  Under  this  model 
the  hazards  for  Barry,  Gary,  Harry,  and  Larry 
can  be  expressed  as  shown  on  the  left.  The 
individual  hazards  are  determined  by  whether 
the  subject  was  a  smoker  or  nonsmoker. 

The  individual  level  hazards  play  an  analogous 
role  toward  the  construction  of  the  Cox  likeli¬ 
hood  as  the  number  of  tickets  held  by  each 
subject  plays  for  the  calculation  of  the  prob¬ 
abilities  in  the  lottery  scenario  discussed  ear¬ 
lier  in  this  section.  The  subjects  who  smoke  are 
analogous  to  persons  given  extra  lottery  tick¬ 
ets,  thereby  affecting  the  probability  of  a  par¬ 
ticular  order  of  events. 

On  the  left  is  the  Cox  likelihood  for  these  data. 
Notice  the  likelihood  is  a  product  of  three 
terms,  which  correspond  to  the  three  event 
times.  Barry  got  the  event  first  at  TIME  = 
2  years.  At  that  time,  all  four  subjects  were 
at  risk  for  the  event.  The  first  product  (Li)  has 
the  sum  of  the  four  subjects’  hazards  in  the 
denominator  and  Barry’s  hazard  in  the  numer¬ 
ator.  Gary  got  the  event  next  at  3  years  when 
Gary,  Harry,  and  Larry  were  still  in  the  risk  set. 
Consequently,  the  second  product  (L2)  has  the 
sum  of  the  three  hazards  for  the  subjects  still 
at  risk  in  the  denominator  and  Gary’s  hazard  in 
the  numerator.  Harry  was  censored  at  5  years, 
which  occurred  between  the  second  and  third 
event.  Therefore,  when  Larry  got  the  final  event 
at  8  years,  nobody  else  was  at  risk  for  the  event. 
As  a  result,  the  third  product  (L3)  just  has 
Larry’s  hazard  in  the  denominator  and  the 
numerator. 
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ti,  time  =  2,  four  at  risk  (Lj) 
t2,  time  =  3,  three  at  risk  (L2) 
t3,  time  =  8,  one  at  risk  (L3) 

For  each  term: 

Numerator  -  single  hazard 
Denominator  -  sum  of  hazards 

Baseline  hazard,  h0(t)  cancels 

_  r  gp. 

ePi  _|_  e0  _|_  e0  _|_  eP[ 

DP.l 


Thus,  L  does  not  depend  on  h0(t) 


e°  +  e°  +  ePi 


Data  A 


ID 

TIME 

STATUS 

SMOKE 

Barry 

2 

1 

1 

Gary 

3 

1 

0 

Harry 

5 

0 

0 

Larry 

8 

1 

1 

Data  B 

ID 

TIME 

STATUS 

SMOKE 

Barry 

1 

1 

1 

Gary 

7 

1 

0 

Harry 

8 

0 

0 

Larry 

63 

1 

1 

Comparing  datasets 


•  TIME  variable  differs 

•  Order  of  events  the  same 

•  Cox  PH  likelihood  the  same 


To  summarize,  the  likelihood  in  our  example 
consists  of  a  product  of  three  terms  (Llf  L2,  and 
L3)  corresponding  to  the  ordered  failure  times 
(t1(  t2,  and  t3).  The  denominator  for  the  term 
corresponding  to  time  tj  (j  =  1,  2,  3)  is  the  sum 
of  the  hazards  for  those  subjects  still  at  risk  at 
time  tj,  and  the  numerator  is  the  hazard  for  the 
subject  who  got  the  event  at  tj. 

A  key  property  of  the  Cox  likelihood  is  that 
the  baseline  hazard  cancels  out  in  each  term. 
Thus,  the  form  of  the  baseline  hazard  need  not 
be  specified  in  a  Cox  model,  as  it  plays  no  role 
in  the  estimation  of  the  regression  parameters. 
By  factoring  h0(t)  in  the  denominator  and  then 
canceling  it  out  of  each  term,  the  likelihood 
for  Barry,  Gary,  and  Larry  can  be  rewritten  as 
shown  on  the  left. 

As  we  mentioned  earlier,  the  Cox  likelihood  is 
determined  by  the  order  of  events  and  censor¬ 
ships  and  not  by  the  distribution  of  the  outcome 
variable.  To  illustrate  this  point,  compare  data¬ 
sets  A  and  B  on  the  left,  and  consider  the  likeli¬ 
hood  for  a  Cox  PH  model  with  smoking  status 
as  the  only  predictor.  Although  the  values  for 
the  variable  TIME  differ  in  the  two  datasets,  the 
Cox  likelihood  will  be  the  same  using  either 
dataset  because  the  order  of  the  outcome 
(TIME)  remains  unchanged. 
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General  Approach 

•  k  failure  times 

•  Likelihood  a  product  of  K  terms 

•  Construction  of  each  term 
similar  to  Barry,  Gary,  and  Larry 

L  =  L\  x  L2  x  L3  x  . . .  x  Lk 

=il 

/=  1 


We  have  used  a  small  dataset  (four  observations 
with  three  failure  times)  for  ease  of  illustration. 
However,  the  approach  can  be  generalized. 
Consider  a  dataset  with  k  failure  times  and  let 
Lf  denote  the  contribution  to  the  likelihood 
corresponding  to  the  f-th  failure  time.  Then  the 
Cox  likelihood  can  be  formulated  as  a  product  of 
each  of  the  k  terms  as  shown  on  the  left.  Each  of 
the  terms  Lf  is  constructed  in  a  similar  manner 
as  with  the  data  for  Gary,  Larry,  and  Barry. 


Obtaining  maximum  likelihood 
estimates 


Solve  system  of  equations 


din  L 

^pT 


=  0, 


i=  1,  2,  3,  ...,  p 


p  =  #  of  parameters 


Once  the  likelihood  is  formulated,  the  question 
becomes:  which  values  of  the  regression  para¬ 
meters  would  maximize  L?  The  process  of  max¬ 
imizing  the  likelihood  is  typically  carried  out 
by  setting  the  partial  derivative  of  the  natural 
log  of  L  to  zero  and  then  solving  the  system  of 
equations  (called  the  score  equations). 


X.  Using  Age  as  the 
Time  Scale 


Outcome  variable: 

time  until  an  event  occurs 

where  “time”  is  measured  as 

time-on-study  (years,  months, 
etc.,  of  follow-up  from  study 
entry) 
or 

age  at  follow-up 


Recall  that  when  we  introduced  the  topic  of 
survival  analysis  in  Chapter  1,  we  wrote  that 
the  "time”  variable  used  as  the  outcome  vari¬ 
able  could  be  measured  as  time-on-study  (i.e., 
follow-up  time  since  study  entry)  in  years, 
months,  weeks,  or  days  from  the  beginning  of 
follow-up.  We  also  wrote  that,  alternatively,  we 
might  use  age  as  the  time  scale,  so  that  time  is 
measured  as  age  at  follow-up  until  either 
an  event  or  censorship  occurs.  In  this  section, 
we  focus  on  the  use  of  age  as  the  time  scale,  and 
describe  when  such  use  is  appropriate,  provide 
the  form  of  the  Cox  PH  model  in  this  situation, 
and  illustrate  its  use. 


Time  0: 

starting  time  of  the  true  survival 
time 

Possible  choices  for  time  0: 

•  Study  entry 

•  Beginning  of  treatment 

•  Disease  onset 

•  Disease  diagnosis 

•  Surgery 

•  Point  in  calendar  time 

•  Birth 

•  Conception 


A  key  decision  in  any  survival  analysis  is  where 
to  define  the  starting  point  for  determining 
individual’s  “true”  survival  time,  which  we  call 
time  0.  Depending  on  the  study,  choices  for 
time  0  might  be:  the  time  the  subject  enters  the 
study,  the  time  the  subject  begins  treatment, 
the  time  of  disease  onset,  the  time  of  diagnosis, 
a  point  in  calendar  time,  the  time  of  a  seminal 
event  (e.g.,  surgery),  birth,  or  conception.  If  we 
define  time  0  at  birth,  then  an  individual’s 
survival  time  is  represented  by  their  age. 
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Time  0  not  necessarily  equal  to  t0, 

where 

to  =  time  when  subject’s  survival 
time  is  first  observed 

e.g.,  if  survival  time  is  measured  by 
age  and  subject  enters  study  at 
age  45 

t0  =  age  45  but  time  0  <  age  45 
since  time  0  =  age-at-birth 

Left  truncation: 

•  subject  not  observed  before  t0 

•  if  subject  has  event  before  t0, 
then  not  included  in  the  study 

•  if  subject  has  event  after  t0, 
then  included  in  the  study  and 
assumed  not  at  risk  for  event 
until  t0 


Time  0  is  not  necessarily  the  time  point  where  a 
subject’s  survival  time  is  first  observed  (which 
we  call  time  t0).  For  example,  if  survival  time  is 
measured  by  age  at  follow-up  and  a  subject 
enters  the  study  at  age  45,  then  t0=45  years 
for  this  subject.  In  this  example,  the  subject’s 
survival  time  has  been  left-truncated  at  t0  =  45, 
which  we  now  define. 


Left  truncation  at  time  t0  is  defined  as  follows: 

The  subject  is  not  observed  from  time  0  to  t0. 
If  the  subject  has  the  event  before  time  t0,  then 
that  subject  is  not  included  in  the  study.  If  the 
subject  has  the  event  after  time  t0,  the  subject  is 
included  in  the  study  but  with  the  caveat  that 
the  subject  was  not  at  risk  to  be  an  observed 
event  until  time  t0. 


Two  types  of  left  truncation: 

Type  1 :  subject  has  event  before  t0 
and  not  included  in  the 
study, 

e.g., 

E  causes  death  before  study  entry 

If 

Bias:  effect  of  E  underestimated 


We  note  that  there  are  two  types  of  left  trunca¬ 
tion  at  t0.  The  first  type  of  left  truncation  occurs 
if  the  subject  has  the  event  before  t0  and  thus  is 
not  included  in  the  study.  If,  for  example,  the 
exposure  (E)  under  study  causes  individuals  to 
die  before  they  could  enter  the  study,  this  could 
lead  to  a  (selective)  survival  bias  that  would 
underestimate  the  effect  of  exposure. 


Type  2:  t0  >  0 
and 
t  >  t0 

where  t  =  observed  survival  time 


The  second  type  of  left  truncation  occurs  if  the 
subject  survives  beyond  time  t0  (i.e.,  t  >  t0).  This 
is  required  in  order  for  the  subject  to  have  his/ 
her  survival  time  observed. 


Study  entry  =>•  subject  survives 
until  t0 

Type  1 :  subject  not  included  in  the 
study 

Type  2:  subject  included  in  the  study 


Thus,  a  condition  of  the  subject’s  entry  into  the 
study  is  that  they  survive  until  time  t0.  If  they 
do  not  meet  that  condition,  then  their  left  trun¬ 
cation  is  of  the  first  type  and  thus  not  included 
in  the  study.  If  they  do  survive  past  time  t0, 
then  their  left  truncation  is  of  the  second  type. 


Subject  A 


* 

? 


Left  censored  at  tA 


0 

:  x  denotes  failure 
|  0  denotes  censored 


t0=o 

\ 


Subject  first  observed 


time 


Left  truncation  (of  both  types)  at  time  t  is  com¬ 
monly  confused  with  left  censorship  at  time  t. 
If  a  subject  is  left  censored  at  time  t,  then  that 
subject  is  (i)  included  in  the  study,  (ii)  known 
to  be  event  free  at  time  0,  (iii)  known  to  be  at 
risk  for  the  event  after  time  0,  and  (iv)  known  to 
have  had  the  event  before  time  t  but  with  the 
exact  time  of  event  being  unknown. 
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Example:  left  censored  data 
Subject  A:  t0  =  0 
tA=  6 

true  t  =  ?  <  6 


["Type 


Subject  B 
Subject  C 


1  left  truncation 


^J-|Type 


2  left  truncation 


-time 


^  h  to  tc 

\ 

observation  begins 


Example:  Type  1  left  truncation 
Subject  B:  Time  0  <  t0 

not  included  in  the 
study 

Example:  Type  2  left  truncation 
Subject  C:  Time  0  <  t0  but  first 
observed  at  t0 

Being  observed  at  time  t  means: 
If  event  at  t  =>■  recorded  event  at  t 


2  approaches  for  measuring  sur¬ 
vival  time: 

Time-on-study 

vs. 

Age-at-follow-up 

•  Choice  determines  the  risk  set. 


For  example,  for  subject  A  in  the  above  graph, 
suppose  death  is  the  outcome  and  a  patient  is 
first  treated  for  some  illness  at  the  time  of  their 
first  visit  to  the  clinic  (to=0).  Further,  suppose 
that  the  patient  does  not  show  up  at  the  next 
scheduled  clinic  visit  6  months  later  because 
that  patient  had  died  in  the  interim.  If  the  spe¬ 
cific  month  of  death  is  unable  to  be  ascertained, 
then  that  patient  is  included  in  the  study  and 
left  censored  at  tA  =  6  months. 

In  contrast,  the  diagram  on  the  left  illustrates 
the  two  types  of  left  truncation. 

In  this  diagram,  subject  B  provides  an  example  of 
left  truncation  of  the  first  type  that  would  occur  if 
an  individual  died  between  disease  onset  (time  0) 
and  disease  diagnosis  (time  t0)  and  thus  was 
never  included  in  the  study.  In  this  example, 
having  the  disease  was  a  necessary  condition 
for  study  inclusion,  whereas  subject  B  died 
before  it  was  known  that  he/she  had  the  disease. 

Subject  C  illustrates  Type  2  left  truncation, 
since  he/she  developed  the  disease  (time  0) 
prior  to  being  diagnosed  with  the  disease  at 
time  t0  and  was  observed  after  t0. 

One  clarifying  point  is  that  when  we  say  a  sub¬ 
ject  is  observed  at  time  t,  we  do  not  necessarily 
mean  that  the  subject  is  observed  in  an  active 
prospective  manner.  Rather,  what  we  mean  by 
a  subject  being  observed  at  time  t  is  as  follows: 
if  that  subject  had  an  event  at  time  t,  then  the 
subject  would  be  recorded  in  the  study  as  having 
had  an  event  at  time  t. 

We  now  compare  two  approaches  of  measur¬ 
ing  survival  time.  One  approach  is  to  measure 
survival  time  as  time-on-study  and  the  other  is 
to  measure  survival  time  as  age-at-follow-up 
until  either  an  event  or  censorship.  The  choice 
of  approach  determines  the  risk  set  at  the 
time  of  each  event.  We  illustrate  this  idea  with 
hypothetical  data. 
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Hypothetical  Survival  Data 

Subject  t  d  a0 

a 

Consider  the  data  shown  on  the  left  on  four 
different  subjects,  for  each  of  which  we  have 

H 

2 

1 

65 

67 

identified  time-on-follow-up  (t),  whether  failed 

I 

6 

0 

65 

71 

or  censored  (d),  age  at  study  entry  (a0),  and  age 

J 

6 

0 

74 

80 

at  the  end  of  follow-up  (a).  Note  that  t  is  simply 

K 

3 

1 

75 

78 

a  —  a0,  the  difference  between  age  at  follow-up 

time  and  age  at  study  entry. 


Using  time-on-study  (i.e.,  from  entry  into  the 
R(t(f))  study)  as  the  time  scale,  the  data  layout  based 

H, I,J,K  on  ordered  follow-up  times  is  shown  on  the 

I, J,K  left,  below  which  is  shown  a  graphical  repre¬ 

sentation  that  follows  each  subject  from  the 
time  of  study  entry.  There  are  only  two  failures 
and  these  occur  at  follow-up  times  2  (subject  H) 
and  3  (subject  K). 

Time-on-study  as  Time  Scale  The  risk  set  at  the  first  failure  time  (t(i)  =2) 

consists  of  all  four  subjects,  and  the  risk  set  at 
the  second  failure  time  (t(2)  =3)  contains  sub¬ 
jects  I,  J,  and  K.  Subjects  I  and  J  are  censored 
after  time  t(2)  =3,  i.e.,  the  value  for  q2  is  2  at  this 
time.  Notice  that  the  risk  set  at  time  t(2)  =3  is 
contained  in  the  risk  set  at  time  t(1)  =2,  which 
is  generally  true  when  the  outcome  variable 
R(t(i>  =  2)  =  [H,  I,  J,  K]  being  considered  is  time-on-follow-up.  The  data 

R(t(2)  =  3)  =  [I,  J,  K]  layout  represented  here,  in  which  the  size  of  the 

I  and  J  censored  after  t(2)  =>  q2  2  risk  set  always  decreases  over  time,  is  called  a 
[I,  J,  K]  contained  in  [H,  I,  J,  Kj  closed  cohort. 


Age  as  Time  Scale  Layout 

Now  let's  consider  the  data  layout  that  would 

f 

a(f) 

nf 

mf 

qf 

R(af) 

result  if  we  used  age  as  the  time  scale,  which  is 

1 

67 

2 

1 

l 

H,I 

shown  on  the  left.  Below  this  layout  is  a  graph¬ 

2 

78 

2 

1 

l 

J,K 

ical  representation  that  follows  each  subject 

from  age  at  study  entry. 


Using  age  as  the  time  scale,  the  first  failure 
time  is  at  age  a(i)  =  67  (for  subject  H),  and 
there  are  two  subjects  (H  and  I)  in  the  risk  set 
at  this  time;  subject  I  is  in  the  risk  set  at  a(i)  = 
67  because  (s)he  entered  the  study  at  age  65 
and  was  still  at  risk  when  subject  H  failed. 
However,  because  subjects  J  and  K  did  not 
enter  the  study  until  ages  74  and  75,  these 
subjects  are  not  in  the  risk  set  at  a(1)  =67. 

I  still  at  risk  at  aq)  =  67  but 
J  and  K  not  in  study  at  a(i)  =  67 


Age  as  Time  Scale  w.  Left  Truncation 


65  67  69  71  73  75  77  79  81 


First  failure:  R(a(i)  =  67)  =  [H,  1} 


Time-on-study  Layout 
f  t(f)  nf  mf  qf 

1  2  4  0  0 

2  3  3  2  2 
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Second  failure:  R(a(2)  =  78)  =  [J,  K) 

H  and  J  no  longer  at  risk  at  a(2)  =  78 
I  censored  between  =  67  and 
a(2)  =  78  =>  qi  =1 
J  censored  after  a(2)  =  78  =>■  q2  =1 

[J,  K]  not  contained  in  [H,  I] 


Time-on-Study  vs.  Age  as  Time 
Scale 

•  Closed  cohort  vs.  Open  cohort 

•  How  we  decide  which  to  use? 


Key  issue: 

Did  all  subjects  first  become  at 
risk  at  their  study  entry? 


Clinical  trial: 

•  Subjects  start  to  be  followed  for 
the  outcome  after  random 
allocation 

•  Reasonable  to  assume  subjects 
start  to  be  at  risk  upon  study 
entry 

D 

Time-on-Study  typically  used 
as  the  outcome 

(Covariates  may  also  be  controlled) 


Observational  study: 

•  Subjects  already  at  risk  prior  to 
study  entry 

•  Unknown  time  or  age  when 
first  at  risk 


The  second  failure  time  is  at  age  a(2)  =  78  (for 
subject  K).  The  only  two  subjects  in  the  risk  set 
at  a(2)  =  78  are  subjects  J  and  K,  since  subject  H 
failed  at  age  67  and  subject  I  was  censored  at 
age  71.  The  values  in  the  q  column  are  1  at 
failure  age  67  (for  subject  I)  and  1  at  failure 
age  72  (for  subject  J).  In  contrast  to  the  previ¬ 
ous  data  layout,  the  risk  set  at  the  later  failure 
age  (containing  J,  K)  is  not  a  subset  of  the  risk 
set  at  the  first  failure  age  (containing  H,  I),  but, 
rather,  is  a  mutually  exclusive  subset.  This  data 
layout,  in  which  the  size  of  the  risk  set  may 
increase  or  decrease  over  time,  is  called  an 
open  cohort. 

We  thus  see  that  using  time-on-study  as  the  time 
scale  can  give  a  different  view  of  the  survival 
data  (i.e.,  a  closed  cohort)  than  found  when 
using  age  as  the  time  scale  (i.e.,  an  open  cohort). 
So  which  time  scale  should  be  used  and  how  do 
we  make  such  a  decision  in  general? 

To  answer  this  question,  a  key  issue  is  to  deter¬ 
mine  whether  all  subjects  in  the  study  first 
begin  to  be  at  risk  for  the  outcome  at  the  time 
they  enter  the  study. 

Suppose  the  study  is  a  clinical  trial  to  com¬ 
pare,  say,  treatment  and  placebo  groups,  and 
subjects  start  to  be  followed  shortly  after  ran¬ 
dom  allocation  into  one  of  these  two  groups. 

Then,  it  may  be  reasonable  to  assume  that  study 
subjects  begin  to  be  at  risk  for  the  outcome 
upon  entry  into  the  study.  In  such  a  situation, 
using  time-on-study  as  the  time  scale  is  typically 
appropriate.  Further,  covariates  of  interest  may 
be  controlled  for  by  stratification  and/or  being 
entered  into  a  regression  model  (e.g.,  Cox  PH 
model)  as  predictors  in  addition  to  the  treatment 
status  variable. 

Suppose,  instead  of  the  above  scenario,  the 
study  is  observational  (i.e.,  not  a  clinical  trial) 
and  subjects  are  already  at  risk  for  the  outcome 
prior  to  their  study  entry.  Also,  suppose  the  time 
or  age  at  which  subjects  first  became  at  risk  is 
unknown. 
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•  Example:  Subjects  with  high 
blood  pressure  enter  study,  but 
unknown  date  or  age  when  first 
diagnosed  (prior  to  study 
entry). 


•  Reasonable  to  assume  that 
T  =  tr  +  t 

where 

T  =  true  survival  time 

tr  =  time  at  risk  prior  to  study  entry 

t  =  observed  time-on-study 


Left-truncated  survival  data 
Time-on-study  questionable 


Subject 

t 

d 

a0 

a 

H 

2 

1 

65 

67 

I 

6 

0 

65 

71 

J 

6 

0 

74 

80 

K 

3 

1 

75 

78 

Subject  J  is  9  years  older  than 
Subject  I 

D- 

h(t|  subject  J)  >  h(t|  subject  I). 

But,  using  time-on-study  approach 
does  not  account  for  this  difference. 


One  modified  approach: 

Use  time-on-study,  but  control  for 
a0,  e.g., 

h(t,  X,  fl0)  =  MO  exPE  PE  +  y«o] 

11 

OK  provided  model  correctly  spe¬ 
cified  but  not  always  appropriate. 

Alternatively,  may  consider  using 
age  as  the  time  scale. 


For  example,  the  subjects  may  all  have  high 
blood  pressure  when  the  study  begins  and 
are  then  followed  until  a  coronary  event  occurs 
(or  censorship);  such  subjects  already  had  high 
blood  pressure  when  recruited  for  the  study,  but 
the  date  or  their  age  when  their  high  blood 
pressure  condition  was  first  diagnosed  is 
assumed  unknown. 

In  this  situation,  it  seems  reasonable  that  the 
time  at  risk  prior  to  study  entiy  (tr),  which  is 
unknown,  contributes  to  the  true  survival  time 
(T)  for  the  individual,  although  only  the  observed 
time-on-study  (t)  is  actually  available  to  analyze. 
The  individual’s  true  (i.e.,  total)  survival  time  is 
therefore  underestimated  by  the  time-on-study 
information  (obtained  from  study  entry),  i.e.,  the 
true  survival  time  is  left-truncated. 

So,  for  the  situation  where  we  have  left-truncated 
survival  data,  the  use  of  time-on-study  follow-up 
times  that  ignores  unknown  delayed  entry  time 
may  be  questioned. 

Recall  that  although  both  subjects  I  and  J  were 
censored  at  follow-up  time  6  (say,  in  weeks) 
from  study  entry,  subject  I  entered  the  study  at 
age  65,  whereas  subject  J  entered  the  study 
at  age  74.  Because  subject  J  is  9  years  older 
than  subject  I  upon  study  entry,  and  recognizing 
that  age  is  a  well-known  risk  factor  for  most 
diseases,  e.g.,  coronary  disease,  we  would  expect 
subject  J  to  have  higher  potential  for  failing  (i.e., 
higher  hazard  rate)  than  subject  I  at  study  entry. 
However,  if  we  just  use  time-on-study  follow-up 
times  in  our  analysis,  we  are  not  taking  into 
account  the  increased  failure  potential  for  sub¬ 
ject  J  over  subject  I  at  study  entry. 

One  way  to  account  for  the  age  difference  at 
entry  would  simply  to  control  for  age  at  entry 
(i.e.,  a0)  as  a  covariate  in  one’s  survival  analysis 
by  adding  the  variable  a0  to  a  Cox  PH  model. 
This  approach  is  reasonable  provided  the 
model  is  specified  correctly  (e.g.,  proportional 
hazards  assumption  is  met  for  age). 

Alternatively,  considering  subjects  I  and  J,  who 
have  entered  at  the  same  time  but  are  9  years 
different  in  age,  we  might  consider  using  age  as 
the  time  scale  to  represent  a  subject’s  potential 
for  failure,  which  we  now  describe. 
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h(a,  X)  =  ho(a)  exp[^  (3,X,] 

X  denotes  set  of  covariates, 
e.g.,  X  =  (Rx,  BMI,  SMK) 
h0(a)  =  baseline  hazard 


Age-at-event  Time-on-study 

'  V  / 

h(a,  X)  versus  h(t,  X,  ao): 

Which  to  use?  Does  it  matter? 


It  depends! 

And,  it  might  not  matter! 

(often  same  results,  if  model 
well-specified) 


Prefer  h(a,  X)  provided 

•  age  is  stronger  determinant  of 
outcome  than  time-on-study 

•  h0(a)  is  unspecified,  so  that  age 
is  not  modeled  as  a  covariate 
i.e.,  avoids  mispecifying  the 
model  as  linear  in  a0  when  a02 
also  needed 


Prefer  h(t,  X,  a0)  provided 

•  time-on-study  is  stronger 
determinant  of  outcome  than  age 

•  age  at  entry  (a0)  is  effectively 
controlled  in  the  model  using  a 
linear  and/  or  possibly  higher/ 
order  term  (or  age  is  controlled 
by  stratification) 


In  a  Cox  PH  model  that  uses  age  as  the  time 
scale  (as  shown  on  the  left),  the  outcome  vari¬ 
able  will  be  age-at-event  (a)  rather  than  time- 
on-study  (t).  X  denotes  the  set  of  covariates  in 
the  model,  e.g.,  X  =  (Rx,  BMI,  SMK).  The  base¬ 
line  hazard  function  h0(a)  is  an  unspecified 
function  of  a  (rather  than  t). 

At  this  point,  we  might  again  ask,  when,  if  at 
all,  would  using  a  model  based  on  h(a,  X)  be 
preferable  to  simply  using  a  model  of  the  form 
h(t,  X,  a0)  where  t  denotes  time-on  follow-up, 
and  a0  denotes  age  at  entry? 

The  answer  is  that  “it  depends”.  Moreover,  in 
many  situations,  it  might  not  matter,  since  use 
of  either  model  form  will  often  lead  to  essen¬ 
tially  the  same  results,  provided  the  model  is 
well-specified  in  each  case. 

On  one  hand,  using  h(a,  X)  may  be  preferable 
if  age  is  a  much  stronger  determinant  of  the 
outcome  than  time-on-study,  i.e.,  age  at  event 
may  have  a  larger  effect  on  the  hazard  than 
time-on-study  (Korn  et  al.  1997).  Also,  because 
age  is  taken  into  account  in  an  unspecified  base¬ 
line  hazard  h0(a),  a  more  effective  control  of  age 
may  result  that  avoids  the  possibility  of  mispe¬ 
cifying  the  way  the  age  at  entry  (a0)  might  be 
entered  into  a  time-on-study  model,  e.g.,  using 
only  a  linear  term  when  a  quadratic  term  such  as 
is  also  required  for  model  adequacy. 

On  the  other  hand,  h(t,  X,  a0)  may  be  preferable 
if  time-on-study  is  a  stronger  determinant  of  the 
outcome  than  age  at  the  event,  as  in  a  rando¬ 
mized  clinical  trial.  Also,  a  time-on-study  model 
would  seem  appropriate  if  age  at  entry  (a0)  is 
"effectively  controlled”  (e.g.,  using  a  quadratic 
term  if  necessaiy)  or  is  stratified  in  the  model. 
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Alternative  Cox  PH  models  for 
age-truncated  survival  data: 

Let 

t  =  follow-up  time, 
a  =  attained  age  at  event  or  censor¬ 
ship 

a0  =  age  at  enrollment  into  study 
(Note:  t  =  a-a0) 

X  =  (X1;  X2). . Xk),  vector  of  pre¬ 
dictors,  not  including  a0 
pi  =  regression  coeff.  corresponding 
to  Xj. 

y  =  regression  coeff.  if  a0  included 
in  model 

Model  0: 

h(t,X)=h0(t)expppiX1]  , 
unadjusted  for  ao 

Model  1: 

h(t,X,a0)  =h0(t)exp[Sp1Xi  +y1a0]  , 
adjusted  for  ao  as  linear  covariate 

Model  2: 

h(t,X,a0)  =  hottjexppPiX;  +  y^o  +  y2a02] 
adjusted  for  ao  with  quadratic  covariate 

Model  3: 

hg(t,X )  =  hogltjexpppiXj]  , 

^  stratified  by  ao  or  birth  cohort,  g=  1,  s 

Model  4: 

h(a,X )  =  ho(a)exp[EpiXj]  , 
unadjusted  for  left  truncation  at  ao 

Model  5: 

h(a,X)  =  ho(a|ao)exp[ZpiXj]  , 
adjusted  for  left  truncation  at  ao 

Model  6: 

hg(a,X)  =  h0g(a|ao)exp[E|3jXj]  , 
adjusted  for  left  truncation  at  ao  and 
stratified  by  birth  cohort,  g=  1,  s 


age  (I 
as  J 
time  A 
scale 


time 

on 

study 


We  now  specify  several  alternative  forms  that 
a  Cox  PH  model  might  take  to  account  for 
risk-truncated  survival  data.  As  previously 
introduced,  our  notation  uses  t  to  denote  time- 
on-study  follow-up  time,  a  to  denote  attained 
age  at  the  event  or  censorship,  a0  to  denote  age 
at  study  entry,  X  to  denote  the  vector  of  predic¬ 
tor  variables,  not  including  age,  p;  to  denote  the 
vector  of  Cox  model  coefficients  corresponding 
to  X,  and  yi  to  denote  the  coefficient  of  a0  if 
model  includes  a0. 


On  the  left,  we  provide  seven  different  Cox  PH 
models  that  might  be  considered  to  analyze 
risk-truncated  survival  data. 

Models  0-3  use  an  analysis  based  on  time-on- 
study  follow-up,  whereas  Models  4-6  consider 
age  as  the  time  scale. 


Of  all  these  models,  Model  0  is  the  least  appro¬ 
priate  since  this  model  uses  time-on-study  as 
the  outcome  and  does  not  adjust  for  age  at 
entry  (a0)  in  any  way. 
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Models  1-3  control  for  a0  differently 
Model  1 :  linear  effect  of  a0 
Model  2:  quadratic  effect  of  a0 
Model  3:  stratifies  on  a0  or  on 
birth  cohorts  defined 
from  a0  (uses  Stratified 
Cox  PH  model) 


Models  1-3  are  time-on-study  models  that 
control  for  age  at  entry  (a0),  but  do  so  differ¬ 
ently.  Model  1  controls  for  a0  as  a  covariate  and 
assumes  a  linear  effect  of  a0.  Model  2,  in  con¬ 
trast,  assumes  that  a0  has  both  linear  and 
quadratic  effects.  Model  3  stratifies  on  either 
a0  or  on  birth  cohort  defined  from  a0.  Model  3 
is  called  a  Stratified  Cox  (SC)  PH  model,  which 
we  describe  in  detail  in  Chapter  5. 


Models  1-3  reasonable 

•  if  all  study  subjects  begin  risk  at 
study  entry 

•  if  models  provide  effective 
control  of  a0 


Models  1-3  are  all  reasonable  if  we  assume  that 
study  subjects  begin  to  be  at  risk  upon  study 
entry,  as  in  a  randomized  clinical  trial.  More¬ 
over,  even  for  an  observational  study  design  in 
which  subjects  have  different  ages  at  entry, 
these  models  may  appear  justifiable  if  they 
provide  effective  control  of  a0. 


Model  3 

hg(t,  X)  =  h0g{t)  exp[]T  P,X,'] 


•  alternative  method  of  control 

•  may  account  for  advances  in 
medical  management  if 
stratified  on  birth  cohort 

•  stratifying  on  either  a0  or  on 
birth  cohort  likely  to  give  similar 
results  unless  enrollment  over 
long  time  period 


Model  3  controls  for  entry  age  by  stratifying 
either  on  age  at  entry  (a0)  or  on  birth  cohort 
based  on  a0.  Model  3  provides  an  alternative 
way  to  control  for  age  without  explicitly  putting 
a0  as  a  covariate  in  the  model  (as  was  done  in 
Models  1  and  2).  If  we  stratify  by  birth  cohort 
instead  of  by  a0,  we  can  account  for  possible 
advances  in  medical  management  in  later  birth 
cohorts.  Nevertheless,  stratifying  by  age  at  entry 
or  stratifying  by  birth  cohort  would  likely  give 
similar  results  unless  enrollment  happens  over 
a  long  period  of  time.  In  the  latter  case,  we 
recommend  stratifying  on  birth  cohort. 


Models  4-6: 

•  outcome  is  age-at-event 

•  differ  in  baseline  hazard 


Models  4-6  use  age-at-event  or  censorship 
rather  than  time-on-study  as  the  outcome  vari¬ 
able.  These  models  differ  in  the  way  the  base¬ 
line  hazard  function  is  specified. 


Model  4:  h (a,  X)  =  h0(a)  exp[J^  ff-X,] 

•  does  not  adjust  for  left- 
truncation  at  a0 

•  assumes  risk  starts  at  birth 

•  data  layout  describes  closed 
cohort 


Model  4  uses  h0(a)  to  indicate  that,  although 
age  is  the  outcome,  the  model  does  not  adjust 
for  left  truncation  at  the  entry  age  (a0).  In 
effect,  this  baseline  hazard  assumes  that  each 
subject’s  observed  risk  period  started  at  birth. 
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Q 


— ^ - 1 - 1 — I - 1 - Age 

0  a0P  aP  a0Q  aQ 

R(a)  =  [P,  Q]  using  Model  4  even 
though  Q  enrolled  after  P  failed 

Previous  example: 


Age  as  Time  Scale  w.  Left  Truncation 


65  67  69  71  73  75  77  79  81 


R(a  =  67)  =  (H,  I,  J,  K]  using  Model 
4  since  all  four  subjects  at  risk  from 
birth  x  until  H  fails  at  age  a  =  67 

Model  5:  h(a,  X)  =  h0(a|ao)  exp[J]  pjX;] 

•  adjusts  for  left-truncation  at  a0 

•  data  layout  describes  open 
cohort 

p  x 


— i - 1 - 1 — I - 1 - ^A9e 

u  a0p  aP  a0Q  aQ 

R(a)  =  [P]  and  R(a*)  =  [Q]  using 
Model  5  because  Q  enrolled  after  P 
failed 

Previous  example  with  H,  I,  J, 
K:  R(a=67)  =  (H,  I]  and  R(a=78)  = 
[J,  K}  using  Model  5  since  J  and  K 
had  not  enrolled  when  H  failed  at 
67  and  {H,  I)  were  not  used  in 
study  when  K  failed  at  78. 

Model  3  Model  6 

Stratifies  on 

birth  cohort?  Yes  Yes 

Adjusts  for  age- 

truncation?  No  Yes 


In  other  words,  Model  4  allows  keeping  in  the 
risk  set  R(aP)  any  subject  (e.g.,  subject  Q  in 
the  figure  at  left)  who  was  not  under  study  at 
age  aP  but  who  enrolled  later  (at  age  a0O). 
Here,  subject  Q  is  in  the  risk  set  R(aP)  because 
we  assume  he  is  at  risk  from  birth  (Age=0) 
when  subject  P  fails  at  aP.  The  data  layout  with 
ordered  failure  ages  is  thus  a  closed  cohort  that 
starts  with  all  subjects  in  the  risk  set  at  birth. 

If  Model  4  were  applied  to  our  previous  exam¬ 
ple  involving  four  subjects,  subjects  J  and  K 
would  be  incorrectly  included  in  the  risk  set  R 
(a=67)  when  subject  H  failed,  even  though 
both  these  subjects  were  enrolled  after  age  67. 
This  model  inappropriately  assumes  that  all 
subjects  were  at  risk  from  birth;  it  does  not 
adjust  for  age-truncation. 


Model  5,  on  the  other  hand,  accounts  for  left 
truncation  by  age  at  entry.  The  baseline  hazard 
h0(a  [  a0)  is  used  to  indicate  that  the  data  layout 
with  ordered  failure  ages  is  an  open  cohort. 
For  this  model,  the  risk  set  R(a)  at  time  a  con¬ 
tains  only  those  subjects  who  are  under  study 
at  age  a. 


If  Model  5  was  applied  to  our  previous  example, 
subjects  J  and  K,  who  had  not  enrolled  when 
subject  H  failed  at  67,  would  not  be  in  the  risk 
set  R(a=67).  Also,  subjects  H  and  I,  who  were 
no  longer  in  the  study  when  subject  K  failed  at 
age  78,  would  not  be  in  the  risk  set  R(a=78). 

Model  6  is  similar  to  Model  3  stratified  on  birth 
cohort.  However,  Model  6  adjusts  for  age  trunca¬ 
tion,  whereas  Model  3  does  not.  As  with  Model  3, 
Model  6  is  intended  to  account  for  possible 
advances  in  medical  management  in  later 
birth  cohorts.  Model  6  would  not  be  necessary 
if  we  are  considering  a  study  in  which  eveiyone 
is  enrolled  within  a  short  period  of  time. 
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Summary  about  Models  0-6: 

Models  0  and  4: 

•  Both  inappropriate 

•  Model  0  does  not  adjust  for  age 

•  Model  4  incorrectly  assumes 
that  all  subjects  are  at  risk  from 
birth 

Models  1-3,  5,  6 

•  All  adjust  for  age-at-entry  (a0) 

•  Question:  Do  they  differ  in 
practice? 

Pencina  et  al.  (2007): 

•  Compare  estimated  regression 
coefficients  for  Models  1-6 

•  Consider  Model  5  (age- 
truncated  age  scale)  most 
appropriate  conceptually 

•  Consider  Models  1  and 

2  (covariate  adjusted  for  a0) 
“attempts  to  approximate 
Model  5” 

•  Used  numerical  simulations 
and  practical  examples  from 
Framingham 

Conclusions: 

•  correct  adjustment  for  the  age 
at  entry  is  crucial 

•  Model  1  inferior  (and  biased) 

•  Little  practical  or  meaningful 
difference  between  Models  2 
through  6 


In  summary,  of  the  seven  models  we  have 
presented,  Models  0  and  4  are  inappropriate 
because  Model  0  does  not  account  for  age  at 
all  and  Model  4  ignores  age  truncation  by 
incorrectly  assuming  that  all  study  subjects 
were  observed  for  the  outcome  from  birth. 

The  other  five  models  (i.e.,  1-3,  5,  6)  all  adjust 
for  age  at  study  entry  in  some  way.  A  logical 
question  at  this  point  is  whether  in  practice, 
it  makes  a  difference  which  model  is  used  to 
analyze  age-truncated  survival  data? 

The  above  question  was  actually  addressed  by 
Pencina  et  al.  (Statist.  Med.,  2007)  by  compar¬ 
ing  Models  1-6  above  in  terms  of  the  estimated 
regression  coefficients  they  produce.  These 
authors  consider  Model  5,  the  age-truncated 
age  scale  model,  to  be  "possibly  the  most  appro¬ 
priate  refinement”  to  account  for  age-truncation. 
They  also  view  time-on-study  Models  1  and  2, 
which  use  linear  and/or  quadratic  terms  to 
adjust  for  entiy  age  as  a  covariate  as  “attempts 
to  approximate”  Model  5. 

Nevertheless,  by  considering  numerical  simula¬ 
tions  as  well  as  four  practical  examples  from  the 
Framingham  Heart  Study,  Pencina  et  al.  con¬ 
clude  that  correct  adjustment  for  the  age  at 
entry  is  crucial  in  reducing  bias  of  the  estimated 
coefficients.  The  unadjusted  age-scale  model 
(Model  1)  is  inferior  to  any  of  the  five  other 
models  considered,  regardless  of  their  choice  of 
time  scale.  Moreover,  if  correct  adjustment  for 
age  at  entry  is  made  when  considering  Models 
2-6,  their  analyses  suggest  that  there  exists 
little  if  any  practical  or  meaningful  difference 
in  the  estimated  regression  coefficients 
depending  on  the  choice  of  time  scale. 


Time-on- 

study 


Cox  PH  Regression  Coefficients  (±  se)  for  two 
CHD  risk  factors  among  men-  Framingham 
Heart  Disease  Study  (Pencina  et  al,  2007) 

Model  jAge-time-scale  ] 


r^. 

linear 


2 

quad 


3 

strat 


4 

unadj 


5  6 

age-trunc  strat 


Diabetic  versus  non-diabetic  («=2439) 


To  illustrate,  we  show  on  the  left  results  from 
Pencina  et  al.  corresponding  to  Models  1-6 
applied  to  12-year  follow-up  Framingham 
Heart  Study  data.  The  outcome  considered 
here  is  coronary  heart  disease  (CHD)  in  men. 


0.48* 

0.49* 

0.48* 

0.23* 

0.47* 

0.45* 

±0.21 

±0.21 

±0.21 

±0.20 

±0.21 

±0.21 

Education:  post-  HS 

versus  HS  or  less  ( n  =  2177) 

-0.43* 

-0.40* 

-0.43* 

0.18 

-0.43* 

-0.38* 

±0.15 

±0.15 

±0.15 

±0.15 

±0.16 

±0.15 

*  The  coefficient  is  significantly  different  from  zero  at 
the  0.05  level. 


These  results  focus  on  two  risk  factors  measured 
at  baseline:  diabetes  mellitus  status  and  edu¬ 
cation  status,  the  latter  categorized  into  two 
groups  defined  by  post-high-school  education 
(yes/no).  The  estimated  regression  coefficients 
(separately)  relating  these  two  risk  factors  to 
CHD  outcome  are  presented  in  the  table. 
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Summary  of  Framingham  results 

from  Pencina  et  al.: 

•  Model  4  inferior  to  other  models 

•  Results  for  Models  1-3,5,  and  6 
are  similar 

•  Directions  of  estimated 
coefficients  were  as  anticipated 
conceptually,  e.g.,  +  diabetes 
and  smoking  —  education 

•  Quadratic  terms  (Model  2) 
were  significant,  suggesting 
that  Model  1  is  mispecified  but 

•  Did  not  materially  influence 
magnitude  or  significance  of 
exposure  variables  (e.g., 
diabetes,  smoking,  education) 


Data  Layout  for  Age-as-Time  Scale: 

CP  format  for  age-truncated 
survival  data  (Model  5) 


Subj# 

d 

a0 

a 

X!  . 

.  Xp 

1 

di 

a01 

ai 

x„  . 

•  Xlp 

2 

ao2 

a2 

X21  ■ 

•  X2p 

3 

d3 

a03 

a3 

X31  • 

•  X3p 

n 

dn 

a0n 

an 

Xnl  . 

•  Xnp 

Model  4  layout:  Set  a0  =  0  for  all 
subjects  or  use  “standard”  layout 
(w/0  a0  column) 


Subj# 

d 

a 

X, 

...  Xp 

1 

di 

ai 

X„ 

...  Xlp 

2 

d2 

a2 

X21 

...  X2p 

3 

d3 

a3 

X31 

...  X3p 

n 

dn 

an 

Xnl 

...  Xnp 

As  expected,  the  table  shows  a  substantial  dif¬ 
ference  in  the  coefficient  of  the  risk  group  vari¬ 
able  estimated  by  the  unadjusted  age-scale 
model  (Model  4)  and  the  five  other  models. 
Moreover,  the  results  for  Models  1-3,  5,  and  6 
are  all  quite  similar. 

Pencina  also  point  out  that  the  directions  of 
coefficients  for  these  five  models  are  in  the 
directions  anticipated  conceptually,  e.g.,  diabe¬ 
tes  coefficients  are  positive,  whereas  education 
coefficients  are  negative. 

The  quadratic  baseline  age  term  (Model  2)  was 
significant  for  both  CHD  risk  factors.  This 
suggests  potential  misspecification  in  the  mod¬ 
eling  of  the  relationship  between  CHD  and  age 
introduced  by  Model  1 ,  which  treats  entry  age 
as  linear.  However,  its  inclusion  in  the  time-on- 
study  model  did  not  materially  influence  the 
magnitude  or  significance  of  the  estimated 
exposure  variable  (diabetes  or  education  status) 
coefficient. 

When  using  age  as  the  time  scale  and  account¬ 
ing  for  age  truncation  (i.e.,  using  Model  5 
above),  the  data  layout  requires  the  counting 
process  (CP)  in  start-stop  format  previously 
introduced  in  Section  VI  of  Chapter  1  with  a0 
as  the  start  variable  and  a  as  the  stop  variable. 
However,  since  we  are  not  considering  recur¬ 
rent  events  data  here,  the  CP  format  for  age- 
truncated  survival  data  has  a  simpler  form, 
involving  only  one  line  of  data  for  each  study 
subject,  as  shown  on  the  left.  The  computer 
code  needed  to  program  the  analysis  is 
described  in  the  Computer  Appendix  for 
STATA,  SAS,  SPSS,  or  R  packages. 

Note  that  the  CP  format  corresponding  to 
Model  4,  which  assumes  the  starting  time  is 
birth,  would  modify  the  Model  4  layout  by  let¬ 
ting  ao  =  0  in  the  a0  column  for  all  subjects. 
Nevertheless,  this  layout  would  be  equivalent 
to  the  “standard"  layout  that  omits  a0  column 
and  simply  treats  the  a  column  data  as  time- 
on-study  information.  Again,  since  Model  4 
appears  to  be  inferior  to  the  other  models,  we 
caution  the  reader  not  to  use  this  format  unless 
the  risk  period  was  observed  since  birth. 
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XI.  Summary 


In  this  section  we  briefly  summarize  the 
content  covered  in  this  presentation. 


1.  Review:  S  (t),  h  ( t ),  data  layout,  • 
etc. 

2.  Computer  example  of  Cox  model: 

•  estimate  HR 

•  test  hypothesis  about  HR 

•  obtain  confidence  intervals  • 

3.  Cox  model  formula: 

h(t,  X)  =  /;o(f)e,=1 


4.  Why  popular:  Cox  PH  model  is  • 
"robust” 


5.  ML  estimation:  maximize  a  a 
partial  likelihood  L  =  L(/l)  = 
joint  probability  of  observed  data 


6.  Hazard  ratio  formula: 


HR  =  exp 


EM*;-*.) 


7.  Interval  estimation-interaction:  • 
HR=exp[/], 


where  l=$x+  SjWh  +  §2W2  +  . . .  +  5kWK 
(L  =  coeff .  of  Xi ,  and 
Sj  =  coeff.  of  X  x  Wj,  j  =  1 , . . . ,  k 


95%  Cl  for  HR=expM: 
exp[i  ±  1.96^/vari)] 

Most  computer  packages,  e.g.,  SAS, 
STATA,  compute  Vart  as  part  of  the 
program  options  (see  Computer 
Appendix). 


We  began  with  a  computer  example  that 
uses  the  Cox  PH  model.  We  showed  how 
to  use  the  output  to  estimate  the  HR,  and 
how  to  test  hypotheses  and  obtain 
confidence  intervals  about  the  hazard  ratio. 
We  then  provided  the  formula  for  the  hazard 
function  for  the  Cox  PH  model  and  described 
basic  features  of  this  model.  The  most 
important  feature  is  that  the  model  contains 
two  components,  namely,  a  baseline  hazard 
function  of  time  and  an  exponential  function 
involving  X s  but  not  time. 

We  discussed  reasons  why  the  Cox  model  is 
popular,  the  primary  reason  being  that  the 
model  is  “robust”  for  many  different 
survival  analysis  situations. 

We  then  discussed  ML  estimation  of  the 
parameters  in  the  Cox  model,  and  pointed 
out  that  the  ML  procedure  maximizes  a 
“partial”  likelihood  that  focuses  on 
probabilities  at  failure  times  only. 

Next,  we  gave  a  general  formula  for 
estimating  a  hazard  ratio  that  compared 
two  specifications  of  the  I’s,  defined  as  X* 
and  X.  We  illustrated  the  use  of  this 
formula  when  comparing  two  exposure 
groups  adjusted  for  other  variables. 

We  then  described  how  to  obtain  a  95%  Cl 
for  the  HR  when  the  hazard  model  contains 
interaction  terms  of  the  form  Xj  x  Wj,  where 
X]  is  a  (0,1)  exposure  variable  and  Wj  is  an 
effect  modifier  of  exposure.  The  formula  is 
shown  at  the  left.  In  this  formula,  the  Var  i 
is  difficult  to  calculate  without  the  use  of  a 
computer  program. 

Fortunately,  most  computer  packages  have 
procedures  for  calculating  this  formula  as 
part  of  the  program  options,  e.g.,  SAS’s 
“contrast”  option  and  STATA's  “lincom” 
option. 
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8.  Adjusted  survival  curves^  0  or  1 
Comparing  E  groups: 


S(t,X)  =  [So(0] 
Single  curve: 

=  [So(0] 


exp 


exp[X^  pa] 


9.  PH  assumption:  • 

h(t,X*) 

, - =  6  (a  constant  over  t) 

h(t,  X) 

i.e.,  h(t,X*)  =  6h(t,  X) 

Hazards  cross  =>■  PH  not  met 

10.  Derivation  of  Cox  PH  Likeli¬ 
hood 

1 1 .  Using  “age-as-the-time  scale" 
instead  of  “time-on-follow-up’’  # 
Reason:  account  for  left  trun¬ 
cation  of  age 

Cox  PH  model  that  adjusts  for 
age  truncation: 

h(a,X)  =  h0(a\a0)  exp[^  (3,X,] 

where  a  =  age  at  event  or 
censorship 

a0  =  age  at  study  entry 

Data  Layout:  CP  (start-stop) 

format 


We  then  defined  an  adjusted  survival  curve 
and  presented  formulas  for  adjusted  curves 
comparing  two  groups  adjusted  for  other 
variables  in  the  model  and  a  formula  for  a 
single  adjusted  curve  that  adjusts  for  all  X’s 
in  the  model.  Computer  packages  for  these 
formulae  use  the  mean  value  of  each  X 
being  adjusted  in  the  computation  of  the 
adjusted  curve. 

We  described  the  PH  assumption  as 
meaning  that  the  hazard  ratio  is  constant 
over  time,  or  equivalently  that  the  hazard 
for  one  individual  is  proportional  to  the 
hazard  for  any  other  individual,  where  the 
proportionality  constant  is  independent  of 
time.  We  also  showed  that  for  study 
situations  in  which  the  hazards  cross,  the 
PH  assumption  is  not  met. 

We  then  showed  how  the  Cox  likelihood  is 
derived  using  ordered  failure  times. 

Finally,  we  considered  the  use  of  “age  as  the 
time  scale”  instead  of  “time-on-follow-up” 
as  the  outcome  variable,  described  why 
such  use  is  appropriate  to  account  for  left 
truncation  of  age,  provided  the  form  of  the 
Cox  PH  model  in  this  situation,  illustrate  its 
use,  and  described  the  data  layout  required 
using  a  “stop-start”  Counting  Process  (CP) 
format. 


Chapters 

1 .  Introduction  to  Survival 


4.  Evaluating  the  Proportional 
Hazards  Assumption 

5.  The  Stratified  Cox  Procedure 

6.  Extension  of  the  Cox 
Proportional  Hazards  Model 
for  Time-Dependent  Variables 


Analysis 

Kaplan-Meier  Survival  Curves 

and  the  Log-Rank  Test _ 

The  Cox  Proportional  Hazards 
Model  and  Its  Characteristics 


This  presentation  is  now  complete.  We  recom¬ 
mend  that  the  reader  review  the  detailed  out¬ 
line  that  follows  and  then  do  the  practice 
exercises  and  test. 

The  next  Chapter  (4)  describes  how  to  evaluate 
the  PH  assumption.  Chapters  5  and  6  describe 
methods  for  carrying  out  the  analysis  when  the 
PH  assumption  is  not  met. 
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Detailed 

Outline 


I.  A  computer  example  using  the  Cox  PH  model 

(pages  100-108) 

A.  Printout  shown  for  three  models  involving  leu¬ 
kemia  remission  data. 

B.  Three  explanatory  variables  of  interest:  treat¬ 
ment  status,  log  WBC,  and  product  term;  out¬ 
come  is  time  until  subject  goes  out  of  remission. 

C.  Discussion  of  how  to  evaluate  which  model  is 
best. 

D.  Similarity  to  classical  regression  and  logistic 
regression. 


II. 


The  formula  for  the  Cox  PH  model 

(pages  108-110) 


A. 


h(t,  X) 


h0(t)  exp 


E  P,x, 


0=1 


B.  h0  (t)  is  called  the  baseline  hazard  function. 

C.  X  denotes  a  collection  of  p  explanatory  vari¬ 
ables  Xlt  X2:..„  Xp. 

D.  The  model  is  semiparametric  because  h0(t)  is 
unspecified. 

E.  Examples  of  the  Cox  model  using  the  leukemia 
remission  data. 

F.  Survival  curves  can  be  derived  from  the  Cox  PH 
model. 


III.  Why  the  Cox  PH  model  is  popular  ( pages  110-112) 

A.  Can  get  an  estimate  of  effect  (the  hazard  ratio) 
without  needing  to  know  h0(t). 

B.  Can  estimate  h0(t),  h{t,  X),  and  survivor  func¬ 
tions,  even  though  h0(t)  is  not  specified. 

C.  The  e  part  of  the  formula  is  used  to  ensure  that 
the  fitted  hazard  is  nonnegative. 

D.  The  Cox  model  is  “robust":  it  usually  fits  the 
data  well  no  matter  which  parametric  model  is 
appropriate. 

IV.  ML  estimation  of  the  Cox  PH  model 

(pages  112-114) 

A.  Likelihood  function  is  maximized. 

B.  L  is  called  a  partial  likelihood,  because  it  uses 
survival  time  information  only  on  failures,  and 
does  not  use  censored  information  explicitly. 

C.  L  makes  use  of  the  risk  set  at  each  time  that  a 
subject  fails. 

D.  Inferences  are  made  using  standard  large  sam¬ 
ple  ML  techniques,  e.g.,  Wald  or  likelihood  ratio 
tests  and  large  sample  confidence  intervals 
based  on  asymptotic  normality  assumptions. 
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V. 


Computing  the  hazard  ratio  (pages  114-117) 

A.  Formula  for  hazard  ratio  comparing  two  indi¬ 
viduals,  and  X  =  (Xu  X2,  ■  . .,  Xp): 


h(t ,  X*) 
h(t ,  X) 


=  exp 


E  p.-  (** 


B.  Examples  are  given  using  a  (0,  1)  exposure  vari¬ 
able,  potential  confounders,  and  potential 
effect  modifiers. 


C.  Typical  coding  identifies  X*  as  the  group  with 
the  larger  hazard  and  X  as  the  group  with  the 
smaller  hazard,  e.g.,  X\  =  I  for  unexposed 
group  and  Xx  =  0  for  exposed  group. 


VI.  Interval  estimation:  interaction  ( pages  117-119) 

A.  Example-  Model  3  from  Remission  Time  Data 

i.  h(t,  X)  =  h0(l)exp[p)|  Rx  +  p2logWBC 

+  p3(Rx  x  logWBC)] 

ii.  HR  =  exp[p!  +  p3logWBC] 

B.  General  form  of  HR: 

HR  =  exp[(],  where 

l  =  Pi  +  8iW!  +  52W2  +  . . .  +  §k  Wk, 

Xi  =  (0,  1)  exposure  variable,  pt  =  coeff  of  X1; 
and  8j  =  coeff  of  Xj  x  Wj,  j=l,...,  k 

C.  General  form  of  95%  Cl  for  HR  =  exp[fj: 


exp 


ta  1.96  dVart) 


where 


Var0  =  Var0  +  3tWi  +  . . .  +  8kWk) 


D.  Computation  of  var  complicated. 

i.  Computer  programs,  e.g.,  in  SAS,  STATA, 
can  do  this  for  the  user. 

ii.  Otherwise,  user  must  carry  out  complicated 
calculation  using  formula  for  var: 

\ar(tj  =  War  0)  +  E  W.f Var0) 
j 

+  2  E  WjCov  0 ,  5j) 
j 

+2EEwjwkC5v(^’^k) 

j  k 


iii.  Variances  and  covariances  provided  in 
computer  output 

iv.  User  specifies  W’s  values  of  interest. 

v.  Model  3  formula  for  Vdr(£): 

Y dr  0  =  War  0)  +  (log  WBC)2Var  0) 

+  2(log  WBC)2Cov  0,  0 


E.  Example  of  95%  Cl:  Model  3 
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VII.  Adjusted  survival  curves  using  the  Cox  PH 
model  (pages  120-123) 

A.  Survival  curve  formula  can  be  obtained  from 
hazard  ratio  formula: 

S(t,X)  =  [S0(r)]expE  A*'] 

where  S0(t)  is  the  baseline  survival  function  that 
corresponds  to  the  baseline  hazard  function  h0(t). 

B.  To  graph  S(t,  X),  must  specify  values  for 
X=(XuX2,...,Xp). 

C.  To  obtain  “adjusted”  survival  curves,  usually 
use  overall  mean  values  for  the  X’s  being 
adjusted. 

D.  Examples  of  “adjusted”  S(t,  X)  using  leukemia 
remission  data. 

VIII.  The  meaning  of  the  PH  assumption 

(pages  123-127) 

A.  Hazard  ratio  formula  shows  that  hazard  ratio  is 
independent  of  time: 

h(t,X*) 
h  (t,  X) 

B.  Baseline  hazard  function  not  involved  in  the 
HR  formula. 

C.  Hazard  ratio  for  two  X’s  are  proportional: 

h(t,X*)  =  6  h(t,X) 

D.  An  example  when  the  PH  assumption  is  not 
satisfied:  hazards  cross 

IX.  The  Cox  likelihood  ( pages  127-131) 

A.  Lottery  Example 

B.  Likelihood  based  on  order  of  events 

X.  Using  age  as  the  time  scale  (pages  131  - 142) 

A.  Definition  of  Left  Truncation 

i.  Type  I:  subject  has  event  before  t0  and  not 
included  in  study 

ii.  Type  II:  t0  >  0  and  t>  t0  where  t0  time  when 
first  observed  t  =  observed  survival  time 

B.  Left  Truncation  versus  Left  Censoring 

C.  Time-on-study  versus  Age-as-time  scale:  Closed 
cohort  versus  Open  cohort 

D.  When  to  use  Age-as-time-scale 
i.  It  depends 

a.  Type  of  study 

b.  Well-defined  model  involving  a0  where 
a0  denotes  age  at  entry 
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E.  Alternative  Models 

i.  Time-on-study 

a.  outcome  is  t  =  time  since  first  observed 

b.  consider  control  for  a0,  e.g., 

h(t,  X,  a0)  =  h0(t)  exp  ^  ftXj  +  yla0  or 
h(t,  X,  a0)  =  h0(t)  exp  ^  ftXi  +  y^o  +  y2&l 

ii.  Age-as-time-scale 

a.  outcome  is  a  =  age  at  event  or  censor¬ 
ship 

b.  adjusting  for  age  truncation,  e.g., 

h(a,  X)  =  h0 (a  |  a0)  exp  or 

hg(a,  X)  =  h0g(a  |  a0)  exp  ^  &X; 

F.  Example  from  Pencina  et  al  (2007) 

i.  age-as-time-scale  model 

a.  need  to  adjust  for  age  truncation  sub¬ 
jects 

b.  incorrect  results  if  subjects  assumed  to 
be  observed  from  birth 

ii.  time-on-study  model:  when  a0  is  controlled, 

results  similar  to  age-as-time-scale  model. 

iii.  overall  recommendation:  correct  adjust¬ 
ment  for  the  age  at  entry  is  crucial 

XI.  Summary  (pages  143-144) 
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Practice 

Exercises 


1.  In  a  10-year  follow-up  study  conducted  in  Evans 
County,  Georgia,  involving  persons  60  years  or  older, 
one  research  question  concerned  evaluating  the  rela¬ 
tionship  of  social  support  to  mortality  status.  A  Cox 
proportional  hazards  model  was  fit  to  describe  the 
relationship  of  a  measure  of  social  network  to  time 
until  death.  The  social  network  index  was  denoted  as 
SNI,  and  took  on  integer  values  between  0  (poor  social 
network)  to  5  (excellent  social  network).  Variables  to  be 
considered  for  control  in  the  analysis  as  either  potential 
confounders  or  potential  effect  modifiers  were  AGE 
(treated  continuously),  RACE  (0,1),  and  SEX  (0,1). 

a.  State  an  initial  PH  model  that  can  be  used  to  assess 
the  relationship  of  interest,  which  considers  the 
potential  confounding  and  interaction  effects  of 
the  AGE,  RACE,  and  SEX  (assume  no  higher  than 
two-factor  products  involving  SNI  with  AGE, 
RACE,  and  SEX). 

b.  For  your  model  in  part  1  a,  give  an  expression  for  the 
hazard  ratio  that  compares  a  person  with  SNI  =  4  to 
a  person  with  SNI  =  2  and  the  same  values  of  the 
covariates  being  controlled. 

c.  Describe  how  you  would  test  for  interaction  using 
your  model  in  part  la.  In  particular,  state  the  null 
hypothesis,  the  general  form  of  your  test  statistic, 
with  its  distribution  and  degrees  of  freedom  under 
the  null  hypothesis. 

d.  Assuming  a  revised  model  containing  no  interac¬ 
tion  terms,  give  an  expression  for  a  95%  interval 
estimate  for  the  adjusted  hazard  ratio  comparing  a 
person  with  SNI  =  4  to  a  person  with  SNI  =  2  and 
the  same  values  of  the  covariates  in  your  model. 

e.  For  the  no-interaction  model  described  in  part  Id, 
give  an  expression  (i.e.,  formula)  for  the  estimated 
survival  curve  for  a  person  with  SNI  =  4,  adjusted 
for  AGE,  RACE,  and  SEX,  where  the  adjustment 
uses  the  overall  mean  value  for  each  of  the  three 
covariates. 

f.  Using  the  no-interaction  model  described  in  part  Id, 
if  the  estimated  survival  curves  for  persons  with 
SNI  =  4  and  SNI  =  2  adjusted  for  (mean)  AGE, 
RACE,  and  SEX  are  plotted  over  time,  will  these 
two  estimated  survival  curves  cross?  Explain  briefly. 

g.  For  the  (interaction)  model  described  in  Part  la, 
what  is  the  formula  for  the  95%  Cl  for  the  HR  that 
compares  a  person  with  SNI  =  4  to  a  person  with 
SNI  =  2  and  the  same  values  of  the  covariates  being 
controlled? 
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2.  For  this  question,  we  consider  the  survival  data  for  137 
patients  from  the  Veteran’s  Administration  Lung 
Cancer  Trial  cited  by  Kalbfleisch  and  Prentice  in  their 
book  ( The  Statistical  Analysis  of  Sun’ival  Time  Data, 
Wiley,  1980).  The  variables  in  this  dataset  are  listed  as 
follows: 


Variable#  Variable  name  Coding 


1 

Treatment 

Standard  =  1 ,  test  =  2 

Four  I 

f2 

Cell  type  1 

Large  =  1,  other  =  0 

indicator  J 

1  3 

Cell  type  2 

Adeno  =  1 ,  other  =  0 

variables  | 

I4 

Cell  type  3 

Small  =  1 ,  other  =  0 

for  cell  type  ' 

Cell  type  4 

Squamous  =  1 ,  other  =  0 

6 

Survival  time 

(Days)  integer  counts 

7 

Performance 

status 

0  =  worst, . . . ,  100  =  best 

8 

Disease  duration 

(Months)  integer  counts 

9 

Age 

(Years)  integer  counts 

10 

Prior  therapy 

None  =  0,  some  =  10 

11 

Status 

0  =  censored,  1  =  died 

For  these  data,  a  Cox  PFI  model  was  fitted  yielding  the 
following  edited  computer  results: 

Response:  survival  time 


Variable  name 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  interval] 

1  Treatment 

0.290 

0.207 

0.162 

1.336 

0.890 

2.006 

3  Adeno  cell 

0.789 

0.303 

0.009 

2.200 

1.216 

3.982 

4  Small  cell 

0.457 

0.266 

0.086 

1.579 

0.937 

2.661 

5  Squamous  cell 

-0.400 

0.283 

0.157 

0.671 

0.385 

1.167 

7  Perf.  status 

-0.033 

0.006 

0.000 

0.968 

0.958 

0.978 

8  Disease  dur. 

0.000 

0.009 

0.992 

1.000 

0.982 

1.018 

9  Age 

-0.009 

0.009 

0.358 

0.991 

0.974 

1.010 

1 0  Prior  therapy 

0.007 

0.023 

0.755 

1.007 

0.962 

1.054 

Log  likelihood  =  —475.180 


a.  State  the  Cox  PH  model  used  to  obtain  the  above 
computer  results. 

b.  Using  the  printout  above,  what  is  the  hazard 
ratio  that  compares  persons  with  adeno  cell 
type  with  persons  with  large  cell  type?  Explain 
your  answer  using  the  general  hazard  ratio  for¬ 
mula  for  the  Cox  PH  model. 
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c.  Using  the  printout  above,  what  is  the  hazard 
ratio  that  compares  persons  with  adeno  cell 
type  with  persons  with  squamous  cell  type? 
Explain  your  answer  using  the  general  hazard 
ratio  formula  for  the  Cox  PH  model. 

d.  Based  on  the  computer  results,  is  there  an  effect 
of  treatment  on  survival  time?  Explain  briefly. 

e.  Give  an  expression  for  the  estimated  survival 
curve  for  a  person  who  was  given  the  test  treat¬ 
ment  and  who  had  a  squamous  cell  type,  where 
the  variables  to  be  adjusted  are  performance 
status,  disease  duration,  age,  and  prior  therapy. 

f.  Suppose  a  revised  Cox  model  is  used  which  con¬ 
tains,  in  addition  to  the  variables  already 
included,  the  product  terms:  treatment  x  perfor¬ 
mance  status;  treatment  x  disease  duration; 
treatment  x  age;  and  treatment  x  prior  therapy. 
For  this  revised  model,  give  an  expression  for  the 
hazard  ratio  for  the  effect  of  treatment,  adjusted 
for  the  other  variables  in  the  model. 

3.  The  data  for  this  question  contain  survival  times  of  65 
multiple  myeloma  patients  (references  Krall  et  al.,  "A 
Step-up  Procedure  for  Selecting  Variables  Associated 
with  Survival  Data,”  Biometrics,  vol.  31,  pp.  49-57, 
1975).  A  partial  list  of  the  variables  in  the  dataset  is 
given  below: 

Variable  1:  observation  number 
Variable  2:  survival  time  (in  months)  from  time  of 
diagnosis 

Variable  3:  survival  status  (0  =  alive,  1  =  dead) 

Variable  4:  platelets  at  diagnosis  (0  =  abnormal, 

1  =  normal) 

Variable  5:  age  at  diagnosis  (years) 

Variable  6:  sex  (1  =  male,  2  =  female) 
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Below,  we  provide  edited  computer  results  for  several 
different  Cox  models  that  were  fit  to  this  dataset.  A  num¬ 
ber  of  questions  will  be  asked  about  these  results. 


Model  1: 


Variable 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Platelets 

0.470 

2.854 

.869 

1.600 

0.006 

429.689 

Age 

0.000 

0.037 

.998 

1.000 

0.930 

1.075 

Sex 

0.183 

0.725 

.801 

1.200 

0.290 

4.969 

Platelets  x  age 

-0.008 

0.041 

.850 

0.992 

0.915 

1.075 

Platelets  x  sex 

-0.503 

0.804 

.532 

0.605 

0.125 

2.924 

Log  likelihood  =  — 

153.040 

Model  2: 

Platelets 

-0.725 

0.401 

.071 

0.484 

0.221 

1.063 

Age 

-0.005 

0.016 

.740 

0.995 

0.965 

1.026 

Sex 

-0.221 

0.311 

.478 

0.802 

0.436 

1.476 

Log  likelihood  =  — 

153.253 

Model  3: 

Platelets 

-0.706 

0.401 

.078 

0.493 

0.225 

1.083 

Age 

-0.003 

0.015 

.828 

0.997 

0.967 

1.027 

Log  likelihood  =  — 

153.509 

Model  4: 

Platelets 

-0.705 

0.397 

.076 

0.494 

0.227 

1.075 

Sex 

-0.204 

0.307 

.506 

0.815 

0.447 

1.489 

Log  likelihood  =  — 

153.308 

Model  5: 

Platelets 

-0.694 

0.397 

.080 

0.500 

0.230 

1.088 

Log  likelihood  =  —153.533 

a.  For  model  1,  give  an  expression  for  the  hazard 
ratio  for  the  effect  of  the  platelet  variable 
adjusted  for  age  and  sex. 

b.  Using  your  answer  to  part  3a,  compute  the  esti¬ 
mated  hazard  ratio  for  a  40-year-old  male.  Also 
compute  the  estimated  hazard  ratio  for  a  50- 
year-old  female. 

c.  Carry  out  an  appropriate  test  of  hypothesis  to 
evaluate  whether  there  is  any  significant  inter¬ 
action  in  model  1 .  What  is  your  conclusion? 

d.  Considering  models  2-5,  evaluate  whether  age 
and  sex  need  to  be  controlled  as  confounders? 

e.  Which  of  the  five  models  do  you  think  is  the  best 
model  and  why? 
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f.  Based  on  your  answer  to  part  3e,  summarize  the 
results  that  describe  the  effect  of  the  platelet 
variable  on  survival  adjusted  for  age  and  sex. 

g.  Why  might  you  consider  using  age-as-the-time- 
scale  instead  of  time-on-follow-up  as  the  out¬ 
come  to  analyze  these  data? 


Test 


1 .  Consider  a  hypothetical  2-year  study  to  investigate  the 
effect  of  a  passive  smoking  intervention  program  on  the 
incidence  of  upper  respiratory  infection  (URI)  in  new¬ 
born  infants.  The  study  design  involves  the  random 
allocation  of  one  of  three  intervention  packages  (A,  B, 
C)  to  all  healthy  newborn  infants  in  Orange  County, 
North  Carolina,  during  1985.  These  infants  are  fol¬ 
lowed  for  2  years  to  determine  whether  or  not  URI 
develops.  The  variables  of  interest  for  using  a  survival 
analysis  on  these  data  are: 

T  =  time  (in  weeks)  until  URI  is  detected  or  time  until 
censored 

s  =  censorship  status  (=  1  if  URI  is  detected,  =  0  if 
censored) 

PS  =  passive  smoking  index  of  family  during  the  week 
of  birth  of  the  infant 

DC  =  daycare  status  (=  1  if  outside  daycare,  =  0  if  only 
daycare  is  in  home) 

BF  =  breastfeeding  status  (=  1  if  infant  is  breastfed,  =  0 
if  infant  is  not  breastfed) 

T i  =  first  dummy  variable  for  intervention  status  (=  1  if 
A,  =  0  if  B,  =  -1  if  C) 

T2  =  second  dummy  variable  for  intervention  status 
(=  l  ifB,  =  Oif  A,  =  -1  if  C). 

a.  State  the  Cox  PH  model  that  would  describe  the 
relationship  between  intervention  package  and  sur¬ 
vival  time,  controlling  for  PS,  DC,  and  BF  as  con- 
founders  and  effect  modifiers.  In  defining  your 
model,  use  only  two  factor  product  terms  involving 
exposure  (i.e.,  intervention)  variables  multiplied  by 
control  variables  in  your  model. 

b.  Assuming  that  the  Cox  PH  model  is  appropriate, 
give  a  formula  for  the  hazard  ratio  that  compares 
a  person  in  intervention  group  A  with  a  person  in 
intervention  group  C,  adjusting  for  PS,  DC,  and  BF, 
and  assuming  interaction  effects. 

c.  Assuming  that  the  PH  model  in  part  la  is  appropri¬ 
ate,  describe  how  you  would  carry  out  a  chunk  test 
for  interaction;  i.e.,  state  the  null  hypothesis, 
describe  the  test  statistic  and  give  the  distribution 
of  the  test  statistic  and  its  degrees  of  freedom  under 
the  null  hypothesis. 


1 54  3.  The  Cox  Proportional  Hazards  Model  and  Its  Characteristics 

d.  Assuming  no  interaction  effects,  how  would  you 
test  whether  packages  A,  B,  and  C  are  equally  effec¬ 
tive,  after  controlling  for  PS,  DC,  and  BF  in  a  Cox 
PH  model  without  interaction  terms  (i.e.,  state  the 
two  models  being  compared,  the  null  hypothesis, 
the  test  statistic,  and  the  distribution  of  the  test 
statistic  under  the  null  hypothesis). 

e.  For  the  no-interaction  model  considered  in  parts  lc 
and  1  d,  give  an  expression  for  the  estimated  survival 
curves  for  the  effect  of  intervention  A  adjusted  for 
PS,  DC,  and  BF.  Also,  give  similar  (but  different) 
expressions  for  the  adjusted  survival  curves  for 
interventions  B  and  C. 

2.  The  data  for  this  question  consists  of  a  sample  of  50 
persons  from  the  1967-1980  Evans  County  Study. 
There  are  two  basic  independent  variables  of  interest: 
AGE  and  chronic  disease  status  (CHR),  where  CHR  is 
coded  as  0  =  none,  1  =  chronic  disease.  A  product  term 
of  the  form  AGE  x  CHR  is  also  considered.  The  depen¬ 
dent  variable  is  time  until  death,  and  the  event  is  death. 
The  primary  question  of  interest  concerns  whether 
CHR,  considered  as  the  exposure  variable,  is  related 
to  survival  time,  controlling  for  AGE.  The  edited  output 
of  computer  results  for  this  question  is  given  as  follows: 


Model  1: 


Variable 

Coef. 

Std.  Err.  Chi-sq 

p  >  M 

CHR 

0.8595 

0.3116  7.61 

.0058 

Log  likelihood  =  —142.87 

Model  2: 

CHR 

0.8051 

0.3252  6.13 

.0133 

AGE 

0.0856 

0.0193  19.63 

.0000 

Log  likelihood  =  —132.45 

Model  3: 

CHR 

1.0009 

2.2556  0.20 

.6572 

AGE 

0.0874 

0.0276  10.01 

.0016 

CHR  x  AGE 

-0.0030 

0.0345  0.01 

.9301 

Log  likelihood  =  —132.35 
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a.  State  the  Cox  PH  model  that  allows  for  main 
effects  of  CHR  and  AGE  as  well  as  the  interaction 
effect  of  CHR  with  AGE. 

b.  Carry  out  the  test  for  significant  interaction;  i.e., 
state  the  null  hypothesis,  the  test  statistic,  and  its 
distribution  under  the  null  hypothesis.  What  are 
your  conclusions  about  interaction? 

c.  Assuming  no  interaction,  should  AGE  be  con¬ 
trolled?  Explain  your  answer  on  the  basis  of  con¬ 
founding  and/or  precision  considerations. 

d.  If,  when  considering  plots  of  various  hazard  func¬ 
tions  over  time,  the  hazard  function  for  persons 
with  CHR  =  1  crosses  the  hazard  function  for  per¬ 
sons  with  CHR  =  0,  what  does  this  indicate  about 
the  use  of  any  of  the  three  models  provided  in  the 
printout? 

e.  Using  model  2,  give  an  expression  for  the  esti¬ 
mated  survival  curve  for  persons  with  CHR  =  1, 
adjusted  for  AGE.  Also,  give  an  expression  for  the 
estimated  survival  curve  for  persons  with  CHR  =  0, 
adjusted  for  AGE. 

f.  What  is  your  overall  conclusion  about  the  effect  of 
CHR  on  survival  time  based  on  the  computer  results 
provided  from  this  study? 

3 .  The  data  for  this  question  contain  remission  times  of  42 
multiple  leukemia  patients  in  a  clinical  trial  of  a  new 
treatment.  The  variables  in  the  dataset  are  given  below: 
Variable  1:  survival  time  (in  weeks) 

Variable  2:  status  (1  =  in  remission,  0  =  relapse) 
Variable  3:  sex  (1  =  female,  0  =  male) 

Variable  4:  log  WBC 

Variable  5:  Rx  status  (1  =  placebo,  0  =  treatment) 
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Below,  we  provide  computer  results  for  several  differ¬ 
ent  Cox  models  that  were  fit  to  this  dataset.  A  number 
of  questions  will  be  asked  about  these  results  starting 
below. 


Model  1: 


Variable 

Coef. 

Std.  Err.  p  >  |z| 

Haz.  Ratio 

[95%  Conf.Interval] 

Rx 

0.894 

1.815 

.622 

2.446 

0.070 

85.812 

Sex 

-1.012 

0.752 

.178 

0.363 

0.083 

1.585 

log  WBC 

1.693 

0.441 

.000 

5.437 

2.292 

12.897 

Rx  x  Sex 

1.952 

0.907 

.031 

7.046 

1.191 

41.702 

Rx  x  log  WBC 

-0.151 

0.531 

.776 

0.860 

0.304 

2.433 

Log  likelihood  = 

-69.515 

Model  2: 

Rx 

0.405 

0.561 

.470 

1.500 

0.499 

4.507 

Sex 

-1.070 

0.725 

.140 

0.343 

0.083 

1.422 

log  WBC 

1.610 

0.332 

.000 

5.004 

2.610 

9.592 

Rx  x  Sex 

2.013 

0.883 

.023 

7.483 

1.325 

42.261 

Log  likelihood  = 

-69.555 

Model  3: 

Rx 

0.587 

0.542 

.279 

1.798 

0.621 

5.202 

Sex 

-1.073 

0.701 

.126 

0.342 

0.087 

1.353 

Rx  x  Sex 

1.906 

0.815 

.019 

6.726 

1.362 

33.213 

Log  likelihood  = 

-83.475 

Model  4: 

Rx 

1.391 

0.457 

.002 

4.018 

1.642 

9.834 

Sex 

0.263 

0.449 

.558 

1.301 

0.539 

3.139 

log  WBC 

1.594 

0.330 

.000 

4.922 

2.578 

9.397 

Log  likelihood  =  —72.109 


a.  Use  the  above  computer  results  to  carry  out  a  chunk 
test  to  evaluate  whether  the  two  interaction  terms  in 
model  1  are  significant.  What  are  your  conclusions? 

b.  Evaluate  whether  you  would  prefer  model  1  or 
model  2.  Explain  your  answer. 

c.  Using  model  2,  give  an  expression  for  the  hazard 
ratio  for  the  effect  of  the  Rx  variable  adjusted  for 
SEX  and  log  WBC. 

d.  Using  your  answer  in  part  3c,  compute  the  hazard 
ratio  for  the  effect  of  Rx  for  males  and  for  females 
separately. 

e.  By  considering  the  potential  confounding  of  log 
WBC,  determine  which  of  models  2  and  3  you  pre¬ 
fer.  Explain. 

f.  Of  the  models  provided  which  model  do  you  con¬ 
sider  to  be  best?  Explain. 
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Answers  to 

Practice 

Exercises 


t. 


2. 


a.  h(t,X)  =  h0(t )  exp[p!  SNI  +  p2  AGE  +  p3  RACE 
+  p4  SEX  +  p5  SNI  x  AGE  +  p6  SNI  x  RACE 
+  p7  SNI  x  SEX] 

b.  HR  =  exp[2p!  +  2(AGE)p5  +  2(RACE)p6  +  2(SEX)p7] 

c.  H0:  p5  =  p6  =  p7  =  0.  Likelihood  ratio  test  statistic: 
—2  In  Lr  —  (—2  In  LF),  which  is  approximately  X\ 
under  H0,  where  R  denotes  the  reduced  model  (con¬ 
taining  no  product  terms)  under  H0,  and  F  denotes 
the  full  model  (given  in  part  la  above). 

d.  95%  Cl  for  adjusted  HR: 


e. 

f. 


g- 

a. 

b. 


exp 


2/?!  ±  1.96  x  2  Jwdr[p^j 


S(t  X)  =  [50(f)]exP[4^1+(AGE)l®2+(RACE)ft+(SEx)A] 


The  two  su  rvi val  curves  will  not  cross,  because  both 
are  computed  using  the  same  proportional  hazards 
model,  which  has  the  property  that  the  hazard  func¬ 
tions,  as  well  as  their  corresponding  estimated  sur¬ 
vivor  functions,  will  not  cross. 

95%  Cl  for  HR  =  exp[f]:  exp \l  ±  1.96\/Var  l~\ 
where  t  =  2(h  +  2(AGE)p5  +  2(RACE)p6  +  2(SEX)p7 
h  (t,X)  =  ho  ( t )  exp  [pi^G  +  P3X3  +  P4A4  +  P5A5  +  p7X7 
+  ■  ■  •  +  P10X10] 


Adeno  cell  type:  X*  =  (treatment,  1,  0,  0,  perfstat, 
disdur,  age,  prther) 

Large  cell  type:  X  =  (treatment,  0,  0,  0,  perfstat, 
disdur,  age,  prther) 


h(t,  X *) 

HR  =  y  ’  =  exp 
h(t,  X) 


i=  1 


=  exp  [0  +  /J3(l  -0)  +^4(0-0) 
+  ^5(0  —  0)  +  0  +  . . .  +  0] 


=  .  exp 


/k,  =  exp[0.789]  =  2.20 


c.  Adeno  cell  type:  X*  =  (treatment,  1,  0,  0,  perfstat, 
disdur,  age,  prther) 

Squamous  cell  type:  X  =  (treatment,  0,  0,  1,  perfstat, 
disdur,  age,  prther) 


HR  = 


h(t,  X*) 

h(t ~  X) 


=  exp 


£&(*?-*<) 


=  exp  [0  +  j83(l-0)  +  ^4(0-0) 


+  P  5(0  - 1)  +  0  +  . . .  +  0] 


=  exp 


P3  ~  Ps 


exp  [0.789 


-  (-0.400)]  =  exp  [1.189]  =  3.28 
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d.  There  does  not  appear  to  be  an  effect  of  treatment 
on  survival  time,  adjusted  for  the  other  variables  in 
the  model.  The  hazard  ratio  is  1.3,  which  is  close  to 
the  null  value  of  one,  the  p-value  of  0.162  for  the 
Wald  test  for  treatment  is  not  significant,  and  the 
95%  confidence  interval  for  the  treatment  effect 

correspondingly  includes  the  null  value. _ 

e  S(t  X)—  \S0{t)]  exp[2ft+ft+(perfstat)^+  (disdur)ft+(age)&+(Prther)fto] 

f'  HR  =  ^  =  exp[/?j  +  (perfstat)jSn  +  (disdur)/?12 

+  (age)/?!,  +  (prther)/?14] 

where  (h  is  the  coefficient  of  the  treatment  variable 
and  Pn,  p12,  P13,  and  p14  are  the  coefficients  of  prod¬ 
uct  terms  involving  treatment  with  the  four  vari¬ 
ables  indicated. 

3.  a.  HR  =  exp[0.470  +  (— 0.008)age  +  (— 0.503)sex] 

b.  40-year-old  male: 

HR  =  exp  [0.470  +  (-0.008)40  +  (-0.503)1]  =  0.70 
50-year-old  Female: 

HR  =  exp  [0.470  +  (-0.008)50  +  (-0.503)2]  =  0.39 

c.  The  LR  (chunk)  test  for  the  significance  of  both 
interaction  terms  simultaneously  yields  the  follow¬ 
ing  likelihood  ratio  statistic  which  compares  mod¬ 
els  1  and  2: 

LR  =  [(-2  x  -153.253)  -  (-2  x  -153.040)] 

=  306.506  -  306.080  =  0.426 

This  statistic  is  approximately  chi-square  with 
2  degrees  of  freedom  under  the  null  hypothesis  of 
no  interaction.  This  LR  statistic  is  highly  nonsignif¬ 
icant.  Thus,  we  conclude  that  there  is  no  significant 
interaction  in  the  model  (1). 

d.  The  gold-standard  hazard  ratio  is  0.484,  which  is 
obtained  for  model  2.  Note  that  model  2  contains  no 
interaction  terms  and  controls  for  both  covariates 
of  interest.  When  either  age  or  sex  or  both  are 
dropped  from  the  model,  the  hazard  ratio  (for  pla¬ 
telets)  does  not  change  appreciably.  Therefore,  it 
appears  that  neither  age  nor  sex  need  to  be  con¬ 
trolled  for  confounding. 
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e.  Models  2-5  are  all  more  or  less  equivalent,  since 
they  all  give  essentially  the  same  hazards  ratio  and 
confidence  interval  for  the  effect  of  the  platelet  var¬ 
iable.  A  political  choice  for  best  model  would  be  the 
gold-standard  model  (2),  because  the  critical 
reviewer  can  see  both  age  and  sex  being  controlled 
in  model  2. 

f.  •  The  point  estimate  of  the  hazard  ratio  for  normal 

versus  abnormal  platelet  count  is  0.484  =  1/2.07, 
so  that  the  hazard  for  an  abnormal  count  is  twice 
that  for  a  normal  count. 

•  There  is  a  borderline  significant  effect  of  platelet 
count  on  survival  adjusted  for  age  and  sex 
(P  =.071). 

•  The  95%  Cl  for  the  hazard  ratio  is  given  by  0.221 
<  HR  <  1.063,  which  is  quite  wide  and  therefore 
shows  a  very  imprecise  estimate. 

4 .  Subjects  may  already  at  risk  for  the  outcome  prior  to  their 
study  entry  (at  diagnosis) .  If  so,  then  the  time  at  risk  prior 
to  study  entry  contributes  to  the  true  survival  time  (say, 
T)  for  the  individual,  although  only  the  observed  time-on- 
study  (t),  is  actually  available  to  be  analyzed.  The  indivi¬ 
dual’s  survival  time  is  therefore  underestimated  by  the 
time-on-study  information  (obtained  from  study  entry), 
i.e.,  the  true  survival  time  is  left-truncated.  However,  if 
age-as-the-time-scale  as  the  outcome  is  considered,  then 
it  is  possible  to  adjust  for  this  left  truncation  by  using  age 
at  entry  in  a  hazard  model  of  the  form 

h(a,  X)  =  h0(a|a0)  exp  ^  ftXj 

where  a  denotes  age  at  follow-up,  a0  denotes  age  at 
study  entry,  and  h0(a|a0)  is  a  baseline  hazard  that 
adjusts  for  age  truncation  at  a0. 
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Introduction 


Abbreviated 

Outline 


We  begin  with  a  brief  review  of  the  characteristics  of  the 
Cox  proportional  hazards  (PH)  model.  We  then  give  an 
ovei'view  of  three  methods  for  checking  the  PH  assump¬ 
tion:  graphical,  goodness-of-fit  (GOF),  and  time-dependent 
variable  approaches. 

We  then  focus  on  each  of  the  above  approaches,  starting 
with  graphical  methods.  The  most  popular  graphical 
approach  involves  the  use  of  “log-log”  survival  curves. 
A  second  graphical  approach  involves  the  comparison  of 
"observed”  with  "expected”  survival  curves. 

The  GOF  approach  uses  a  test  statistic  or  equivalent 
p-value  to  assess  the  significance  of  the  PH  assumption. 
We  illustrate  this  test  and  describe  some  of  its  advantages 
and  drawbacks. 

Finally,  we  discuss  the  use  of  time-dependent  variables  in 
an  extended  Cox  model  as  a  third  method  for  checking  the 
PH  assumption.  A  more  detailed  description  of  the  use  of 
time-dependent  variables  is  provided  in  Chapter  6. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Background  (pages  164-165) 

II.  Checking  the  proportional  hazards  assumption: 
overview  ( pages  165-167) 

III.  Graphical  approach  1 :  log-log  plots 

(pages  167-175) 

IV.  Graphical  approach  2:  observed  versus  expected 
plots (pages  175-180) 

V.  The  goodness-of-fit  (GOF)  testing  approach 

(pages  181-183) 

VI.  Assessing  the  PH  assumption  using  time- 
dependent  covariates  (pages  183-187) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  three  general  approaches  for 
evaluating  the  PH  assumption. 

2.  Summarize  how  log-log  survival  curves  may  be  used  to 
assess  the  PH  assumption. 

3 .  Summarize  how  observed  versus  expected  plots  may  be 
used  to  assess  the  PH  assumption. 

4.  Summarize  how  GOF  tests  may  be  used  to  assess  the 
PH  assumption. 

5 .  Summarize  how  time-dependent  variables  may  be  used 
to  assess  the  PH  assumption. 

6.  Describe  —  given  survival  data  or  computer  output 
from  a  survival  analysis  that  uses  a  Cox  PH  model  — 
how  to  assess  the  PH  assumption  for  one  or  more 
variables  in  the  model  using: 

a.  a  graphical  approach 

b.  the  GOF  approach 

c.  an  extended  Cox  model  with  time-dependent 
covariates 

7.  State  the  formula  for  an  extended  Cox  model  that 
provides  a  method  for  checking  the  PH  assumption  for 
one  or  more  of  the  time-independent  variables  in  the 
model,  given  survival  analysis  data  or  computer  output 
that  uses  a  Cox  PH  model. 
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Presentation 


This  presentation  describes  three  approaches 
for  evaluating  the  proportional  hazards  (PH) 
assumption  of  the  Cox  model  —  a  graphical 
procedure,  a  goodness-of-fit  testing  procedure, 
and  a  procedure  that  involves  the  use  of  time- 
dependent  variables. 


I.  Background 


Cox  PH  model: 

Eft*,- 

h(t,  X)  =  ho(t)ei=l 


X  =  (X|,  X2).  ■  -,  Xp)  explanatory/ 
predictor  variables 


Recall  from  the  previous  chapter  that  the  gen¬ 
eral  form  of  the  Cox  PH  model  gives  an  expres¬ 
sion  for  the  hazard  at  time  t  for  an  individual 
with  a  given  specification  of  a  set  of  explana¬ 
tory  variables  denoted  by  the  bold  X. 

The  Cox  model  formula  says  that  the  hazard  at 
time  /  is  the  product  of  two  quantities.  The  first 
of  these,  h0(t),  is  called  the  baseline  hazard 
function.  The  second  quantity  is  the  exponential 
expression  e  to  the  linear  sum  of  fS,X,:,  where  the 
sum  is  over  the  p  explanatory  X  variables. 


hojt)  x 
Baseline  hazard 
Involves  t  but 
not  X’s 


Eft*, 
e  •=' 

Exponential 
Involves  X'  shut 
not  /  (X’s  are  time 
independent) 


An  important  feature  of  this  formula,  which 
concerns  the  proportional  hazards  (PH) 
assumption,  is  that  the  baseline  hazard  is  a 
function  of  t,  but  does  not  involve  the  Xs, 
whereas  the  exponential  expression  involves 
the  X’s,  but  does  not  involve  t.  The  X’s  here  are 
called  time-independent  X’s. 


X’s  involving  t:  time-dependent 
Requires  extended  Cox  model 
(no  PH) 


Chapter  6 


It  is  possible,  nevertheless,  to  consider  X’s 
that  do  involve  t.  Such  Xs  are  called  time- 
dependent  variables.  If  time-dependent  vari¬ 
ables  are  considered,  the  Cox  model  form  may 
still  be  used,  but  such  a  model  no  longer  satis¬ 
fies  the  PH  assumption,  and  is  called  the 
extended  Cox  model.  We  will  discuss  this 
extended  Cox  model  in  Chapter  6  of  this  series. 
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Hazard  ratio  formula: 


HR  =  exp 


Eft  (*?-*.•) 


i=l 


where  X*  =  (X|,  X*,---  ,X^j 

and  X  =  (Xu  X2>. . Xp) 
denote  the  two  sets  of  Xs. 


Adjusted  survival  curves 
comparing  E  groups:  ^ 


S(t,X)  =  [S0(t)\ 


exp 


¥i 


Single  curve: 


S(t,X)  =  [So(0] 


exp 


Ew 


From  the  Cox  PH  model,  we  can  obtain  a 
general  formula,  shown  here,  for  estimating  a 
hazard  ratio  that  compares  two  specifications 
of  the  I's,  defined  as  X*  and  X. 


We  can  also  obtain  from  the  Cox  model  an 
expression  for  an  adjusted  survival  curve. 
Here  we  show  a  general  formula  for  obtaining 
adjusted  survival  curves  comparing  two  groups 
adjusted  for  other  variables  in  the  model. 
Below  this,  we  give  a  formula  for  a  single 
adjusted  survival  curve  that  adjusts  for  all  I’s 
in  the  model.  Computer  packages  for  these  for¬ 
mulae  use  the  mean  value  of  each  X  being 
adjusted  in  the  computation  of  the  adjusted 
curve. 


PH  assumption: 


h(t,  X) 
i.e.,  h(t,X*) 


constant  over  t 
=  Qh(t,  X) 


The  Cox  PH  model  assumes  that  the  hazard 
ratio  comparing  any  two  specifications  of  pre¬ 
dictors  is  constant  over  time.  Equivalently,  this 
means  that  the  hazard  for  one  individual  is 
proportional  to  the  hazard  for  any  other  indi¬ 
vidual,  where  the  proportionality  constant  is 
independent  of  time. 


Hazards  cross:  =>■  PH  not  met  The  PH  assumption  is  not  met  if  the  graph  of 

the  hazards  cross  for  two  or  more  categories  of 
a  predictor  of  interest.  However,  even  if  the 
Hazards  don’t  cross  PH  met  hazard  functions  do  not  cross,  it  is  possible 

that  the  PH  assumption  is  not  met.  Thus, 
rather  than  checking  for  crossing  hazards,  we 
must  use  other  approaches  to  evaluate  the  rea¬ 
sonableness  of  the  PH  assumption. 


II.  Checking  the  There  are  three  general  approaches  for  asses- 

Proportional  Hazards  Sing  the  PH  assumption,  again  listed  here.  We 
Assumption:  Overview  now  briefly  overview  each  approach,  starting 

with  graphical  techniques. 

Three  approaches: 

(•  graphical  ) 

•  goodness-of-fit  test 

•  time-dependent  variables 
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Graphical  techniques: 

-ln(-ln)  S  curves  parallel? 

— ln(— In)  S 

”"H - , 

I _ 

l  |  Males _ 

i _ 

Females 

Time 


There  are  two  types  of  graphical  techniques 
available.  The  most  popular  of  these  involves 
comparing  estimated  -ln(-ln)  survivor  curves 
over  different  (combinations  of)  categories  of 
variables  being  investigated.  We  will  des¬ 
cribe  such  curves  in  detail  in  the  next  section. 
Parallel  curves,  say  comparing  males  with 
females,  indicate  that  the  PH  assumption  is 
satisfied,  as  shown  in  this  illustration  for  the 
variable  Sex. 


Observed  vs.  predicted:  Close? 
S 


Predicted  for  males 
(sex  in  model) 

Observed  for  males 


An  alternative  graphical  approach  is  to  com¬ 
pare  observed  with  predicted  survivor  curves. 
The  observed  curves  are  derived  for  categories 
of  the  variable  being  assessed,  say,  Sex,  with¬ 
out  putting  this  variable  in  a  PH  model.  The 
predicted  curves  are  derived  with  this  variable 
included  in  a  PH  model.  If  observed  and  pre¬ 
dicted  curves  are  close,  then  the  PH  assump¬ 
tion  is  reasonable. 


Goodness-of-fit  (GOF)  tests: 

•  Large  sample  Z  or  chi-square 
statistics 

•  Gives  p-value  for  evaluating  PH 
assumption  for  each  variable  in 
the  model. 

p-value  large  =>  PH  satisfied 
(e.g.  P  >  0.10) 

p-value  small  =>•  PH  not  satisfied 
(e.g.  P  <  0.05) 


A  second  approach  for  assessing  the  PH 
assumption  involves  goodness-of-fit  (GOF) 
tests.  This  approach  provides  large  sample  Z 
or  chi-square  statistics  which  can  be  computed 
for  each  variable  in  the  model,  adjusted  for  the 
other  variables  in  the  model.  A  p-value  derived 
from  a  standard  normal  statistic  is  also  given 
for  each  variable.  This  p-value  is  used  for 
evaluating  the  PH  assumption  for  that  variable. 
A  nonsignificant  (i.e.,  large)  p-value,  say 
greater  than  0.10,  suggests  that  the  PH  assump¬ 
tion  is  reasonable,  whereas  a  small  p-value,  say 
less  than  0.05,  suggests  that  the  variable  being 
tested  does  not  satisfy  this  assumption. 


Time-dependent  covariates: 

Extended  Cox  model: 

Add  product  term  involving  some 
function  of  time. 


When  time-dependent  variables  are  used  to 
assess  the  PH  assumption  for  a  time-indepen- 
dent  variable,  the  Cox  model  is  extended  to 
contain  product  (i.e.,  interaction)  terms 
involving  the  time-independent  variable  being 
assessed  and  some  function  of  time. 
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EXAMPLE 


h(t,  X )  =  h0(t)  exp[(3Sex  +  S(Sex  x  t)] 
8^0.  PH  assumption  violated 


For  example,  if  the  PH  assumption  is  being 
assessed  for  Sex,  a  Cox  model  might  be 
extended  to  include  the  variable  “Sex  x  t”  in 
addition  to  Sex.  If  the  coefficient  of  the  prod¬ 
uct  term  turns  out  to  be  significant,  we  can 
conclude  that  the  PH  assumption  is  violated 
for  Sex. 


GOF  provides  test  statistic 
Graphical:  subjective 
Time-dependent:  computationally 
cumbersome 

GOF:  global,  may  not  detect  specific 
departures  from  PH 


The  GOF  approach  provides  a  single  test 
statistic  for  each  variable  being  assessed.  This 
approach  is  not  as  subjective  as  the  graphical 
approach  nor  as  cumbersome  computationally 
as  the  time-dependent  variable  approach. 
Nevertheless,  a  GOF  test  may  be  too  "global” 
in  that  it  may  not  detect  specific  departures 
from  the  PH  assumption  that  may  be  observed 
from  the  other  two  approaches. 


III.  Graphical  Approach  1 : 
Log-Log  Plots 

•  log-log  survival  curves 

•  observed  versus  expected 
survival  curves 


The  two  graphical  approaches  for  checking 
the  PH  assumption  are  comparing  log-log 
survival  curves  and  comparing  observed  versus 
expected  survival  curves.  We  first  explain  what 
a  —In  —In  survival  curve  is  and  how  it  is  used. 


log— log  S  =  transformation  of  S 
=  —In— In  S ) 

•  In  S  is  negative  =>  —  (In  S)  is 

positive. 

•  can't  take  log  of  In  S,  but  can 
take  log  of  (—In  S). 

•  -ln(— In  S)  may  be  positive  or 
negative. 


A  log-log  survival  curve  is  simply  a  transforma¬ 
tion  of  an  estimated  survival  curve  that  results 
from  taking  the  natural  log  of  an  estimated 
survival  probability  twice.  Mathematically,  we 
write  a  log-log  curve  as  — ln(-ln  S).  Note  that 
the  log  of  a  probability  such  as  S  is  always  a 
negative  number.  Because  we  can  only  take 
logs  of  positive  numbers,  we  need  to  negate 
the  first  log  before  taking  the  second  log.  The 
value  for  — ln(— In  S)  may  be  positive  or  nega¬ 
tive,  either  of  which  is  acceptable,  because  we 
are  not  taking  a  third  log.1 


'An  equivalent  way  to  write  to  —  ln(— InS)  is 
—  ln(/g  h(u)du ) ,  where  fg  h(u)du  is  called  the  "cumulative 
hazard”  function.  This  result  follows  from  the  formula 
S(t)  =  exp[—  fgh(u)du],  which  relates  the  survivor  func¬ 
tion  to  the  hazard  function  (see  p.  15  in  Chapter  1). 
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EXAMPLE 


As  an  example,  in  the  graph  at  left,  the 
estimated  survival  probability  of  0.54  is  trans¬ 
formed  to  a  log-log  value  of  0.484.  Similarly, 
the  point  0.25  on  the  survival  curve  is  trans¬ 
formed  to  a  —In  —In  value  of  —0.327. 

Note  that  because  the  survival  curve  is  usually 
plotted  as  a  step  function,  so  will  the  log-log 
curve  be  plotted  as  a  step  function. 


EXAMPLE 


(s  =  0.54:)  want  -ln(-ln  0.54) 

— ln(— In  0.54)  =  — ln(0.616) 

since  ln(0.54)  =  -0.616 
-ln(0.616)  =  0.484 

since  ln(0. 616)  = -0.484 
Thus,  (-ln(-ln  0.54)  =  0.484  ] 


To  illustrate  the  computation  of  a  log-log  value, 
suppose  we  start  with  an  estimated  survival 
probability  of  0.54.  Then  the  log-log  trans¬ 
formation  of  this  value  is  — ln(— In  0.54),  which 
is  — ln(0.616),  because  ln(0.54)  equals  -0.616. 
Now,  continuing  further,  — ln(0.616)  equals 
0.484,  because  ln(0.616)  equals  -0.484.  Thus, 
the  transformation  —  ln(— In  0.54)  equals  0.484. 


ANOTHER  EXAMPLE 


fs  =  0.25:)  want  — ln(— In  0.25) 

— ln(— In  0.25)  =  -ln(  1.386)  =  -0.327 
Thus,  (— ln(— In  0.25)  =  -0.327) 


As  another  example,  if  the  estimated  survival 
probability  is  0.25,  then  -ln(— In  0.25)  equals 
-ln(  1.386),  which  equals  -0.327. 


y-axis  scale: 


1 

0 


S 


+00 

— oo 


ln(l  —  ln)5 


Note  that  the  scale  of  the  y-axis  of  an  estimated 
survival  curve  ranges  between  0  and  1 ,  whereas 
the  corresponding  scale  for  a  — ln(— In)  curve 
ranges  between  -oo  and  +oo. 


log-log  S  for  the  Cox  PH  model:  We  now  show  why  the  PH  assumption  can  be 

assessed  by  evaluating  whether  or  not  log-log 
curves  are  parallel.  To  do  this,  we  must  first 
describe  the  log-log  formula  for  the  Cox  PH 
model. 
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Cox  PH  hazard  function: 


Eft* 

h(t,  X)  =  /2o(f)e,=1 

From  math 

Cox  PH  survival  function: 

P 

x>*< 

5(t,X)  =  isQ(t)Y'=l 

Baseline  survival  function. 


We  start  with  the  formula  for  the  survival  curve 
that  corresponds  to  the  hazard  function  for  the 
Cox  PH  model.  Recall  that  there  is  a  mathemat¬ 
ical  relationship  between  any  hazard  function 
and  its  corresponding  survival  function.  We 
therefore  can  obtain  the  formula  shown  here 
for  the  survival  curve  for  the  Cox  PH  model. 
In  this  formula,  the  expression  S0(t)  denotes 
the  baseline  survival  function  that  corresponds 
to  the  baseline  hazard  function  h0(t). 


log-log^  takes  logs  twice  The  log-log  formula  requires  us  to  take  logs  of 

this  survival  function  twice.  The  first  time  we 
take  logs  we  get  the  expression  shown  here. 


log  #1: 

Ek 

In  5(f,X)  =  e'=i  x  In  So(t) 

0  <  S(t,X)  <  1 

ln(probability)  =  negative  value,  so 
In  S(t,X)  and  In  ,S0(/)  are  negative. 

But  —In  S(t,X )  is  positive,  which 
allows  us  to  take  logs  again. 


Now  since  S(t,  X)  denotes  a  survival  probabil¬ 
ity,  its  value  for  any  t  and  any  specification  of 
the  vector  X  will  be  some  number  between 
0  and  1.  It  follows  that  the  natural  log  of  any 
number  between  0  and  1  is  a  negative  number, 
so  that  the  log  of  S(t,  X)  as  well  as  the  log  of 
S0(t)  are  both  negative  numbers.  This  is  why  we 
have  to  put  a  minus  sign  in  front  of  this  expres¬ 
sion  before  we  can  take  logs  a  second  time, 
because  there  is  no  such  thing  as  the  log  of  a 
negative  number. 


log  #2: 

ln[-  lnS(r,  X)] 


=  In 


=  In 


— e‘='  x  In  So(0 


-  ln[—  In  S0(t)} 

=  Y  Pix<  +  ln[-  In  Xo(0] 

/=1 

=  Y  &x»+ln[- InSoW] 


or 


-ln[-ln  S(t,  X)] 

=  ~Y  In  So(0] 


Thus,  when  taking  the  second  log,  we  must 
obtain  the  log  of  —In  S(t,  X),  as  shown  here. 
After  using  some  algebra,  this  expression  can 
be  rewritten  as  the  sum  of  two  terms,  one  of 
which  is  the  linear  sum  of  the  (fX,  and  the 
other  is  the  log  of  the  negative  log  of  the 
baseline  survival  function. 


This  second  log  may  be  either  positive  or 
negative,  and  we  aren’t  taking  any  more  logs, 
so  we  actually  don’t  have  to  take  a  second 
negative.  However,  for  consistency’s  sake,  a 
common  practice  is  to  put  a  minus  sign  in 
front  of  the  second  log  to  obtain  the  —In  —In 
expression  shown  here.  Nevertheless,  some 
software  packages  do  not  use  a  second  minus 
sign. 
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Two  individuals: 

Xi  =  (Xn,X12,...,Xlp) 

X2  =  (X2UX22,...,X2p) 

'  ln[—  In  S(t,Xi)} 

=  Y  A'Xii  +  ln[— In  50(r)] 
<  1=1 

ln[-ln  S(r,X2)] 

p 

=  Y  frX2i + m-  ln 

.  i=i 

ln[-ln  S(t, Xi)] 

-  (ln[—  ln  S(t, X2)]) 

=  Y  Pi(Xu~X2i) 

i=  1 

does  not  involve  t 


-ln[-ln  S(t,  Xj)] 

=  ln[—  ln  S(t,  X2)] 

+  Y  Pi(xu~X2i) 

i=i 


Now  suppose  we  consider  two  different  speci¬ 
fications  of  the  X  vector,  corresponding  to  two 
different  individuals,  Xi  and  X2. 

Then  the  corresponding  log-log  curves  for 
these  individuals  are  given  as  shown  here, 
where  we  have  simply  substituted  X,  and  X2 
for  X  in  the  previous  expression  for  the  log-log 
curve  for  any  individual  X. 


Subtracting  the  second  log-log  curve  from  the 
first  yields  the  expression  shown  here.  This 
expression  is  a  linear  sum  of  the  differences  in 
corresponding  predictor  values  for  the  two 
individuals.  Note  that  the  baseline  survival 
function  has  dropped  out,  so  that  the  differ¬ 
ence  in  log-log  curves  involves  an  expression 
that  does  not  involve  time  t. 

Alternatively,  using  algebra,  we  can  write  the 
above  equation  by  expressing  the  log-log  sur¬ 
vival  curve  for  individual  X|  as  the  log-log 
curve  for  individual  X2  plus  a  linear  sum  term 
that  is  independent  of  t. 


*2 


t 


The  above  formula  says  that  if  we  use  a  Cox  PH 
model  and  we  plot  the  estimated  log-log  sur¬ 
vival  curves  for  individuals  on  the  same  graph, 
the  two  plots  would  be  approximately  parallel. 
The  distance  between  the  two  curves  is  the 
linear  expression  involving  the  differences  in 
predictor  values,  which  does  not  involve  time. 
Note,  in  general,  if  the  vertical  distance 
between  two  curves  is  constant,  then  the  curves 
are  parallel. 


Graphical  approach  using  log-log  The  parallelism  of  log-log  survival  plots  for  the 
plots:  PH  model  is  appropriate  if  Cox  PH  model  provides  us  with  a  graphical 
“empirical”  plots  of  log-log  sur-  approach  for  assessing  the  PH  assumption, 
vival  curves  are  parallel.  That  is,  if  a  PH  model  is  appropriate  for  a  given 

set  of  predictors,  one  should  expect  that  empiri¬ 
cal  plots  of  log-log  survival  curves  for  different 
individuals  will  be  approximately  parallel. 
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Empirical  plots:  use  — ln[— In  5] 
where 

1 .  S  is  a  KM  curve 

2.  S  is  an  adjusted  survival  curve 
for  predictors  satisfying  the  PH 
assumption;  predictor  being 
assessed  not  included  in  model 


EXAMPLE 


Clinical  trial  of  leukemia  patients: 

T  =  weeks  until  patient  goes  out  of 
remission 

Predictors  (Xs): 

Rx  (=  1  if  placebo,  0  if  treatment) 
log  WBC 

Cox  PH  model: 

h(t,  X)  =  /j0(t)exp[Piitx  +  p2  log  WBC] 
Assessing  PH  assumption:  compare 
log-log  survival  curves  for  categories 
of  Rx  and  log  WBC 


One-at-a-time  strategy:  Rx  variable 
ln(-ln)S 


I _  1 - Treatment 

i - , 

! - Placebo 

- 1 

One-at-a-time  strategy:  log  WBC 
ln(-ln)S 


!_  _  Low 
Medium 

.  High 

- 1 


By  empirical  plots,  we  mean  plotting  log-log 
survival  curves  based  on  Kaplan-Meier  (KM) 
estimates  that  do  not  assume  an  underlying 
Cox  model.  Alternatively,  one  could  plot 
log-log  survival  curves  which  have  been 
adjusted  for  predictors  already  assumed  to  sat¬ 
isfy  the  PH  assumption  but  have  not  included 
the  predictor  being  assessed  in  a  PH  model. 

As  an  example,  suppose  we  consider  the  com¬ 
parison  of  treatment  and  placebo  groups  in  a 
clinical  trial  of  leukemia  patients,  where  sur¬ 
vival  time  is  time,  in  weeks,  until  a  patient 
goes  out  of  remission.  Two  predictors  of 
interest  in  this  study  are  treatment  group 
status  (1  =  placebo,  0  =  treatment),  denoted  as 
Rx,  and  log  white  blood  cell  count  (log  WBC), 
where  the  latter  variable  is  being  considered  as 
a  confounder. 

A  Cox  PH  model  involving  both  these  predic¬ 
tors  would  have  the  form  shown  at  the  left.  To 
assess  whether  the  PH  assumption  is  satisfied 
for  either  or  both  of  these  variables,  we  would 
need  to  compare  log-log  survival  curves  involv¬ 
ing  categories  of  these  variables. 

One  strategy  to  take  here  is  to  consider  the 
variables  one  at  a  time.  For  the  Rx  variable, 
this  amounts  to  plotting  log-log  KM  curves 
for  treatment  and  placebo  groups  and  asses¬ 
sing  parallelism.  If  the  two  curves  are  approxi¬ 
mately  parallel,  as  shown  here,  we  would 
conclude  that  the  PH  assumption  is  satisfied 
for  the  variable  Rx.  If  the  two  curves  intersect 
or  are  not  parallel  in  some  other  way,  we  would 
conclude  that  the  PH  assumption  is  not  satis¬ 
fied  for  this  variable. 

For  the  log  WBC  variable,  we  need  to  categorize 
this  variable  into  categories  -  say,  low,  medium, 
and  high  -  and  then  compare  plots  of  log-log 
KM  curves  for  each  of  the  three  categories.  In 
this  illustration,  the  three  log-log  Kaplan-Meier 
curves  are  clearly  nonparallel,  indicating  that 
the  PH  assumption  is  not  met  for  log  WBC. 
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EXAMPLE:  Computer  Results 


Problems  with  log-log  survival 
curve  approach: 

How  parallel  is  parallel? 
Recommend: 

•  subjective  decision 

•  conservative  strategy:  assume 
PH  is  OK  unless  strong 
evidence  of  nonparallelism 


The  above  examples  are  sketches  of  some  of  the 
possibilities  that  could  occur  from  compari¬ 
sons  of  log-log  curves.  For  the  actual  data  set 
containing  42  leukemia  patients,  computer 
results  are  shown  here  for  each  variable  sepa¬ 
rately.  Similar  output  using  Stata,  SAS,  SPSS, 
and  R  packages  is  provided  in  the  Computer 
Appendix. 

We  first  show  the  log-log  KM  curves  by  treat¬ 
ment,  Rx.  Notice  that  the  two  log-log  curves 
are  roughly  parallel,  indicating  that  the  Rx  var¬ 
iable  satisfies  the  PH  assumption  when  being 
considered  by  itself. 

Here  we  show  the  log-log  KM  curves  by  log 
WBC,  where  we  have  divided  this  variable  into 
low  (below  2.3),  medium  (between  2.3  and  3), 
and  high  (above  3)  values.  Notice  that  there  is 
some  indication  of  nonparallelism  below  8  days, 
but  that  overall  the  three  curves  are  roughly 
parallel.  Thus,  these  plots  suggest  that  the  PH 
assumption  is  more  or  less  satisfied  for  the 
variable  log  WBC,  when  considered  alone. 

As  a  third  example,  we  consider  the  log-log  KM 
plots  categorized  by  Sex  from  the  remission 
data.  Notice  that  the  two  curves  clearly  inter¬ 
sect,  and  are  therefore  noticeably  nonparallel. 
Thus,  the  variable,  Sex,  when  considered  by 
itself,  does  not  appear  to  satisfy  the  PH  assump¬ 
tion  and  therefore  should  not  be  incorporated 
directly  into  a  Cox  PH  model  containing  the 
other  two  variables,  Rx  and  log  WBC. 

The  above  examples  suggest  that  there  are 
some  problems  associated  with  this  graphical 
approach  for  assessing  the  PH  assumption.  The 
main  problem  concerns  how  to  decide  “how 
parallel  is  parallel?”  This  decision  can  be  quite 
subjective  for  a  given  data  set,  particularly  if  the 
study  size  is  relatively  small.  We  recommend 
that  one  should  use  a  conservative  strategy  for 
this  decision  by  assuming  the  PH  assumption 
is  satisfied  unless  there  is  strong  evidence  of 
nonparallelism  of  the  log-log  curves. 


Presentation:  III.  Graphical  Approach  1 :  Log-Log  Plots  1 73 


How  parallel  is  parallel? 

Recommend: 

•  many  categories  =>■  data  “thins 
out” 

•  different  categorizations  may 
give  different  graphical 
pictures 

Recommend: 

•  small  #  of  categories  (2  or  3) 

•  meaningful  choice 

•  reasonable  balance  (e.g., 
terciles) 

How  to  evaluate  several  variables 

simultaneously? 


Strategy: 

•  categorize  variables  separately 

•  form  combinations  of 
categories 

•  compare  log-log  curves  on 
same  graph 

Drawback: 

•  data  “thins  out” 

•  difficult  to  identify  variables 
responsible  for  nonparallelism 


EXAMPLE 


Remission  Data: 


Treatment 

Placebo 


Low  Medium  High 


Another  problem  concerns  how  to  categorize  a 
continuous  variable  like  log  WBC.  If  many  cate¬ 
gories  are  chosen,  the  data  “thins  out”  in  each 
category,  making  it  difficult  to  compare  different 
curves.  [Also,  one  categorization  into,  say,  three 
groups  may  give  a  different  graphical  picture 
from  a  different  categorization  into  three  groups.] 


In  categorizing  continuous  variables,  we  recom¬ 
mend  that  the  number  of  categories  be  kept 
reasonably  small  (e.g.,  two  or  three)  if  possible, 
and  that  the  choice  of  categories  be  as  meaning¬ 
ful  as  possible  and  also  provide  reasonable  bal¬ 
ance  of  numbers  (e.g.,  as  when  using  terciles). 

In  addition  to  the  two  problems  just  described, 
another  problem  with  using  log-log  survival 
plots  concerns  how  to  evaluate  the  PH  assump¬ 
tion  for  several  variables  simultaneously. 

One  strategy  for  simultaneous  comparisons  is 
to  categorize  all  variables  separately,  form 
combinations  of  categories,  and  then  compare 
log-log  curves  for  all  combinations  on  the 
same  graph. 


A  drawback  of  this  strategy  is  that  the  data  will 
again  tend  to  “thin  out”  as  the  number  of  com¬ 
binations  gets  even  moderately  large.  Also, 
even  if  there  are  sufficient  numbers  for  each 
combined  category,  it  is  often  difficult  to  deter¬ 
mine  which  variables  are  responsible  for  any 
nonparallelism  that  might  be  found. 

As  an  example  of  this  strategy,  suppose  we  use 
the  remission  data  again  and  consider  both  Rx 
and  log  WBC  together.  Because  we  previously 
had  two  categories  of  Rx  and  three  categories 
of  log  WBC,  we  get  a  total  of  six  combined 
categories,  consisting  of  treated  subjects  with 
low  log  WBC,  placebo  subjects  with  low  log 
WBC,  treated  subjects  with  medium  log  WBC, 
and  so  on. 
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EXAMPLE:  (continued) 


Log-log  KM  curves  by  six  combinations 
of  Rx  by  log  WBC 


Plots  suggest  PH  not  satisfied.  However, 
the  study  is  small,  i.e.,  plots  are 
unreliable. 


The  computer  results  are  shown  here  for  the 
log-log  curves  corresponding  to  each  of  the  six 
combinations  of  Rx  with  log  WBC.  Notice  that 
there  are  several  points  of  intersection  among 
the  six  curves.  Therefore,  these  results  suggest 
that  the  PH  assumption  is  not  satisfied  when 
considering  Rx  and  log  WBC  together. 


However,  the  sample  sizes  used  to  estimate 
these  curves  are  quite  small,  ranging  between 
four  subjects  for  group  4  (Rx  =  1,  log  WBC  = 
low)  to  12  subjects  for  group  6  (Rx  =  1,  log 
WBC  =  high),  with  the  total  study  size  being 
42.  Thus,  for  this  small  study,  the  use  of  six 
log-log  curves  provides  unreliable  information 
for  assessing  the  PH  assumption. 


Alternative  strategy: 

Adjust  for  predictors  already  satis¬ 
fying  PH  assumption,  i.e.,  use 
adjusted  log-log  S  curves 


An  alternative  graphical  strategy  for  consider¬ 
ing  several  predictors  together  is  to  assess  the 
PH  assumption  for  one  predictor  adjusted  for 
other  predictors  that  are  assumed  to  satisfy  the 
PH  assumption.  Rather  than  using  Kaplan- 
Meier  curves,  this  involves  a  comparison  of 
adjusted  log-log  survival  curves. 


EXAMPLE 


Remission  data: 

•  compare  Rx  categories  adjusted 
for  log  WBC 

•  fit  PH  model  for  each  Rx  stratum 

•  obtain  adjusted  survival  curves 
using  overall  mean  of  log  WBC 

Log-log  S  curves  for  Rx  groups  using 
PH  model  adjusted  for  log  WBC 


As  an  example,  again  we  consider  the  remis¬ 
sion  data  and  the  predictors  Rx  and  log  WBC. 
To  assess  the  PH  assumption  for  Rx  adjusted 
for  log  WBC,  we  would  compare  adjusted 
log-log  survival  curves  for  the  two  treatment 
categories,  where  each  adjusted  curve  is 
derived  from  a  PH  model  containing  log  WBC 
as  a  predictor.  In  computing  the  adjusted  sur¬ 
vival  curve,  we  need  to  stratify  the  data  by 
treatment,  fit  a  PH  model  in  each  stratum, 
and  then  obtain  adjusted  survival  probabilities 
using  the  overall  mean  log  WBC  in  the  esti¬ 
mated  survival  curve  formula  for  each  stratum. 

For  the  remission  data  example,  the  estimated 
log-log  survival  curves  for  the  two  treatment 
groups  adjusted  for  log  WBC  are  shown  here. 
Notice  that  these  two  curves  are  roughly  paral¬ 
lel,  indicating  that  the  PH  assumption  is  satis¬ 
fied  for  treatment. 
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EXAMPLE:  (continued) 


Log-log  S  curves  for  log  WBC  groups 
using  PH  model  adjusted  for  Rx 


Remission  data: 

Assess  PH  assumption  for  Sex: 

•  use  PH  model  containing  Rx  and 
log  WBC 

•  use  Rx  and  log  WBC  in  survival 
probability  formula 

Log-log  S  curves  for  Sex  adjusted  for 
Rx  and  log  WBC 


As  another  example,  we  consider  adjusted 
log-log  survival  curves  for  three  categories  of 
log  WBC,  adjusted  for  the  treatment  status  (Rx) 
variable.  The  adjusted  survival  probabilities  in 
this  case  use  the  overall  mean  Rx  score,  i.e.,  0.5, 
the  proportion  of  the  42  total  subjects  that  are 
in  the  placebo  group  (i.e.,  half  the  subjects  have 
a  score  of  Rx  =  1). 

The  three  log-log  curves  adjusted  for  treatment 
status  are  shown  here.  Although  two  of  these 
curves  intersect  early  in  follow-up,  they  do  not 
suggest  a  strong  departure  from  parallelism 
overall,  suggesting  that  the  PH  assumption  is 
reasonable  for  log  WBC,  after  adjusting  for 
treatment  status. 

As  a  third  example,  again  using  the  remission 
data,  we  assess  the  PH  assumption  for  Sex, 
adjusting  for  both  treatment  status  and  log 
WBC  in  the  model.  This  involves  obtaining 
log-log  survival  curves  for  males  and  females 
separately,  using  a  PH  model  that  contains 
both  treatment  status  and  log  WBC.  The  adjust¬ 
ment  uses  the  overall  mean  treatment  score  and 
the  overall  mean  log  WBC  score  in  the  formula 
for  the  estimated  survival  probability. 

The  estimated  log-log  survival  curves  for  Sex, 
adjusted  for  treatment  and  log  WBC  are  shown 
here.  These  curves  clearly  cross,  indicating  that 
the  PH  assumption  is  not  satisfied  for  Sex,  after 
adjusting  for  treatment  and  log  WBC. 


S  1 .  log-log  survival  curves 
2.  observed  versus  expected 
survival  curves 


We  have  thus  described  and  illustrated  one  of 
the  two  graphical  approaches  for  checking  the 
PH  assumption,  that  is,  using  log-log  survival 
plots.  In  the  next  section,  we  describe  an  alter¬ 
native  approach  that  compares  "observed” 
with  “expected”  survival  curves. 


IV.  Graphical  Approach  2: 
Observed  Versus 
Expected  Plots 


Graphical  analog  of  GOF  test 


The  use  of  observed  versus  expected  plots 
to  assess  the  PH  assumption  is  the  graphical 
analog  of  the  goodness-of-fit  (GOF)  testing 
approach  to  be  described  later,  and  is  therefore 
a  reasonable  alternative  to  the  log-log  survival 
curve  approach. 
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Two  strategies: 

1.  One-at-a-time:  uses  KM  curves 
to  obtain  observed  plots 

2 .  Adjusting  for  other  variables: 
uses  stratified  Cox  PH  model  to 
obtain  observed  plots  (see 
Chapter  5) 


As  with  the  log-log  approach,  the  observed  ver¬ 
sus  expected  approach  may  be  carried  out 
using  either  or  both  of  two  strategies — (1) 
assessing  the  PH  assumption  for  variables 
one-at-a-time,  or  (2)  assessing  the  PH  assump¬ 
tion  after  adjusting  for  other  variables.  The 
strategy  which  adjusts  for  other  variables  uses 
a  stratified  Cox  PH  model  to  form  observed 
plots,  where  the  PH  model  contains  the  vari¬ 
ables  to  be  adjusted  and  the  stratified  variable 
is  the  predictor  being  assessed.  The  stratified 
Cox  procedure  is  described  in  Chapter  5. 


Here,  we  describe  only  the  one-at-a-time  strat¬ 
egy,  which  involves  using  KM  curves  to  obtain 
observed  plots. 


One-at-a-time: 

•  stratify  data  by  categories  of 
predictor 

•  obtain  KM  curves  for  each 
category 


Using  the  one-at-a-time  strategy,  we  first  must 
stratify  our  data  by  categories  of  the  predictor 
to  be  assessed.  We  then  obtain  observed  plots 
by  deriving  the  KM  curves  separately  for  each 
category. 


EXAMPLE:  Remission  Data 


KM  (Observed)  Plots  by  Rx  Group 


Expected  Survival  Plots  by  Rx  Group 
Using  PH  Model 


As  an  example,  for  the  remission  data  on  42 
leukemia  patients  we  have  illustrated  earlier, 
the  KM  plots  for  the  treatment  and  placebo 
groups,  with  21  subjects  in  each  group,  are 
shown  here.  These  are  the  "observed”  plots. 


To  obtain  “expected”  plots,  we  fit  a  Cox  PH 
model  containing  the  predictor  being  assessed. 
We  obtain  expected  plots  by  separately  substi¬ 
tuting  the  value  for  each  category  of  the  predic¬ 
tor  into  the  formula  for  the  estimated  survival 
curve,  thereby  obtaining  a  separate  estimated 
survival  curve  for  each  category. 

As  an  example,  again  using  the  remission  data, 
we  fit  the  Cox  PH  model  with  Rx  as  its  only 
variable.  Using  the  corresponding  survival 
curve  formula  for  this  Cox  model,  as  given  in 
the  box  at  the  left,  we  then  obtain  separate 
expected  plots  by  substituting  the  values  of 
0  (for  treatment  group)  and  1  (for  placebo 
group).  The  expected  plots  are  shown  here. 
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EXAMPLE:  (continued) 


Observed  Versus  Expected  Plots  by  Rx 


S 


To  compare  observed  with  expected  plots  we 
then  put  both  sets  of  plots  on  the  same  graph  as 
shown  here. 


If  observed  and  expected  plots  are: 

•  close,  complies  with  PH 
assumption 

•  discrepant,  PH  assumption 
violated 


If  for  each  category  of  the  predictor  being 
assessed,  the  observed  and  expected  plots  are 
“close”  to  one  another,  we  then  can  conclude 
that  the  PH  assumption  is  satisfied.  If,  how¬ 
ever,  one  or  more  categories  show  quite  dis¬ 
crepant  observed  and  expected  plots,  we 
conclude  that  the  PH  assumption  is  violated. 


Example:  Remission  Data  (continued) 


Observed  and  expected  plots  are  close 
for  each  treatment  group. 

Conclude  PH  assumption  not  violated. 


For  the  example  shown  above,  observed  and 
expected  curves  appear  to  be  quite  close  for 
each  treatment  group.  Thus,  we  would  conclude 
using  this  graphical  approach  that  the  treat¬ 
ment  variable  satisfies  the  PH  assumption. 


Drawback:  How  close  is  close?  An  obvious  drawback  to  this  graphical 

approach  is  deciding  “how  close  is  close” 
Recommend:  PH  not  satisfied  only  when  comparing  observed  versus  expected 
when  plots  are  strongly  discrepant,  curves  for  a  given  category.  This  is  analogous 

to  deciding  “how  parallel  is  parallel”  when 
comparing  log-log  survival  curves.  Here,  we 
recommend  that  the  PH  assumption  be  consid¬ 
ered  as  not  satisfied  only  when  observed  and 
expected  plots  are  strongly  discrepant. 
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Example:  Remission  Data 


Observed  Versus  Expected  Plots  by  Sex 


PH  assumption  not  satisfied  for  Sex. 
Same  conclusion  as  with  log-log  curves. 


As  another  example,  again  using  the  remission 
data,  we  consider  observed  versus  expected 
plots  by  Sex,  as  shown  here.  Note  that  the 
observed  plots  for  males  and  females,  which 
are  described  by  the  thicker  lines,  cross  at 
about  12  weeks,  whereas  the  expected  plots 
don’t  actually  intersect,  with  the  female  plot 
lying  below  the  male  plot  throughout  follow¬ 
up.  Moreover,  for  males  and  females  sepa¬ 
rately,  the  observed  and  expected  plots  are 
quite  different  from  one  another. 

Thus,  the  above  plots  suggest  that  the  PH 
assumption  is  not  satisfied  for  the  variable 
Sex.  We  came  to  the  same  conclusion  when 
using  log-log  survival  curves,  which  crossed 
one  another  and  were  therefore  clearly  non¬ 
parallel. 


Continuous  variable: 

•  form  strata  from  categories 

•  observed  plots  are  KM  curves 
for  each  category 


When  using  observed  versus  expected  plots  to 
assess  the  PH  assumption  for  a  continuous 
variable,  observed  plots  are  derived,  as  for  cat¬ 
egorical  variables,  by  forming  strata  from  cate¬ 
gories  of  the  continuous  variable  and  then 
obtaining  KM  curves  for  each  category. 


•  two  options  for  expected  plots 
1.  Use  PH  model  with  k  —  1 
dummy  variables  Xc  for  k 
categories,  i.e., 

(k-\ 

h(t,X)=ho(t)  exp  (  Y^PiXci) 

Obtain  adjusted  survival 
curve: 


However,  for  continuous  predictors,  there  are 
two  options  available  for  computing  expected 
plots.  One  option  is  to  use  a  Cox  PH  model 
which  contains  k  —  1  dummy  variables  to  indi¬ 
cate  k  categories.  The  expected  plot  for  a  given 
category  is  then  obtained  as  an  adjusted  sur¬ 
vival  curve  by  substituting  the  values  for  the 
dummy  variables  that  define  the  given  category 
into  the  formula  for  the  estimated  survival 
curve,  as  shown  here  for  category  c. 


S>,Xf)  =  [So(0] 


exp(^ftXd) 


where 

Xc  =  (Xcl,Xc2,...,XC'k_l) 
gives  values  of  dummy 
variables  for  category  c. 
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Options  for  a  continuous  variable: 

2.  Use  PH  model: 

h{t,  X)  =  h0(t)  exp(P  X) 
so  X  =  X  \ 

Continuous 
Obtain  adjusted  survival  curve: 

s{tX)  =  [s„W]“p<W‘) 


The  second  option  is  to  use  a  Cox  PH  model 
containing  the  continuous  predictor  (say,  X) 
being  assessed.  Expected  plots  are  then 
obtained  as  adjusted  survival  curves  by  spe¬ 
cifying  predictor  values  that  distinguish  cate¬ 
gories,  as,  for  example,  when  using  mean 
predictor  values  for  each  category. 


where  Xc  denotes  the  mean  value 
for  the  variable  X  within  category  c . 


Example:  Remission  Data 


Observed  (KM)  Plots  by  log  WBC 
Categories 


S 


Option  1: 

Ml,  X)  =  h0(t)  exp(PiX]  +  p2X2) 

,  ,,  f  1  if  high  ,,  f  1  if  medium 

WhereX|  =  (()  if  other  Mo  if  other 

so  that 

high  =  (1,0);  medium  =  (0,  l);low=  (0, 0) 
Expected  survival  plots: 

Xl  =  l,X2  =  0:  S(t,Xhigh)  =  [50(t)]eXp(ft) 

X1=0,X2=1:  S(t ,  If  medium)  =  [S0«] 
X1=0,X2  =  0:S(t,Xlovl)=[S0(tj\ 


As  an  example  to  illustrate  both  options,  we 
consider  the  continuous  variable  log  WBC 
from  the  remission  data  example.  To  assess 
the  PH  assumption  for  this  variable,  we  would 
first  stratify  log  WBC  into,  say,  three  categories 
-  low,  medium,  and  high.  The  observed  plots 
would  then  be  obtained  as  KM  curves  for  each 
of  the  three  strata,  as  shown  here. 


Using  option  1,  expected  plots  would  be 
obtained  by  fitting  a  Cox  PH  model  containing 
two  dummy  variables  X\  and  X2,  as  shown 
here,  where  Xi  takes  the  values  1  if  high  or 
0  if  other  and  X2  takes  the  values  1  if  medium 
or  0  if  other.  Thus,  when  log  WBC  is  high,  the 
values  of  Xx  and  X2  are  1  and  0,  respectively; 
whereas  when  log  WBC  is  medium,  the  values 
are  0  and  1 ,  respectively;  and  when  log  WBC  is 
low,  the  values  are  both  0. 

The  expected  survival  plots  for  high,  medium, 
and  low  categories  are  then  obtained  by  sub¬ 
stituting  each  of  the  three  specifications  of 
Xi  and  X2  into  the  formula  for  the  estimated 
survival  curve,  and  then  plotting  the  three 


curves. 
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EXAMPLE:  (continued) 


Expected  Plots  for  log  WBC  Using 
Option  1  (Dummy  Variables) 


S 


Observed  Versus  Expected  Plots  Using 
Option  1 


Option  2:  Treat  log  WBC  as  continuous 
h(f,  X)  =  40(f)exp[p(log  WBC)] 

log  WBChigh  =  3.83: 

S(t,  Xhigh)  =  [50(0]exp[3.S3g] 

log  WBCmed  =  2.64: 

S(t,  Vmed)  =  [S:0(O]exp[2.64p] 
log  WBClow  =  1.71: 

SU,  Xlow)  =  [S0(0]»p[l.7lft 

Observed  Versus  Expected  Plots  for 
log  WBC  Using  Option  2. 


S 


The  expected  plots  using  option  1  (the  dummy 
variable  approach)  are  shown  here  for  the 
three  categories  of  log  WBC. 


Here  we  put  the  observed  and  expected  plots  on 
the  same  graph.  Although  there  are  some  dis¬ 
crepancies,  particularly  early  in  follow-up  for 
the  low  log  WBC  category,  these  plots  suggest 
overall  that  the  PH  assumption  is  satisfied  for 
log  WBC. 


Using  option  2,  expected  plots  would  be 
obtained  by  first  fitting  a  Cox  PH  model  con¬ 
taining  the  continuous  variable  log  WBC,  as 
shown  here. 

Adjusted  survival  curves  are  then  obtained  for 
specified  values  of  log  WBC  that  summarize 
the  three  categories  used  to  form  observed 
curves.  Here,  we  find  that  the  mean  log  WBC 
scores  for  low,  medium,  and  high  categories 
are,  respectively,  1.71,  2.64,  and  3.83.  These 
values  are  substituted  into  the  estimated  sur¬ 
vival  curve  formula  as  shown  here. 

Here  are  the  observed  and  expected  plots  using 
option  2.  As  with  option  1,  although  there  are 
some  discrepancies  within  categories,  overall, 
these  plots  suggest  that  the  PH  assumption  is 
satisfied  for  the  log  WBC  variable. 
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V.  The  Goodness  of  Fit 
(GOF)  Testing  Approach 


Statistical  test  appealing 

•  Provides  p-value 

•  More  objective  decision  than 
when  using  graphical  approach 


The  GOF  testing  approach  is  appealing  because 
it  provides  a  test  statistic  and  p-value  for 
assessing  the  PH  assumption  for  a  given  predic¬ 
tor  of  interest.  Thus,  the  researcher  can  make 
a  more  objective  decision  using  a  statistical  test 
than  is  typically  possible  when  using  either  of 
the  two  graphical  approaches  described  above. 


Test  of  Harrel  and  Lee  (1986)  A  number  of  different  tests  for  assessing  the  PH 

assumption  have  been  proposed  in  the  litera- 

•  Variation  of  test  of  Schoenfeld  ture.  We  present  the  test  of  Harrel  and  Lee 

•  Uses  Schoenfeld  residuals  (1986),  a  variation  of  a  test  originally  proposed 

by  Schoenfeld  (1982)  and  based  on  the  residuals 
defined  by  Schoenfeld,  now  called  the  Schoen¬ 
feld  residuals. 


Schoenfeld  residuals  defined  for 

•  Each  predictor  in  model 

•  Every  subject  who  has  event 

Consider  Cox  PH  model 
h(t,X)  =  h0(t)exp(p!RX 

+  p2log  WBC  +  p3SEX) 

3  predictors  — >  3  Schoenfeld 

residuals  for  each 
subject  who  has 
event 


For  each  predictor  in  the  model,  Schoenfeld 
residuals  are  defined  for  every  subject  who 
has  an  event.  For  example,  consider  a  Cox  PH 
model  with  three  predictors:  RX,  LOGWBC, 
and  SEX.  Then  there  are  three  Schoenfeld  resi¬ 
duals  defined  for  each  subject  who  has  an 
event,  one  for  each  of  the  three  predictors. 


Schoenfeld  residual  for  ith  subject 
for  LOGWBC: 

Observed  LOGWBC 
-  LOGWBC  weighted  average 

Weights  are  other  subjects’  hazard 
(from  subjects  still  at  risk) 


Suppose  subject  i  has  an  event  at  time  t.  Then 
her  Schoenfeld  residual  for  LOGWBC  is  her 
observed  value  of  log  white  blood  cell  count 
minus  a  weighted  average  of  the  log  white 
blood  cell  counts  for  the  other  subjects  still  at 
risk  at  time  t.  The  weights  are  each  subject’s 
hazard. 


Underlying  idea  of  test  The  idea  behind  the  statistical  test  is  that  if  the 

If  PH  holds  then  Schoenfeld  resi-  PH  assumption  holds  for  a  particular  covar- 
duals  uncorrelated  with  time  iate  then  the  Schoenfeld  residuals  for  that 

covariate  will  not  be  related  to  survival  time. 
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Steps  for  test  implementation 

1 .  Obtain  Schoenfeld  residuals 

2.  Rank  failure  times 

3.  Test  correlation  of  residuals  to 
ranked  failure  time  H0:  p  =  0 


The  implementation  of  the  test  can  be  thought 
of  as  a  three-step  process. 

Step  1.  Run  a  Cox  PH  model  and  obtain 
Schoenfeld  residuals  for  each  predictor. 

Step  2.  Create  a  variable  that  ranks  the  order  of 
failures.  The  subject  who  has  the  first  (earliest) 
event  gets  a  value  of  1,  the  next  gets  a  value  of  2, 
and  so  on. 

Step  3.  Test  the  correlation  between  the 
variables  created  in  the  first  and  second  steps. 
The  null  hypothesis  is  that  the  correlation 
between  the  Schoenfeld  residuals  and  ranked 
failure  time  is  zero. 


H0  rejected 

Conclude  PH  assumption  violated 


Rejection  of  the  null  hypothesis  leads  to  a  con¬ 
clusion  that  the  PH  assumption  is  violated. 


PH  test  in  Stata,  SAS,  SPSS,  R 
shown  in  Computer  Appendix 

Stata  uses  scaled  Schoenfeld 
residuals  rather  than  Schoenfeld 
residuals  (typically  similar  results) 


The  implementation  of  the  test  for  the  PH 
assumption  in  Stata,  SAS,  SPSS,  and  R  is 
shown  in  the  Computer  Appendix.  Stata  uses 
a  slight  variation  of  the  test  we  just  described  in 
that  it  uses  the  scaled  Schoenfeld  residual 
rather  than  the  Schoenfeld  residual  (Grambsch 
and  Therneau,  1994).  The  tests  typically  (but 
not  always)  yield  similar  results. 


EXAMPLE:  Remission  Data 


Column  name.  Coeff.  StErr.  P(PH) 

Rx  1.294  0.422  0.917 

log  WBC _ 1.604  0.329  0.944 

Both  variables  satisfy  PH  assumption. 
Note:  P(PH )  =  0.917  assesses  PH  for 
Rx,  assuming  PH  OK  for  log  WBC. 


To  illustrate  the  statistical  test  approach,  we 
return  to  the  remission  data  example.  The 
printout  on  the  left  gives  p-values  P(PH)  for 
treatment  group  and  log  WBC  variables  based 
on  fitting  a  Cox  PH  model  containing  these  two 
variables. 

The  P(PH)  values  are  quite  high  for  both  vari¬ 
ables,  suggesting  that  both  variables  satisfy 
the  PH  assumption.  Note  that  each  of  these 
p-values  tests  the  assumption  for  one  variable 
given  that  the  other  predictors  are  included  in 
the  model.  For  example,  the  P(PH)  of  0.917 
assesses  the  PH  assumption  for  Rx,  assuming 
the  PH  assumption  is  satisfied  for  log  WBC. 
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EXAMPLE 

Column  name 

Coeff. 

StErr. 

P(PH) 

Rx 

1.391 

0.457 

0.935 

log  WBC 

1.594 

0.330 

0.828 

Sex 

0.263 

0.449 

0.038 

log  WBC  and  Rx  satisfy  PH. 

Sex  does  not  satisfy  PH. 

(Same  conclusions  using  graphical 
approaches). 


Statistical  Tests 
Null  is  never  proven 

•  May  say  not  enough  evidence  to 
reject 

p-value  can  be  driven  by  sample  size 

•  Small  sample  -  gross  violation 
of  null  may  not  be  significant 

•  Large  sample  -  slight  violation 
of  null  may  be  highly  significant 

Test  -  more  objective 
Graph  -  more  subjective,  but  can 
detect  specific  violations 
Recommend  -  Use  both  graphs  and 
tests 


As  another  example,  consider  the  computer 
results  shown  here  for  a  Cox  PH  model  con¬ 
taining  the  variable  SEX  in  addition  to  log 
WBC  and  treatment  group.  The  P(PH)  values 
for  log  WBC  and  treatment  group  are  still  non¬ 
significant.  However,  the  P(PH)  value  for  SEX 
is  significant  below  the  0.05  level.  This  result 
suggests  that  log  WBC  and  treatment  group 
satisfy  the  PH  assumption,  whereas  SEX  does 
not.  We  came  to  the  same  conclusion  about 
these  variables  using  the  graphical  procedures 
described  earlier. 

An  important  point  concerning  a  testing 
approach  is  that  the  null  hypothesis  is  never 
proven  with  a  statistical  test.  The  most  that 
may  be  said  is  that  there  is  not  enough  evidence 
to  reject  the  null.  A  p-value  can  be  driven 
by  sample  size.  A  gross  violation  of  the  null 
assumption  may  not  be  statistically  significant 
if  the  sample  is  very  small.  Conversely,  a  slight 
violation  of  the  null  assumption  may  be  highly 
significant  if  the  sample  is  very  large. 


A  statistical  test  offers  a  more  objective 
approach  for  assessing  the  PH  assumption 
compared  to  the  subjectivity  of  the  graphical 
approach.  However,  the  graphical  approach 
enables  the  researcher  to  detect  specific  kinds 
of  departures  from  the  PH  assumption;  the 
researcher  can  see  what  is  going  on  from  the 
graph.  Consequently,  we  recommend  that 
when  assessing  the  PH  assumption,  the  inves¬ 
tigator  use  both  graphical  procedures  and  sta¬ 
tistical  testing  before  making  a  final  decision. 


VI.  Assessing  the  PH 
Assumption  Using 
Time-Dependent 
Covariates 

Extended  Cox  model: 
contains  product  terms  of  the  form 
X  x  g(t),  where  g(t)  is  a  function  of 
time. 


When  time-dependent  variables  are  used  to 
assess  the  PH  assumption  for  a  time- 
independent  variable,  the  Cox  model  is  extended 
to  contain  product  (i.e.,  interaction)  terms 
involving  the  time-independent  variable  being 
assessed  and  some  function  of  time. 
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One-at-a-time  model: 

h(t,X)  =  h0(t)exp[$X+  8(Xxg(t))] 


When  assessing  predictors  one-at-a-time,  the 
extended  Cox  model  takes  the  general  form 
shown  here  for  the  predictor  X. 


Some  choices  forg(i): 

tfW  =  t 
g{t)  =  log  t 


g(t) 


1  if  t  >  t0  (heaviside 
0  it  t  <  to  function) 


One  choice  for  the  function  g(t)  is  simply  g(t) 
equal  to  t,  so  that  the  product  term  takes  the 
form  X  x  t.  Other  choices  for  g(t)  are  also  pos¬ 
sible,  for  example,  log  t,  or  the  “heaviside  func¬ 
tion”  shown  at  the  left. 


H0:> 5  =  0 

Under  H0,  the  model  reduces  to: 
h(t,X)  =  /z0(f)exp[p  X] 


Using  the  above  one-at-a-time  model,  we  assess 
the  PH  assumption  by  testing  for  the  signifi¬ 
cance  of  the  product  term.  The  null  hypothesis 
is  therefore  “8  equal  to  zero.”  Note  that  if  the 
null  hypothesis  is  true,  the  model  reduces  to  a 
Cox  PH  model  containing  the  single  variable  X. 


Use  either  Wald  statistic  or 
likelihood  ratio  statistic: 

X* 2  with  1  df  under  H0 


The  test  can  be  carried  out  using  either  a  Wald 
statistic  or  a  likelihood  ratio  statistic.  In  either 
case,  the  test  statistic  has  a  chi-square  distribu¬ 
tion  with  one  degree  of  freedom  under  the  null 
hypothesis. 


For  example,  if  the  PH  assumption  is  being 
assessed  for  Sex,  a  Cox  model  might  be 
extended  to  include  the  variable  Sex  x  t  in  addi¬ 
tion  to  Sex.  If  the  coefficient  of  the  product  term 

turns  out  to  be  significant,  we  can  conclude  that 
the  PH  assumption  is  violated  for  Sex.2 

Strategies  for  assessing  PH:  In  addition  to  a  one-at-a-time  strategy,  the 

extended  Cox  model  can  also  be  used  to  assess 

•  one-at-a-time  the  PH  assumption  for  several  predictors 

•  several  predictors  simultaneously  as  well  as  for  a  given  predictor 

simultaneously  adjusted  for  other  predictors  in  the  model. 

•  for  a  given  predictor  adjusted 
for  other  predictors 


Example 


h(t,X)  =  ZioWexplp^ex  +  p2  (Sex  x  f)l 
p2  7^  0  =  PH  assumption  violated 


2  In  contrast,  if  the  test  for  H0:  p2  =  0  is  nonsignificant, 
we  can  conclude  only  that  the  particular  version  of  the 
extended  Cox  model  being  considered  is  not  supported 
by  the  data. 


Presentation:  VI.  Assessing  the  PH  Assumption  1 85 


Several  predictors  simultaneously: 
h(t,X)  =  ho(t)  exp  (dm 

+Si(Xi  x  g,(t))]^ 

gi(t)  =  function  of  time  for  z'th 
predictor 


To  assess  the  PH  assumption  for  several 
predictors  simultaneously,  the  form  of  the 
extended  model  is  shown  here.  This  model 
contains  the  predictors  being  assessed  as 
main  effect  terms  and  also  as  product  terms 
with  some  function  of  time.  Note  that  different 
predictors  may  require  different  functions  of 
time;  hence,  the  notation gt  it)  is  used  to  define 
the  time  function  for  the  z'th  predictor. 


H0:  §!  —  §2  -  •  •  •  -  §P  -  0 
LR  =  2  In  Lph  mocJel 

—  (  2  In  Lext.  Cox  model) 

~Xp  under  H0 


Cox  PH  (reduced)  model: 

/  p  N 

h(t,X)  =  ho(t)  exp  E 


With  the  above  model,  we  test  for  the  PH 
assumption  simultaneously  by  assessing  the 
null  hypothesis  that  all  the  8,  coefficients  are 
equal  to  zero.  This  requires  a  likelihood  ratio 
chi-square  statistic  with  p  degrees  of  freedom, 
where  p  denotes  the  number  of  predictors 
being  assessed.  The  LR  statistic  computes  the 
difference  between  the  log  likelihood  statistic 
(i.e.,  —2  In  L)  for  the  PH  model  and  the  log 
likelihood  statistic  for  the  extended  Cox 
model.  Note  that  under  the  null  hypothesis, 
the  model  reduces  to  the  Cox  PH  model 
shown  here. 


EXAMPLE:  Remission  Data 


h(t,X)  =  h0(t)  exp[  jgj  (Rx) 

+  ft,  (log  WBC)  +  ft  (Sex) 
+  ft  (Rxxg(t)) 

+  82  (log  WBC  x  g(f)) 

+  ft(Sex  x  g(f))] 

,  ,  ,  (1  if  t  >  7 

where  g(t)  =  |  0  if  t  <  7 

H0:  §i  =  82  =  S3  =  0 

LR^x2  with  3  df  under  H0 

If  test  is  significant,  use  backward 
elimination  to  find  predictors  not 
satisfying  PH  assumption. 


As  an  example,  we  assess  the  PH  assumption 
for  the  predictors  Rx,  log  WBC,  and  Sex  from 
the  remission  data  considered  previously.  The 
extended  Cox  model  is  given  as  shown  here, 
where  the  functions  g,  (t)  have  been  chosen  to 
be  the  same  "heaviside”  function  defined  byg(t) 
equals  1  if  t  is  7  weeks  or  more  and  git)  equals 
0  if  t  is  less  than  7  weeks.  The  null  hypothesis  is 
that  all  three  8  coefficients  are  equals  to  zero. 
The  test  statistic  is  a  likelihood-ratio  chi-square 
with  3  degrees  of  freedom. 

If  the  above  test  is  found  to  be  significant,  then 
we  can  conclude  that  the  PH  assumption  is  not 
satisfied  for  at  least  one  of  the  predictors  in  the 
model.  To  determine  which  predictor(s)  do  not 
satisfy  the  PH  assumption,  we  could  proceed 
by  backward  elimination  of  nonsignificant 
product  terms  until  a  final  model  is  attained. 
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Heaviside  function: 


if  t  >  7 
if  t  <  7 


h(t,X)  differs  for  t  >  7  and  t  <  7. 

Properties  of  heaviside  functions 
and  numerical  results  are  described 
in  Chapter  6. 


Note  that  the  use  of  a  heaviside  function  for  g(t) 
in  the  above  example  yields  different  expres¬ 
sions  for  the  hazard  function  depending  on 
whether  t  is  greater  than  or  equal  to  7  weeks 
or  t  is  less  than  7  weeks.  Chapter  6  provides 
further  details  on  the  properties  of  heaviside 
functions,  and  also  provides  numerical  results 
from  fitting  extended  Cox  models. 


Assessing  PH  for  a  given  predictor 
adjusted  for  other  predictors: 

rt-i 


h(t,X)  =  h0(t)  exp 


E  +  P*X* 


1=1 


+  6*(X*xg{t)) 


X*  =  Predictor  of  interest 
H0:  8*  =  0 

Wald  or  LR  statistic  ~%2  with  1  df 


We  show  here  an  extended  Cox  model  that 
can  be  used  to  evaluate  the  PH  assumption 
for  a  given  predictor  adjusted  for  predictors 
already  satisfying  the  PH  assumption.  The 
predictor  of  interest  is  denoted  as  X* ,  and  the 
predictors  considered  to  satisfy  the  PH 
assumption  are  denoted  as  Xt.  The  null  hypoth¬ 
esis  is  that  the  coefficient  8*  of  the  product 
term  X*g(t)  is  equal  to  zero.  The  test  statistic 
can  either  be  a  Wald  statistic  or  a  likelihood 
ratio  statistic,  with  either  statistic  having  a  chi- 
square  distribution  with  1  degree  of  freedom 
under  the  null  hypothesis. 


Example:  Remission  Data 


for  Sex,  adjusted  for  Rx  and  log  WBC: 
h(t,X.)  =  exp  \fiiRx 

+  /J2log  WBC  +  /TSex 
+  5* (Sex  x  g(t))] 


As  an  example,  suppose,  again  considering  the 
remission  data,  we  assess  the  PH  assumption 
for  the  variable,  Sex,  adjusted  for  the  variables 
Rx  and  log  WBC,  which  we  assume  already 
satisfy  the  PH  assumption.  Then,  the  extended 
Cox  model  for  this  situation  is  shown  here. 


Two  models  for  LR  test  of  PH: 

1 .  Cox  PH  model 

2.  extended  Cox  model 

See  Computer  Appendix  for  SAS, 
Stata,  SPSS,  and  R 


To  carry  out  the  computations  for  any  of  the 
likelihood  ratio  tests  described  above,  two 
different  types  of  models,  a  PH  model  and  an 
extended  Cox  model,  need  to  be  fit.  See  the 
Computer  Appendix  for  details  on  how  the 
extended  Cox  model  is  fit  using  SAS,  Stata, 
SPSS,  and  R. 


Drawback:  choice  of  g,(f) 

Different  choices  may  lead  to 
different  conclusions  about  PH 
assumption. 


The  primary  drawback  of  the  use  of  an  extended 
Cox  model  for  assessing  the  PH  assumption 
concerns  the  choice  of  the  functions  giit)  for 
the  time-dependent  product  terms  in  the  model. 
This  choice  is  typically  not  clear-cut,  and  it  is 
possible  that  different  choices,  such  as  g(t)  equal 
to  t  versus  log  t  versus  a  heaviside  function,  may 
result  in  different  conclusions  about  whether 
the  PH  assumption  is  satisfied. 
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Chapter  6:  Time-dependent  Further  discussion  of  the  use  of  time-dependent 

covariates  covariates  in  an  extended  Cox  model  is  provided 

in  Chapter  6. 


This  presentation: 

Three  methods  for  assessing  PH. 

i.  graphical 

ii.  GOF 

iii.  time-dependent  covariates 

Recommend  using  at  least  two 
methods. 


This  presentation  is  now  complete.  We  have 
described  and  illustrated  three  methods  for 
assessing  the  PH  assumption:  graphical,  good- 
ness-of-fit  (GOF),  and  time-dependent  covariate 
methods.  Each  of  these  methods  has  both 
advantages  and  drawbacks.  We  recommend 
that  the  researcher  use  at  least  two  of  these 
approaches  when  assessing  the  PH  assumption. 


Chapters 


1 .  Introduction  to  Survival 
Analysis 

2.  Kaplan-Meier  Survival 
Curves  and  the  Log-Rank 
Test 

3.  The  Cox  Proportional 
Hazards  Model  and  Its 
Characteristics 


✓4. 


Evaluating  the  Proportional 
Hazards  Assumption _ 


We  suggest  that  the  reader  review  this  presen¬ 
tation  using  the  detailed  outline  that  follows. 
Then  answer  the  practice  exercises  and  the  test 
that  follow. 

The  next  Chapter  (5)  is  entitled  “The  Stratified 
Cox  Procedure.”  There,  we  describe  how  to  use 
a  stratification  procedure  to  fit  a  PH  model 
when  one  or  more  of  the  predictors  do  not 
satisfy  the  PH  assumption. 


Next: 

5.  The  Stratified  Cox  Procedure 

6.  Extension  of  the  Cox 
Proportional  Hazards  Model 
for  Time-Dependent  Variables 
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Detailed 

Outline 


I.  Background  (pages  164-165) 

A.  The  formula  for  the  Cox  PH  model: 


h(t,X)  =  ho(t)  exp 


E  p* 


B.  Formula  for  hazard  ratio  comparing  two  indi¬ 
viduals, 

X*  =  (x\,  XI,...  ,x;)and  X  =  (X,,  X2„  . Xp): 


C.  Adjusted  survival  curves  using  the  Cox  PH 

model: 

$(f,X)  =  [50(ftpE»] 

i.  To  graph  S  ( t ,  X),  must  specify  values  for 
X={XUX2,...,XP). 

ii.  To  obtain  “adjusted"  survival  curves,  usually 
use  overall  mean  values  for  the  Xs  being 
adjusted. 

D.  The  meaning  of  the  PH  assumption 

i.  Hazard  ratio  formula  shows  that  hazard 
ratio  is  independent  of  time: 

fo,X*)_g 

h(t,X) 

ii.  Hazard  ratio  for  two  X’s  are  proportional: 

h{t,X*)  =  9h(t,  X) 


II.  Checking  the  proportional  hazards  assumption: 

overview  (pages  165-167) 

A.  Three  methods  for  checking  the  PH  assumption: 

i.  Graphical:  compare  —In  —In  survival  curves 
or  observed  versus  predicted  curves. 

ii.  Goodness-of-fit  test:  use  a  large  sample  Z 
statistic. 

iii.  Time-dependent  covariates:  use  product  (i.e., 
interaction)  terms  of  the  form  X  x  g(t). 

B.  Abbreviated  illustrations  of  each  method  are 
provided. 


III.  Graphical  approach  1:  log-log  plots 

(pages  167-175) 

A.  A  log-log  curve  is  a  transformation  of  an  esti¬ 
mated  survival  curve,  where  the  scale  for  a 
log-log  curve  is  — oo  to  +00. 
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B.  The  log-log  expression  for  the  Cox  model 
survival  curve  is  given  by 

p 

ln[— lnS(f,  X)]  =  y  fijXj  +  ln[—  In  So(0] 

i=i 

C.  For  the  Cox  model,  the  log-log  survival  curve 
for  individual  X|  can  be  written  as  the  log-log 
curve  for  individual  X2  plus  a  linear  sum  term 
that  is  independent  of  time  t.  This  formula  is 
given  by 

ln[—  In  S(t,  Xi)] 

=  ln[—  In  S(t,  X2)]  +  J2  Pi(X u  ~  X*) 

i—  1 

D.  The  above  log-log  formula  can  be  used  to  check 
the  PH  assumption  as  follows:  the  PH  model  is 
appropriate  if  “empirical"  plots  of  log-log  sur¬ 
vival  curves  are  parallel. 

E.  Two  kinds  of  empirical  plots  for  —In  —In  S: 

i.  S  is  a  KM  curve 

ii.  S  is  an  adjusted  survival  curve  where  pre¬ 
dictor  being  assessed  is  not  included  in  the 
Cox  regression  model. 

F.  Several  examples  of  log-log  plots  are  provided 
using  remission  data  from  a  clinical  trial  of 
leukemia  patients. 

G.  Problems  with  log-log  curves: 

i.  How  parallel  is  parallel? 

ii.  How  to  categorize  a  continuous  variable? 

iii.  How  to  evaluate  several  variables  simulta¬ 
neously? 

H.  Recommendation  about  problems: 

i.  Use  small  number  of  categories,  meaning¬ 
ful  choice,  reasonable  balance. 

ii.  With  several  variables,  two  options: 

a.  Compare  log-log  curves  from  combina¬ 
tions  of  categories. 

b.  Adjust  for  predictors  already  satisfying 
PH  assumption. 

IV.  Graphical  approach  2:  observed  versus  expected 
plots  (pages  175-180) 

A.  Graphical  analog  of  the  GOF  test. 

B.  Two  strategies 

i.  One-at-a-time:  uses  KM  curves  to  obtain 
observed  plots. 

ii.  Adjusting  for  other  variables:  uses  stratified 
Cox  PH  model  to  obtain  observed  plots. 
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C.  Expected  plots  obtained  by  fitting  a  Cox  model 
containing  the  predictor  being  assessed;  substi¬ 
tute  into  the  fitted  model  the  value  for  each 
category  of  the  predictor  to  obtain  the  expected 
value  for  each  category. 

D.  If  observed  and  expected  plots  are  close,  con¬ 
clude  PH  assumption  is  reasonable. 

E.  Drawback:  how  close  is  close? 

F.  Recommend:  conclude  PH  not  satisfied  only  if 
plots  are  strongly  discrepant. 

G.  Another  drawback:  what  to  do  if  assessing  con¬ 
tinuous  variable. 

H.  Recommend  for  continuous  variable: 

i.  Form  strata  from  categories. 

ii.  Observed  plots  are  KM  curves  for  each  cat¬ 
egory. 

iii.  Two  options  for  expected  plots: 

a.  Use  PH  model  with  k  —  1  dummy  vari¬ 
ables  for  k  categories. 

b.  Use  PH  model  with  continuous  predic¬ 
tor  and  specify  predictor  values  that 
distinguish  categories. 

V.  The  goodness-of-fit  (GOF)  testing  approach 

(pages  181-183) 

A.  Appealing  approach  because 

i.  provides  a  test  statistic  (p-value). 

ii.  researcher  can  make  clear-cut  decision. 

B.  References 

i.  methodological:  Schoenfeld  (1982),  Harrel 
and  Lee  (1986). 

ii.  SAS  and  Stata  use  different  GOF  formulae. 

C.  The  method: 

i.  Schoenfeld  residuals  for  each  predictor 
uses  a  chi-square  statistic  with  1  df. 

ii.  Correlations  between  Schoenfeld’s  resi¬ 
duals  and  ranked  failure  times. 

iii.  If  p-value  small,  then  departure  from  PH. 

D.  Examples  using  remission  data. 

E.  Drawbacks: 

i.  global  test:  may  fail  to  detect  a  specific  kind 
of  departure  from  PH;  recommend  using 
both  graphical  and  GOF  methods. 

ii.  several  strategies  to  choose  from,  with  no 
one  strategy  clearly  preferable  (one-at-a- 
time,  all  variables,  each  variable  adjusted 
for  others). 
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VI.  Assessing  the  PH  assumption  using  time- 
dependent  covariates  (pages  183-187) 

A.  Use  extended  Cox  model:  contains  product 
terms  of  form  X  x  g{t),  where  g(t  ')  is  function 
of  time,  e.g.,  g(t)  =  t,  or  log  t,  or  heaviside  func¬ 
tion. 

B.  One-at-a-time  model: 

h(t,  X)  =  h0(t)  exp[pX  +  8Xg(t)] 

Test  H0:  8  =  0  using  Wald  or  LR  test  (chi-  square 
with  1  df). 

C.  Evaluating  several  predictors  simultaneously: 

h(t,  X)  =  h0(t)  exp  [  PiXj  +  5iXigi(t)] 

where  gfl)  is  function  of  time  for  zth  predictor. 
Test  H0 :  Si  =  82  =  =  8p  =  0  using  LR 

(chi-square)  test  with  p  df. 

D.  Examples  using  remission  data. 

E.  Two  computer  programs,  required  for  test: 

i.  Cox  PH  model  program. 

ii.  Extended  Cox  model  program. 

F.  Drawback:  choice  of  g(t)  not  always  clear;  dif¬ 
ferent  choices  may  lead  to  different  conclusions 
about  PH  assumption. 


Practice 

Exercises 


The  dataset  “vets.dat”  considers  survival  times  in  days 
for  137  patients  from  the  Veteran’s  Administration  Lung 
Cancer  Trial  cited  by  Kalbfleisch  and  Prentice  in  their  text 
{The  Statistical  Analysis  of  Survival  Time  Data,  Wiley, 
2002).  The  exposure  variable  of  interest  is  treatment  status 
(standard  =  1,  test  =  2).  Other  variables  of  interest  as 
control  variables  are  cell  type  (four  types,  defined  by 
dummy  variables),  performance  status,  disease  duration, 
age,  and  prior  therapy  status.  Failure  status  is  defined  by 
the  status  variable  (0  if  censored,  1  if  died).  A  complete  list 
of  the  variables  is  given  below. 

Column  1:  Treatment  (standard  =  1,  test  =  2) 

Column  2:  Cell  type  1  (large  =  1,  other  =  0) 

Column  3:  Cell  type  2  (adeno  =  1,  other  =  0) 

Column  4:  Cell  type  3  (small  =  1,  other  =  0) 

Column  5:  Cell  type  4  (squamous  =  1,  other  =  0) 
Column  6:  Survival  time  (days) 

Column  7:  Performance  status  (0  =  worst,. . .,  100  =  best) 
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Column  8: 
Column  9: 
Column  10: 
Column  11 


Disease  duration  (months) 

Age 

Prior  therapy  (none  =  0,  some  =10) 
Status  (0  =  censored,  1  =  died) 


1 .  State  the  hazard  function  form  of  the  Cox  PH  model 
that  describes  the  effect  of  the  treatment  variable  and 
controls  for  the  variables,  cell  type,  performance  sta¬ 
tus,  disease  duration,  age,  and  prior  therapy.  In  stat¬ 
ing  this  model,  make  sure  to  incorporate  the  cell  type 
variable  using  dummy  variables,  but  do  not  consider 
possible  interaction  variables  in  your  model. 

2.  State  three  general  approaches  that  can  be  used  to 
evaluate  whether  the  PH  assumption  is  satisfied  for 
the  variables  included  in  the  model  you  have  given  in 
question  1. 

3.  The  following  printout  is  obtained  from  fitting  a 
Cox  PH  model  to  these  data.  Using  the  information 
provided,  what  can  you  conclude  about  whether  the 
PH  assumption  is  satisfied  for  the  variables  used  in 
the  model?  Explain  briefly. 


Cox  regression 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf. 
Interval] 

P(PH) 

Treatment 

0.290 

0.207 

0.162 

1.336 

0.890 

2.006 

0.628 

Large  cell 

0.400 

0.283 

0.157 

1.491 

0.857 

2.594 

0.033 

Adeno  cell 

1.188 

0.301 

0.000 

3.281 

1.820 

5.915 

0.081 

Small  cell 

0.856 

0.275 

0.002 

2.355 

1.374 

4.037 

0.078 

Performance  status 

-0.033 

0.006 

0.000 

0.968 

0.958 

0.978 

0.000 

Disease  duration 

0.000 

0.009 

0.992 

1.000 

0.982 

1.018 

0.919 

Age 

-0.009 

0.009 

0.358 

0.991 

0.974 

1.010 

0.198 

Prior  therapy 

0.007 

0.023 

0.755 

1.007 

0.962 

1.054 

0.145 

4.  For  the  variables  used  in  the  PH  model  in  question  1, 
describe  a  strategy  for  evaluating  the  PH  assumption 
using  log-log  survival  curves  for  variables  considered 
one-at-a-time. 

5.  Again  considering  the  variables  used  in  question  1, 
describe  a  strategy  for  evaluating  the  PH  assumption 
using  log-log  survival  curves  that  are  adjusted  for 
other  variables  in  the  model. 
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6.  For  the  variable  “performance  status,”  describe  how 
you  would  evaluate  the  PH  assumption  using 
observed  versus  expected  survival  plots? 

7.  For  the  variable  “performance  status,”  log-log  plots 
which  compare  high  (>50)  with  low  (<50)  are  given 
by  the  following  graph.  Based  on  this  graph,  what  do 
you  conclude  about  the  PH  assumption  with  regard  to 
this  variable? 


8.  What  are  some  of  the  drawbacks  of  using  the  log-log 
approach  for  assessing  the  PH  assumption  and  what 
do  you  recommend  to  deal  with  these  drawbacks? 

9.  For  the  variable  “performance  status,”  observed  ver¬ 
sus  expected  plots  that  compare  high  (>50)  with  low 
(<50)  are  given  by  the  following  graph.  Based  on  this 
graph,  what  do  you  conclude  about  the  PH  assump¬ 
tion  with  regard  to  this  variable? 


0.8 
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10.  State  the  form  of  an  extended  Cox  model  that  allows 
for  the  one-at-a-time  assessment  of  the  PH  assump¬ 
tion  for  the  variable  "performance  status,”  and 
describe  how  you  would  carry  out  a  statistical  test  of 
the  assumption  for  this  variable. 

1 1 .  State  the  form  of  an  extended  Cox  model  that  allows 
for  the  simultaneous  assessment  of  the  PH  assump¬ 
tion  for  the  variables  treatment,  cell  type,  perfor¬ 
mance  status,  disease  duration,  age,  and  prior 
therapy.  For  this  model,  describe  how  you  would 
carry  out  a  statistical  test  of  the  PH  assumption  for 
these  variables.  Also,  provide  a  strategy  for  assessing 
which  of  these  variables  satisfy  the  PH  assumption 
and  which  do  not  using  the  extended  Cox  model 
approach. 
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Test 


12.  Using  any  of  the  information  provided  above  and 
any  additional  analyses  that  you  perform  with  this 
dataset,  what  do  you  conclude  about  which  variables 
satisfy  the  PH  assumption  and  which  variables  do  not? 
In  answering  this  question,  summarize  any  additional 
analyses  performed. 


The  following  questions  consider  a  dataset  from  a  study 
by  Caplehorn  et  al.  (“Methadone  Dosage  and  Retention  of 
Patients  in  Maintenance  Treatment,”  Med.  J.  Aust.,  1991). 
These  data  comprise  the  times  in  days  spent  by  heroin 
addicts  from  entry  to  departure  from  one  of  two  metha¬ 
done  clinics.  There  are  two  additional  covariates,  namely, 
prison  record  and  maximum  methadone  dose,  believed  to 
affect  the  survival  times.  The  dataset  name  is  addicts.dat. 
A  listing  of  the  variables  is  given  below: 


Column  1: 
Column  2: 
Column  3: 

Column  4: 
Column  5: 
Column  6: 


Subject  ID 
Clinic  (1  or  2) 

Survival  status  (0  =  censored,  1  = 
departed  from  clinic) 

Survival  time  in  days 

Prison  record  (0  =  none,  1  =  any) 

Maximum  methadone  dose  (mg/day) 


1.  The  following  edited  printout  was  obtained  from  fit¬ 
ting  a  Cox  PH  model  to  these  data: 


Cox  regression 
Analysis  time_t: 
survt 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf. 
Interval] 

P(PH ) 

Clinic 

-1.009 

0.215 

0.000 

0.365 

0.239 

0.556 

0.001 

Prison 

0.327 

0.167 

0.051 

1.386 

0.999 

1.924 

0.332 

Dose 

-0.035 

0.006 

0.000 

0.965 

0.953 

0.977 

0.347 

No.  of  subjects:  238  Log  likelihood  =  —673.403 

Based  on  the  information  provided  in  this  printout, 
what  do  you  conclude  about  which  variables  satisfy 
the  PH  assumption  and  which  do  not?  Explain  briefly. 
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2.  Suppose  that  for  the  model  fit  in  question  1,  log-log 
survival  curves  for  each  clinic  adjusted  for  prison  and 
dose  are  plotted  on  the  same  graph.  Assume  that  these 
curves  are  obtained  by  substituting  into  the  formula 
for  the  estimated  survival  curve  the  values  for  each 
clinic  and  the  overall  mean  values  for  the  prison  and 
dose  variables.  Below,  we  show  these  two  curves.  Are 
they  parallel?  Explain  your  answer. 


3.  The  following  printout  was  obtained  from  fitting  a 
stratified  Cox  PH  model  to  these  data,  where  the  vari¬ 
able  being  stratified  is  clinic: 


Stratified  Cox 
regression  Analysis 
time.t:  survt  (in  days) 

Coef. 

Std.  Err. 

P  >  lzl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

Prison 

0.389 

0.169 

0.021 

1.475 

1.059 

2.054 

Dose 

-0.035 

0.006 

0.000 

0.965 

0.953 

0.978 

No.  of  subjects  =  238  Log  likelihood  =  —597.714  Stratified  by  clinic 

Using  the  above  fitted  model,  we  can  obtain  the 
log-log  curves  below  that  compare  the  log-log  survival 
for  each  clinic  (i.e.,  stratified  by  clinic)  adjusted  for  the 
variables  prison  and  dose.  Using  these  curves,  what  do 
you  conclude  about  whether  or  not  the  clinic  variable 
satisfies  the  PH  assumption?  Explain  briefly. 
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4.  Consider  the  two  plots  of  log-log  curves  below  that 
compare  the  log-log  survival  for  the  prison  variable 
ignoring  other  variables  and  adjusted  for  the  clinic 
and  dose  variables.  Using  these  curves,  what  do  you 
conclude  about  whether  or  not  the  prison  variable 
satisfies  the  PH  assumption?  Explain  briefly. 


2  1 

9  - 

Log-log  curves  for  prison 
ignoring  other  variables 
(i.e.,  using  log-log  KM  curves) 

12 1 
9  - 

Log-log  curves  for  prison 
adjusted  for  clinic  and  dose 
(i.e.,  stratified  by  prison) 

6  - 

6  - 

3  - 

Prison  =  1 

3  - 

Prison  =  1 

0  - 

Prison  =  0 

0  - 

Prison  =  0  - 

-3  - 

— i — i — i — i — i — i — i — i — i — i — i 

-3  - 

. . . 

0  300  600  900  0  300  600  900 


5.  How  do  your  conclusions  from  question  1  compare 
with  your  conclusions  from  question  4?  If  the  conclu¬ 
sions  differ,  which  conclusion  do  you  prefer?  Explain. 

6.  Describe  briefly  how  you  would  evaluate  the  PH 
assumption  for  the  variable  maximum  methadone 
dose  using  observed  versus  expected  plots. 

7.  State  an  extended  Cox  model  that  would  allow  you  to 
assess  the  PH  assumption  for  the  variables  clinic, 
prison,  and  dose  simultaneously.  For  this  model, 
state  the  null  hypothesis  for  the  test  of  the  PH  assump¬ 
tion  and  describe  how  the  likelihood  ratio  statistic 
would  be  obtained  and  what  its  degrees  of  freedom 
would  be  under  the  null  hypothesis. 

8.  State  at  least  one  drawback  to  the  use  of  the  extended 
Cox  model  approach  described  in  question  7. 

9.  State  an  extended  Cox  model  that  would  allow  you  to 
assess  the  PH  assumption  for  the  variable  clinic  alone, 
assuming  that  the  prison  and  dose  variables  already 
satisfy  the  PH  assumption.  For  this  model,  state  the 
null  hypothesis  for  the  test  of  the  PH  assumption,  and 
describe  how  the  likelihood  ratio  (LR)  statistic  would 
be  obtained.  What  is  the  degrees  of  freedom  of  the  LR 
test  under  the  null  hypothesis? 
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10.  Consider  the  situation  described  in  question  9,  where 
you  wish  to  use  an  extended  Cox  model  that  would 
allow  you  to  assess  the  PH  assumption  for  the  variable 
clinic  alone,  assuming  that  the  assumption  is  satisfied 
for  the  prison  and  dose  variables.  Suppose  you  use  the 
following  extended  Cox  model: 

h(t,X.)  =  ho(t)  exp[/?j  (prison)  +  /f2(dose) 

+/f3  (clinic)  +  <5i(clinic)g(f)] 


where  g(t)  is  defined  as  follows: 


if  t  >  365  days 
if  t  <  365  days 


For  the  above  model,  what  is  the  formula  for  the  hazard 
ratio  that  compares  clinic  1  to  clinic  2  when  t  is  greater 
than  365  days?  when  t  is  less  than  or  equal  to  365  days? 
In  terms  of  the  hazard  ratio  formulae  just  described, 
what  specific  departure  from  the  PH  assumption  is 
being  tested  when  the  null  hypothesis  is  H0:  =  0? 


1.  /z(t,X)  =  h0{t)  expfPi (treatment)  +  p2(CTl)  +  p3(CT2) 
+  p4(CT3)  +  p5(PS)  +  P6(DD)  +  p7(Age)  +  p8(PT)] 

where  CTi  denotes  the  cell  type  i  dummy  variable,  PS 
denotes  the  performance  status  variable  DD  denotes 
the  disease  duration  variable,  and  PT  denotes  the 
prior  therapy  variable. 

2.  The  three  general  approaches  for  assessing  the  PH 
model  for  the  above  model  are: 

(a)  graphical,  using  either  log-log  plots  or  observed 
versus  expected  plots; 

(b)  statistical  test; 

(c)  an  extended  Cox  model  containing  product  terms 
involving  the  variables  being  assessed  with  some 
function(s)  of  time. 

3.  The  P(PH)  values  given  in  the  printout  provide  good- 
ness-of-fit  tests  for  each  variable  in  the  fitted  model 
adjusted  for  the  other  variables  in  the  model.  The  P 
{PH)  values  shown  indicate  that  the  large  cell  type 
variables  and  the  performance  status  variable  do  not 
satisfy  the  PH  assumption,  whereas  the  treatment, 
age,  disease  duration,  and  prior  therapy  variables 
satisfy  the  PH  assumption,  and  the  adeno  and  small 
cell  type  variable  are  of  borderline  significance. 
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4.  A  strategy  for  evaluating  the  PH  assumption  using 
log-log  survival  curves  for  variables  considered  one- 
at-a-time  is  given  as  follows: 

For  each  variable  separately,  obtain  a  plot  of  log-log 
Kaplan-Meier  curves  for  the  different  categories  of 
that  variable.  For  the  cell  type  variable,  this  requires 
obtaining  a  plot  of  four  log-log  KM  curves,  one  for 
each  cell  type.  (Note  that  this  is  not  the  same  as  obtain¬ 
ing  four  separate  plots  of  two  log-log  curves,  where 
each  plot  corresponds  to  one  of  the  dummy  variables 
used  in  the  model.)  For  the  variables  PS,  DD,  and  Age, 
which  are  interval  variables,  each  variable  must  be 
separately  categorized  into  two  or  more  groups  -  say, 
low  versus  high  values  -  and  KM  curves  are  obtained 
for  each  group.  For  the  variable  PT,  which  is  a  dichot¬ 
omous  variable,  two  log-log  curves  are  obtained  which 
compare  the  “none”  versus  “some”  groups. 

For  each  plot  (i.e.,  one  for  each  variable),  those  plots 
that  are  noticeably  nonparallel  indicate  variables  which 
do  not  satisfy  the  PH  assumption.  The  remaining  vari¬ 
ables  are  assumed  to  satisfy  the  PH  assumption. 

5 .  One  strategy  for  evaluating  the  PH  assumption  for  each 
variable  adjusted  for  the  others  is  to  use  adjusted  log-log 
survival  curves  instead  of  KM  curves  separately  for  each 
of  the  variables  in  the  model.  That  is,  for  each  variable 
separately,  a  stratified  Cox  model  is  fit  stratifying  on  the 
given  variable  while  adjusting  for  the  other  variables. 
Those  variables  that  yield  adjusted  log-log  plots  that  are 
noticeably  nonparallel  are  then  to  be  considered  as  not 
satisfying  the  PH  assumption.  The  remaining  variables 
are  assumed  to  satisfy  the  PH  assumption. 

A  variation  of  the  above  strategy  uses  adjusted  log-log 
curves  for  only  those  variables  not  satisfying  the  PH 
assumption  from  a  one-at-a-time  approach,  adjusting 
for  those  variables  satisfying  the  PH  assumption  from 
the  one-at-a-time  approach.  This  second  iteration 
would  flag  a  subset  of  the  one-at-a-time  flagged  vari¬ 
ables  for  further  iteration.  At  each  new  iteration,  those 
variables  found  to  satisfy  the  assumption  get  added  to 
the  list  of  variables  previously  determined  to  satisfy 
the  assumption. 

6.  For  the  performance  status  (PS)  variable,  observed 
plots  are  obtained  by  categorizing  the  variable  into 
strata  (say,  two  strata:  low  versus  high)  and  then 
obtaining  KM  survival  plots  for  each  stratum. 
Expected  plots  can  be  obtained  by  fitting  a  Cox 
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model  containing  the  (continuous)  PS  variable  and 
then  obtaining  estimated  survival  curves  for  values 
of  the  performance  status  (PS)  variable  that  represent 
summary  descriptive  statistics  for  the  strata  previ¬ 
ously  identified.  For  example,  if  there  are  two  strata, 
say,  high  (PS  >  50)  and  low  (PS  <  50),  then  the  values 
of  PS  to  be  used  could  be  the  mean  or  median  PS  score 
for  persons  in  the  high  stratum  and  the  mean  or 
median  PS  score  for  persons  in  the  low  stratum. 

An  alternative  method  for  obtaining  expected  plots 
involves  first  dichotomizing  the  PS  variable  -  say,  into 
high  and  low  groups  -  and  then  fitting  a  Cox  model 
containing  the  dichotomized  PS  variable  instead  of  the 
original  continuous  variable.  The  expected  survival 
plots  for  each  group  are  estimated  survival  curves 
obtained  for  each  value  of  the  dichotomized  PS  variable. 

Once  observed  and  expected  plots  are  obtained  for  each 
stratum  of  the  PS  variable,  they  are  then  compared  on 
the  same  graph  to  determine  whether  or  not 
corresponding  observed  and  expected  plots  are  “close.” 
If  it  is  determined  that,  overall,  comparisons  for  each 
stratum  are  close,  then  it  is  concluded  that  the  PH 
assumption  is  satisfied  for  the  PH  variable.  In  determin¬ 
ing  how  close  is  close,  the  researcher  should  look  for 
noticeably  discrepant  observed  versus  expected  plots. 

7.  The  log-log  plots  that  compare  high  versus  low  PS 
groups  (ignoring  other  variables)  are  arguably  parallel 
early  in  follow-up,  and  are  not  comparable  later 
because  survival  times  for  the  two  groups  do  not  over¬ 
lap  after  400  days.  These  plots  do  not  strongly  indicate 
that  the  PH  assumption  is  violated  for  the  variable  PS. 
This  contradicts  the  conclusion  previously  obtained 
for  the  PS  variable  using  the  P(PH)  results. 

8.  Drawbacks  of  the  log-log  approach  are: 

•  How  parallel  is  parallel? 

•  How  to  categorize  a  continuous  variable? 

•  How  to  evaluate  several  variables  simultaneously? 

Recommendations  about  problems: 

•  Look  for  noticeable  nonparallelism;  otherwise  PH 
assumption  is  OK. 

•  For  continuous  variables,  use  a  small  number  of 
categories,  a  meaningful  choice  of  categories,  and 
a  reasonable  balance  in  sample  size  for  categories. 
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•  With  several  variables,  there  are  two  options: 

i.  Compare  log-log  curves  from  combinations  of 
categories. 

ii.  Adjust  for  predictors  already  satisfying  PH 
assumption. 

9.  The  observed  and  expected  plots  are  relatively  close  for 
low  and  high  groups  separately,  although  there  is  some¬ 
what  more  discrepancy  for  the  high  group  than  for  the 
low  group.  Deciding  how  close  is  close  is  quite  subjec¬ 
tive  for  these  plots.  Nevertheless,  because  there  are  no 
major  discrepancies  for  either  low  or  high  groups,  we 
consider  the  PH  assumption  satisfied  for  this  variable. 

10.  h(t,X)  =  h0(t )  exptfyfPS)  +  8(PS)g(f)] 

where  g(t)  is  a  function  of  t,  such  as  g(t)  =  t,  or  g(t)  = 
log  t,  or  a  heaviside  function.  The  PH  assumption  is 
tested  using  a  1  df  Wald  or  LR  statistic  for  H0:  8  =  0. 

11.  h(t,X)  =  h0(t )  exp[p] (treatment)  +  p2(CTl)  +  p3(CT2) 
+  p4(CT3)  +  P5(PS)  +  p6(DD)  +  p7(Age)  +  p8(PT) 

+  S^treatment  x  g(t))  +  82(CT1  x  g(t))  +  83(CT2  x  g(t)) 
+  84(CT3  x  g(t))  +  S5(PS  x  g(t))  +  86(DD  x  g(t)) 

+  87(Age  x  g(t))  +  8g(PT  x  g(t))] 

where  g(t)  is  some  function  of  time,  such  as  g(t)  =  t,  or 
g(t)  =  log  t,  or  a  heaviside  function.  To  test  the  PH 
assumption  simultaneously  for  all  variables,  the  null 
hypothesis  is  stated  as  H0:  Si  =  82  =  . . .  =  S8  =  0.  The 
test  statistic  is  a  likelihood-ratio  statistic  of  the  form 

LR  =  -2  In  LR  -  (-2  In  LF) 

where  R  denotes  the  reduced  (PH)  model  obtained 
when  all  8’s  are  0,  and  F  denotes  the  full  model  given 
above.  Under  H0l  the  LR  statistic  is  approximately  chi- 
square  with  8  df. 

1 2 .  The  question  here  is  somewhat  open-ended,  leaving  the 
reader  the  option  to  explore  additional  graphical,  GOF, 
or  extended  Cox  model  approaches  for  evaluating  the 
PH  assumption  for  the  variables  in  the  model.  The  con¬ 
clusions  from  the  GOF  statistics  provided  in  question  3 
are  likely  to  hold  up  under  further  scrutiny,  so  that  a 
reasonable  conclusion  is  that  cell  type  and  performance 
status  variables  do  not  satisfy  the  PH  assumption,  with 
the  remaining  variables  satisfying  the  assumption. 
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Introduction 


Abbreviated 

Outline 


We  begin  with  an  example  of  the  use  of  the  stratified  Cox 
procedure  for  a  single  predictor  that  does  not  satisfy  the 
PH  assumption.  We  then  describe  the  general  approach 
for  fitting  a  stratified  Cox  model,  including  the  form  of 
the  (partial)  likelihood  function  used  to  estimate  model 
parameters. 

We  also  describe  the  assumption  of  no  interaction  that  is 
typically  incorporated  into  most  computer  programs  that 
carry  out  the  stratified  Cox  procedure.  We  show  how  the 
no-interaction  assumption  can  be  tested,  and  what  can  be 
done  if  interaction  is  found. 

We  conclude  with  a  second  example  of  the  stratified  Cox 
procedure  in  which  more  than  one  variable  is  stratified. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Preview  (page  204) 

II.  An  Example  (pages  204-208) 

III.  The  General  Stratified  Cox  (SC)  Model 

(pages  208-209) 

IV.  The  No-Interaction  Assumption  and  How  to  Test 

It  (pages  210-216) 

V.  A  Second  Example  Involving  Several 
Stratification  Variables  (pages  216-221) 

VI.  A  Graphical  View  of  the  Stratified  Cox  Approach 

(pages  221-222) 

VII.  The  Stratified  Cox  Likelihood  (pages  223-225) 
VIII.  Summary  (pages  225-227) 
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Objectives 


Upon  completing  the  chapter,  the  learner  should  be  able  to: 

1 .  Recognize  a  computer  printout  for  a  stratified  Cox 
procedure. 

2.  State  the  hazard  form  of  a  stratified  Cox  model  for  a 
given  survival  analysis  scenario  and/or  a  given  set  of 
computer  results  for  such  a  model. 

3.  Evaluate  the  effect  of  a  predictor  of  interest  based  on 
computer  results  from  a  stratified  Cox  procedure. 

4.  For  a  given  survival  analysis  scenario  and/or  a  given  set 
of  computer  results  involving  a  stratified  Cox  model. 

•  state  the  no-interaction  assumption  for  the  given 
model; 

•  describe  and/or  carry  out  a  test  of  the  no-interaction 
assumption; 

•  describe  and/or  carry  out  an  analysis  when  the 
no-interaction  assumption  is  not  satisfied. 
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Presentation 


I.  Preview 

Stratified  Cox  model: 


•  modification  of  Cox  PH  model 

•  Stratification  of  predictor  not 
satisfying  PH 

•  includes  predictors  satisfying 
PH 


The  “stratified  Cox  model”  is  a  modification  of 
the  Cox  proportional  hazards  (PH)  model  that 
allows  for  control  by  “stratification”  of  a  pre¬ 
dictor  that  does  not  satisfy  the  PH  assumption. 
Predictors  that  are  assumed  to  satisfy  the  PH 
assumption  are  included  in  the  model,  whereas 
the  predictor  being  stratified  is  not  included. 

In  this  presentation,  we  focus  on  how  stratifi¬ 
cation  is  carried  out  by  describing  the  analysis 
of  computer  results  and  the  form  of  the  hazard 
function  for  a  stratified  Cox  model.  We  first 
consider  stratifying  on  a  single  predictor  and 
then  later  consider  stratifying  on  two  or  more 
predictors.  Further,  we  distinguish  between  the 
use  of  a  "no-interaction”  version  of  the  stratified 
Cox  model  and  an  alternative  approach  that 
allows  interaction. 


II.  An  Example  Consider  the  computer  results  shown  here  for  a 

Cox  PH  model  containing  the  three  variables, 
log  WBC,  treatment  group  ( Rx ),  and  SEX. 
These  results  derive  from  a  clinical  trial  of  42 
leukemia  patients,  where  the  response  of  inter¬ 
est  is  days  in  remission. 

From  the  printout,  the  P{PH )  values  for  log 
WBC  and  treatment  group  are  nonsignificant. 
However,  the  P{PH)  value  for  SEX  is  signifi¬ 
cant  below  the  .05  level.  These  results  indicate 
that  log  WBC  and  treatment  group  satisfy  the 
PH  assumption,  whereas  the  SEX  variable 
does  not.  The  same  conclusions  regarding  the 
PH  assumption  about  these  variables  would 
also  be  made  using  the  graphical  procedures 
described  earlier. 

Because  we  have  a  situation  where  one  of  the 
predictors  does  not  satisfy  the  PH  assumption, 
we  cany  out  a  stratified  Cox  (SC)  procedure  for 
the  analysis.  Using  SC,  we  can  control  for  the 
SEX  variable  -  which  does  not  satisfy  the  PH 
assumption  -  by  stratification  while  simulta¬ 
neously  including  in  the  model  the  log  WBC 
and  treatment  variables  -  which  do  satisfy  the 
PH  assumption. 


EXAMPLE 


Clinical  trial:  42  leukemia  patients 
Response-days  in  remission 

Coef.  Std.  Err.  P(PH) 

log  WBC  1.594  0.330 

Rx  1.391  0.457 

Sex  0.263  0.449 


•  log  WBC  and  Rx  satisfy  PH 

•  Sex  does  not  satisfy  PH 

(Same  conclusions  using  graphical 
approaches) 


Stratified  Cox  (SC): 

•  control  for  sex  (stratified); 

•  simultaneously  include  log  WBC 
and  Rx  in  the  model 


0.828 

0.935 

0.031 
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EXAMPLE:  (continued) 


STATA  OUTPUT  USING  SC: 

Stratified  Cox  regression 
Analysis  time_t:  survt 

Std.  Haz. 

Coef.  Err.  p  >  |z|  Ratio  [95%  Conf.  Interval] 

log  WBC  1.390  0.338  0.000  4.016  2.072  7.783 
Rx  (0.931  )  0.472  0.048  (2.537)  1.006  6.396 

No.  of  subjects  =  42  Log  likelihood  =  —57.560  Stratified  by  sex 

Appendix  A  illustrates  SC  procedures 
using  Stata,  SAS,  SPSS,  and  R 

•  Log  WBC  and  Rx  are  included  in 
SC  model. 

•  SC  model  is  stratified  by  SEX. 


Effect  of  Rx  adjusted  for  log  WBC  and 
SEX. 

•  Hazard  ratio:  2.537  =  e0  931 

•  Interpretation:  Placebo  group 
(Rx  =  1)  has  2.5  times  the  hazard 
as  the  treatment  group  (Rx  =  0) 

Stratified  Cox  regression 
Analysis  time_t:  survt 

Std.  Haz. 

Coef.  Err.  p  >  |z|  Ratio  [95%  Conf.  Interval] 

log  WBC  1.390  0.338  0.000  4.016  2.072  7.783 
Rx  f  0.931  0.472  )  0.048  2.537  ( 1.006  6.396) 

No.  of  subjects  =  42  Log  likelihood  (  57.561))  Stratified  by  sex 

95%  Cl  for  Rx  (1.006,  6.396)  indicates 
considerable  variability. 

Cl  formula:  exp(0.931  ±  1.96  x  0.472) 


Wald  test:  P  =  0.048  (two-tailed), 
significant  at  the  0.05  level. 


The  computer  results  from  a  SC  procedure  are 
shown  here.  These  results  come  from  the  Stata 
package.  (See  the  Computer  Appendix  for  run¬ 
ning  a  SC  procedure  in  Stata,  SAS,  SPSS,  and  R). 


The  computer  results  show  that  the  log  WBC 
and  Rx  variables  are  included  in  the  model  list¬ 
ing,  whereas  the  SEX  variable  is  not  included; 
rather,  the  model  stratifies  on  the  SEX  variable, 
as  indicated  at  the  bottom  of  the  output.  Note 
that  the  SEX  variable  is  being  adjusted  by  strat¬ 
ification,  whereas  log  WBC  is  being  adjusted  by 
its  inclusion  in  the  model  along  with  Rx. 

In  the  above  output,  we  have  also  circled  some 
key  information  that  can  be  used  to  assess  the 
effect  of  the  Rx  variable  adjusted  for  both  log 
WBC  and  SEX.  In  particular,  we  can  see  that 
the  hazard  ratio  for  the  effect  oiRx  adjusted  for 
log  WBC  and  SEX  is  given  by  the  value  2.537. 
This  value  can  be  obtained  by  exponentiating 
the  coefficient  0.931  of  the  Rx  variable.  The 
hazard  ratio  value  can  be  interpreted  to  mean 
that  the  placebo  group  (for  which  Rx  =  1 )  has 
2.5  times  the  hazard  for  going  out  of  remission 
as  the  treatment  group  (for  which  Rx  =  0). 


Also,  we  can  see  from  the  output  that  a  95% 
confidence  interval  for  the  effect  of  the  Rx  var¬ 
iable  is  given  by  the  limits  1.006  to  6.396.  This 
is  a  fairly  wide  range,  thus  indicating  con¬ 
siderable  variability  in  the  2.537  hazard  ratio 
point  estimate.  Note  that  these  confidence  lim¬ 
its  can  be  obtained  by  exponentiating  the  quan¬ 
tity  0.93 1  plus  or  minus  1 .96  times  the  standard 
error  0.472. 

From  the  above  output,  a  test  for  the  signifi¬ 
cance  of  the  Rx  variable  adjusted  for  log  WBC 
and  SEX  is  given  by  the  Wald  statistic  P-value 
of  0.048.  This  is  a  two-tailed  P-value,  and  the 
test  is  (barely)  significant  at  the  0.05  level. 
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EXAMPLE:  (continued) 


LR  test:  Output  for  reduced  model 
Stratified  Cox  regression 
Analysis  time  _t:  survt 

Std.  Haz. 

Coef.  Err.  p  >  |z|  Ratio  [95%  Conf.  Interval] 
log  WBC  1.456  0.320  0.000  4.289  2.291  8.03 

No.  of  subjects  =  42  Log  likelihood  =(—59.648)  Stratified  by  sex 

LR  =  (-2  x  -59.648)  -  (-2  x  -57.560) 
=  119.296-  115.120  =  4.179  (P  <  0.05) 

LR  and  Wald  give  same  conclusion. 


Hazard  function  for  stratified  Cox 
model: 

hg(t,X)  =  /jogfdexplfSjiCr  +  p2  log  WBC] 
g=U; 

g  denotes  stratum  #. 

SC  model  for  males  and  females: 
Females  (g  =  1): 

hx(t,X)  =  /loiMexplp^r  +  p2  log  WBC] 
Males  (g  =  2): 

h2(t,X)  =  7!02(f)exp[p!Rx  +  p2  log  WBC] 

Rx  and  log  WBC  in  the  model 
Sex  not  in  the  model  (stratified) 

HR  for  effect  of  Rx  adjusted  for  log 
WBC  and  sex: 

where  Pi  is  the  coefficient  otRx. 


An  alternative  test  involves  a  likelihood  ratio 
(LR)  statistic  that  compares  the  above  model 
(full  model)  with  a  reduced  model  that  does  not 
contain  the  Rx  variable.  The  output  for  the 
reduced  model  is  shown  here.  The  log-likelihood 
statistic  for  the  reduced  model  is  —2  times 
—59.648,  which  is  to  be  compared  with  the  log- 
likelihood  statistic  of  —2  times  —57.560  for  the 
full  model. 

The  LR  statistic  is  therefore  119.296  minus 
115.120,  which  equals  4.179.  Under  H0,  this 
statistic  has  a  chi-square  distribution  with  one 
degree  of  freedom  and  is  significant  at  the  0.05 
level.  Thus,  the  LR  and  Wald  tests  lead  to  the 
same  conclusion. 

So  far,  we  have  illustrated  the  results  from 
a  stratified  Cox  procedure  without  actually 
describing  the  model  form  being  used.  For  the 
remission  data  example,  we  now  present  the 
hazard  function  form  for  the  stratified  Cox 
model,  as  shown  here.  This  hazard  function 
formula  contains  a  subscript  g  that  indicates 
the  gth  stratum. 

Thus,  in  our  remission  data  example,  where  we 
have  stratified  on  SEX,  g  takes  on  one  of  two 
values,  so  that  we  have  a  different  baseline 
hazard  function  for  males  and  females. 

Notice  that  the  hazard  function  formula  con¬ 
tains  the  variables  Rx  and  log  WBC,  but  does 
not  contain  the  variable  SEX.  SEX  is  not 
included  in  the  model  because  it  doesn’t  satisfy 
the  PH  assumption.  So,  instead,  the  SEX  vari¬ 
able  is  controlled  by  stratification. 

Because  the  variables  Rx  and  log  WBC  are 
included  in  the  model,  we  can  estimate  the 
effect  of  each  variable  adjusted  for  the  other 
variable  and  the  SEX  variable  using  standard 
exponential  hazard  ratio  expressions.  For 
example,  the  estimated  hazard  ratio  for  the 
effect  of  Rx,  adjusted  for  log  WBC  and  SEX,  is 
given  by  e  to  the  Pi  “hat,”  where  pi  is  the  coeffi¬ 
cient  of  the  Rx  variable. 
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EXAMPLE:  (continued) 


Cannot  estimate  HR  for  SEX  variable 
(SEX  doesn’t  satisfy  PH). 


Different  baseline  hazard  functions: 

h01(t )  for  females  and  h02(t)  for  males. 

Same  coefficients  Pi  and  p2  for  both 
female  and  male  models. 


"  /?oi  (?)  =>  Survival  curve 

Different  for  females 

baselines  /?o2(0  =>  Survival  curve 

for  males 


Females  and  males: 

same  P]  and  p2  =>  same  HR’s,  e.g.,  e^1 

No  interaction  assumption  (see 
Section  IV) 


Estimates  of  pi  and  p2 

Maximize  partial  likelihood  (L), 
where  L  =  L\  x  L2 

L\  is  the  likelihood  for  females  derived 
from  hi(t), 

and  L2  is  the  likelihood  for  males 
derived  from  h2(t). 


Nevertheless,  because  the  SEX  variable  is  not 
included  in  the  model,  it  is  not  possible  to 
obtain  a  hazard  ratio  value  for  the  effect  of 
SEX  adjusted  for  the  other  two  variables.  This 
is  the  price  to  be  paid  for  stratification  on  the 
SEX  variable.  Note  that  a  single  value  for  the 
hazard  ratio  for  SEX  is  not  appropriate  if  SEX 
doesn’t  satisfy  the  PH  assumption,  because  the 
hazard  ratio  must  then  vary  with  time. 

Notice  also  that  the  hazard  functions  for  males 
and  females  differ  only  insofar  as  they  have 
different  baseline  hazard  functions,  namely, 
/z0i(f)  for  females  and  h02{t)  for  males.  How¬ 
ever,  the  coefficients  pi  and  p2  are  the  same 
for  both  female  and  male  models. 

Because  there  are  different  baseline  hazard 
functions,  the  fitted  stratified  Cox  model  will 
yield  different  estimated  survival  curves  for 
females  and  males.  These  curves  will  be 
described  shortly. 

Note,  however,  that  because  the  coefficients  of 
Rx  and  log  WBC  are  the  same  for  females  and 
males,  estimates  of  hazard  ratios,  such  as  e  to 
the  pi  “hat,”  are  the  same  for  both  females  and 
males.  This  feature  of  the  stratified  Cox  model 
is  called  the  “no-interaction”  assumption.  It  is 
possible  to  evaluate  whether  this  assumption 
is  tenable  and  to  modify  the  analysis  if  not 
tenable.  We  will  discuss  this  assumption  fur¬ 
ther  in  Section  IV. 

To  obtain  estimates  of  Pi  and  p2,  a  (partial) 
likelihood  function  (L)  is  formed  from  the 
model  and  the  data;  this  function  is  then  max¬ 
imized  using  computer  iteration.  The  likeli¬ 
hood  function  (L)  for  the  stratified  Cox  (SC) 
model  is  different  from  the  nonstratified  Cox 
model.  For  the  SC  model,  L  is  obtained  by  mul¬ 
tiplying  together  likelihood  functions  for  each 
stratum.  Thus,  L  is  equal  to  the  product  of  Li 
and  L2,  where  Lx  and  L2  denote  the  female  and 
male  likelihood  functions,  respectively,  which 
are  derived  from  their  respective  hazard  func¬ 
tions  hi(t)  and  h2(t). 
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EXAMPLE:  (continued) 


Adjusted  Survival  Curves  for  Rx 
from  Stratified  Cox  Model 
(adjusted  for  log  WBC) 


As  mentioned  above,  adjusted  survival  curves 
can  be  obtained  for  each  stratum  as  shown 
here.  Here  we  have  shown  four  survival  curves 
because  we  want  to  compare  the  survival  for 
two  treatment  groups  over  each  of  two  strata. 

If  we  compare  treatment  and  placebo  group 
separately  by  sex,  we  can  see  that  the  treatment 
group  has  consistently  better  survival  prog¬ 
nosis  than  the  placebo  group  for  females  and 
males  separately.  This  supports  our  findings 
about  the  hazard  ratio  for  the  treatment  effect 
derived  earlier  from  the  computer  results  for 
the  stratified  Cox  model. 


HI.  The  General  Stratified 
Cox  (SC)  Model 


Example:  one  binary  predictor 

i 

General:  several  predictors,  several 
strata 

Z1(  Z2,. . .,  Zk,  do  not  satisfy  PH 
Xu  X2>. . .,  Xp,  satisfy  PH 


Define  a  single  new  variable  Z*: 

1 .  categorize  each  Z, 

2.  form  combinations  of 
categories  (strata) 

3 .  the  strata  are  the  categories  of  Z* 


EXAMPLE 


Age 


Young 

Middle 

Old 

Treatment 

Placebo 

1 

2 

3 

status 

Treatment 

4 

5 

6 

Z*  —  new  variable  with  six  categories  stratify  on  Z* 


In  the  previous  example,  we  illustrated  the  SC 
model  for  one  binaiy  predictor  not  satisfying  the 
PH  assumption.  We  now  describe  the  general 
form  of  the  SC  model  that  allows  for  stratifica¬ 
tion  of  several  predictors  over  several  strata. 

We  assume  that  we  have  k  variables  not  satisfy¬ 
ing  the  PH  assumption  and  /;  variables  satisfying 
the  PH  assumption.  The  variables  not  satisfy¬ 
ing  the  PH  assumption  we  denote  as  Z1(  Z2,. . ., 
Zkt  the  variables  satisfying  the  PH  assumption 
we  denote  as  Xx,  X2>. . .,  Xp. 

To  perform  the  stratified  Cox  procedure,  we 
define  a  single  new  variable,  which  we  call  Z* , 
from  the  Z’s  to  be  used  for  stratification.  We  do 
this  by  forming  categories  of  each  Zh  including 
those  Z|  that  are  interval  variables.  We  then 
form  combinations  of  categories,  and  these 
combinations  are  our  strata.  These  strata  are 
the  categories  of  the  new  variable  Z* . 

For  example,  suppose  k  is  2,  and  the  two  Z’s  are 
age  (an  interval  variable)  and  treatment  status 
(a  binary  variable).  Then  we  categorize  age  into, 
say,  three  age  groups  -  young,  middle,  and  old. 
We  then  form  six  age  group-by-treatment- 
status  combinations,  as  shown  here.  These  six 
combinations  represent  the  different  categories 
of  a  single  new  variable  that  we  stratify  on  in 
our  stratified  Cox  model.  We  call  this  new 
variable  Z* . 


Presentation:  111.  The  General  Stratified  Cox  (SC)  Model  209 


Z*  has  k*  categories  where  k*  = 
total  #  of  combinations  (strata), 
e.g.,  k*  —  6  in  above  example. 


The  general  SC  model: 
hg(t,  X)  =  /jQgWexpfp!  Xi  +  p2  X2 
+  ■  •  •  +  Pp  Xp ] 

g  =  1,2,...,  k* ,  strata  defined 
from  Z* 


Z*  not  included  in  the  model 
X\ ,  X2>.  . .,  Xp  included  in  the  model 


Different  baseline  hazard  functions: 
h0g(t),g=\,2,...,k* 

Same  coefficients:  pi,  p2,. . .,  Pp 


"  hn  (t) 
Different  fua{t) 
baselines  : 

.  hok(t) 


Slit) 

Slit) 

Skit) 


Different 

survival 

curves 


HR  same  for  each  stratum 

(no-interaction  assumption, 
Section  IV) 


(Partial)  likelihood  function: 
L  =L\  x  L2  x  ...  x 


Strata:  1 

2 

..  k * 

Likelihood:  L\ 

l2 

..  Lk. 

Hazard:  h\{t,  X) 

h2(t,X)  . 

..  hk,(t,Xn 

In  general,  the  stratification  variable  Z*  will 
have  k*  categories,  where  k*  is  the  total  number 
of  combinations  (or  strata)  formed  after  cate¬ 
gorizing  each  of  the  Z’s.  In  the  above  example, 
k*  is  equal  to  6. 

We  now  present  the  general  hazard  function 
form  for  the  stratified  Cox  model,  as  shown 
here.  This  formula  contains  a  subscript  g 
which  indicates  the  gth  stratum.  The  strata 
are  defined  as  the  different  categories  of  the 
stratification  variable  Z* ,  and  the  number  of 
strata  equals  k*. 

Note  that  the  variable  Z*  is  not  explicitly 
included  in  the  model  but  that  the  I’s,  which 
are  assumed  to  satisfy  the  PH  assumption,  are 
included  in  the  model. 

Note  also  that  the  baseline  hazard  function 
h0git)  is  allowed  to  be  different  for  each  stra¬ 
tum.  However,  the  coefficients  pi,  p2). . .,  PA,  are 
the  same  for  each  stratum. 

As  previously  described  by  example,  the  fitted 
SC  model  will  yield  different  estimated  survival 
curves  for  each  stratum  because  the  baseline 
hazard  functions  are  different  for  each  stratum. 

However,  because  the  coefficients  of  the  V’s  are 
the  same  for  each  stratum,  estimates  of  hazard 
ratios  are  the  same  for  each  stratum.  This  latter 
feature  of  the  SC  model  is  what  we  previously 
have  called  the  "no-interaction”  assumption  to 
be  discussed  further  in  Section  IV. 

To  obtain  estimates  of  the  regression  coeffi¬ 
cients  Pi,  p2,...,  Pp,  we  maximize  a  (partial) 
likelihood  function  L  that  is  obtained  by  multi¬ 
plying  together  likelihood  functions  for  each 
stratum,  as  shown  here.  Thus,  L  is  equal  to 
the  product  of  Li  times  L2,  and  so  on,  up  until 
Lk* ,  where  the  subscripted  L’s  denote  the  like¬ 
lihood  functions  for  different  strata,  with  each 
of  these  L’s  being  derived  from  its  corres¬ 
ponding  hazard  function. 
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IV.  The  No-Interaction 
Assumption  and  How 
to  Test  It 


Stratified  Cox  model 
hg(t,X)  =  h0g(t)ex p  [PjXj  +  p2X2 

+  •  ■  •  +  PpXp] 

p  coefficients  do  not  vary  over 
strata  (no-interaction  assumption) 


We  previously  pointed  out  that  the  SC  model 
contains  regression  coefficients,  denoted  as  p’s, 
that  do  not  vaiy  over  the  strata.  We  have  called 
this  property  of  the  model  the  “no-interaction 
assumption.”  In  this  section,  we  explain  what 
this  assumption  means.  We  also  describe  how 
to  evaluate  the  assumption  and  what  to  do  if 
the  assumption  is  violated. 


•  how  to  evaluate 

•  what  to  do  if  violated 


EXAMPLE 


No-interaction  SC  model: 

Stratified  Cox  regression 
Analysis  time_t:  survt 

Std.  Haz. 

Coef.  Err.  p>|z|  Ratio  [95%  Conf.  Interval] 

log  WBC  I"  1.390]  0.338  0.000  4.016  2.072  7.783 

Rx  jo.931  J  0.472  0.048  2.537  1.006  6.396 

No.  of  subjects  =  42  Log  likelihood  =  —57.560  Stratified  by  sex 

Interaction  by  fitting  separate  models: 
Cox  regression  (Females) 

Analysis  time  _t:  survt 

Column  p-  p 

name  Coeff.  StErr.  value  HR  0.95  Cl  (PH) 

log  WBC  f  1.639]  0.519  0.002  5.150  1.862  14.242  0.228 

Rx  [  1.859  J  0.729  0.011  6.418  1.537  26.790  0.603 

No.  of  subjects  =  20  Log  likelihood  =  -22.100 

Cox  regression  (Males) 

Analysis  time  _t:  survt 

Column  p-  P 

name  Coeff.  StErr.  value  HR  0.95  Cl  (PH) 

log  WBC  (l. 170]  0.499  0.019  3.222  1.213  8.562  0.674 

Rx  jp.267  J  0.566  0.637  1.306  0.431  3.959  0.539 

No.  of  subjects  =  22  Log  likelihood  =  —33.736 

Which  model  is  more  appropriate 
statistically? 


We  return  to  the  SC  output  previously  illu¬ 
strated.  Notice  that  only  one  set  of  coefficients, 
namely,  1.390  for  log  WBC  and  0.931  for  Rx, 
are  provided,  even  though  there  are  two  strata, 
one  for  females  and  one  for  males.  These  results 
assume  no  interaction  of  the  sex  variable  with 
either  log  WBC  or  Rx. 

If  we  allow  for  interaction,  then  we  would 
expect  to  obtain  different  coefficients  for  each 
of  the  (SEX)  strata.  This  would  happen  if  we  fit 
separate  hazard  models  to  the  female  and  male 
data,  with  each  model  containing  the  log  WBC 
and  Rx  variables.  The  computer  results  from 
fitting  separate  models  are  shown  here. 

Notice  that  the  coefficient  of  log  WBC  is  1.639 
for  females  but  is  1.170  for  males.  Also,  the 
coefficient  for  Rx  is  1.859  for  females  but 
0.267  for  males.  These  results  show  different 
coefficients  for  females  than  for  males,  partic¬ 
ularly  for  the  Rx  variable. 

But  are  corresponding  coefficients  statistically 
different?  That  is,  which  model  is  more  appro¬ 
priate  statistically,  the  no-interaction  model  or 
the  interaction  model?  To  answer  this  ques¬ 
tion,  we  must  first  look  at  the  hazard  function 
model  for  the  interaction  situation. 
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EXAMPLE:  (continued) 


Interaction  model: 

(♦)  hg  (t,X) 

=  h0g  (f)exp(Pig  log  WBC  +  p2g.Rx] 
where  g  =  1  (females),  g  =  2  (males) 

No-interaction  model: 
hg  (f,X) 

=  h0g  (f)exp[Pi  log  WBC  +  P 2Rx] 
where  g  =  1  (females),  g  =  2  (males) 


Alternative  interaction  model: 

(* )hg(t,X)  =  hog(t)  exp  [0  j  log  WBC 


+  [f2Rx  +  (SEX  x  log  WBC) 
+  184X  (SEX  x  Rx)\ 


where  SEX  = 


1  if  female 
0  if  male 


h0g(t)  are  different  forg  =1,2 
p*  coefficients  do  not  involve  g 


Equivalence  of  models  (♦)  and  (*): 
g  =  1  (females),  so  that  sex  =  1: 
hi(t,X)  =  hoi(t)  exp[0(  log  WBC  +  feRx 


\  +0^(1  x  log  WBC)  +  0J(1  x  Rx) 

=  *01  (?)  exp  (0[  +  0()  log  WBC 


+  (02  +  04 )  |  Rx 


g  =  2  (males),  so  that  sex  =  0: 
h2(t,X)  =  h02(t)  exp[/S\  log  WBC  +  j?2Rx 
+P3  (0  x  log  WBC)  +  0J  (0  x  Rx)] 
=  /102(f)  exp  [j/jf]  log  WBC  +  [g]Rx] 


Interaction  models  in  same  format: 
Females  (g  =  1):  /?i(f,X) 

(♦)  =  MO  exP [A  1 1  log  WBC  +  02i Rx] 
(*)  =  A01  (0  exp  f(0i  +  03 )  log  WBC 
+  (02  +  04) Sx] 

Males  (g  =  2):  /j2(?,X) 

(♦)  =  MO  exp[0i2  log  WBC  +  022 Av] 
(*)  =  *02(0  exp [0[  logWBC+0*ff.v] 


One  way  to  state  the  hazard  model  formula 
when  there  is  interaction  is  shown  here  (♦). 
Notice  that  each  variable  in  this  model  has  a 
different  coefficient  for  females  than  for  males, 
as  indicated  by  the  subscript  g  in  the  coeffi¬ 
cients  Pig  and  p2g. 

In  contrast,  in  the  no -interaction  model,  the 
coefficient  (pi)  of  log  WBC  is  the  same  for 
females  and  for  males;  also,  the  coefficient 
(p2)  of  Rx  is  the  same  for  females  and  for  males. 

An  alternative  way  to  write  the  interaction 
model  is  shown  here  (*).  This  alternative 
form  contains  two  product  terms,  SEX  x  log 
WBC  and  SEX  x  Rx,  as  well  as  the  main  effects 
of  log  WBC  and  Rx.  We  have  coded  the  SEX  so 
that  1  denotes  female  and  0  denotes  male. 

In  this  alternative  model,  note  that  although  the 
baseline  hazards  h0g(t)  are  different  for  each 
sex,  the  p*  coefficients  do  not  involve  the  sub¬ 
script  g  and  therefore  are  the  same  for  each  sex. 

Nevertheless,  this  alternative  formula  (★)  is 
equivalent  to  the  interaction  formula  (♦) 
above.  We  show  this  by  specifying  the  form 
that  the  model  takes  for  g  =  1  (females)  and 
g  =  2  (males). 

Notice  that  the  coefficients  of  log  WBC  are 
different  in  each  formula,  namely,  (P\+  PI) 
for  females  versus  P\  for  males. 

Similarly,  the  coefficients  of  Rx  are  different, 
namely,  (P*2  +  />4)  for  females  versus  [>2  for 
males. 

The  preceding  formulae  indicate  that  two  seem¬ 
ingly  different  formulae  for  the  interaction 
model  -  (♦)  versus  (★),  shown  earlier  -  can  be 
written  in  the  same  format.  We  show  these  for¬ 
mulae  here  separately  for  females  and  males. 
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EXAMPLE:  (continued) 


(♦)  (*) 

Females  (g  =  1)  :  JSU  =  p\  +  p\ 

Pll  =  P*2  +  P*4 


(♦)  (*) 
Males  (g  =  2)  :  /?12  =  $ 


P 22  ~  Pi 


Stratified  Cox  regression 
Analysis  time  _t:  survt 


Std.  Haz.  [95%  Conf. 

Coef.  Err:  p>|z|  Ratio  Interval] 


log  WBC 
Rx 

Sex  x  log 
WBC 
Sex  x  Rx 


1.170 

0.267 

0.469 


0.499 

0.566 

0.720 


0.019  3.222 

0.637  1.306 

0.515  1.598 

0.084  4.915 


1.213  8.562 

0.431  3.959 

0.390  6.549 

0.805  30.003 


No.  of  subjects  =  42  Log  likelihood  =  —55.835  Stratified  by  sex 


Females: 


log  WBC 


Pn=\Lm 

(t\  +PI  =  1.170  +  0.469  =  [L6W1 


Rx 


Pn  =  1 1.859’ 

K2  +  K  =  0.267  +  1.592  =ri~8~59~| 


Males: 

log  WBC  Pl2  =  ILI70I  =  /?* 
Rx  P22  =  [pj267]  =  ~p*2 


Interaction  model: 

hg(t,X)  =  hog(t)  exp[fl\  log  WBC  +  P*2Rx 
+  PI  (SEX  x  log  WBC) 

+PI  x  (SEX  x  Rx)] 


Notice  that  for  females,  the  coefficient  Pn  in 
model  (*)  must  be  equivalent  to  (/?]+/? 3)  in 
model  (♦)  because  both  models  have  the  same 
format,  and  both  Pn  and  (p\  +^3)  are  coeffi¬ 
cients  of  the  same  variable,  log  WBC.  Similarly, 
P21  in  model  (*)  is  equivalent  to  (P*2  +  P*4)  in 
model  (♦)  because  both  are  coefficients  of  the 
same  variable,  Rx. 

For  males,  it  follows  in  an  analogous  way,  that 
the  coefficient  Pi2  is  equivalent  to  fi\,  and,  sim¬ 
ilarly,  P22  equals  jf2. 

Here  we  provide  computer  results  obtained 
from  fitting  the  alternative  interaction  model 
(*).  The  estimated  regression  coefficients 

1,  fl2,  jS3, ,  and  [i4,  respectively,  are  circled. 


We  have  indicated  above  that  the  sums  /3,  +  p3 
and  P2  +  [i4  are  equal  to  the  coefficients  and 
P2l,  respectively,  in  the  original  interaction 
model  for  females. 


>\  3|C 

Also,  we  have  indicated  that  and  [>2  are  equal 
to  the  coefficients  / 112  and  j>22 ,  respectively,  in 
the  original  interaction  model  for  the  males. 
The  numerical  equivalences  are  shown  here. 
Note  again  that  the  coefficients  of  log  WBC 
and  Rx  for  females  are  different  from  males, 
as  is  to  be  expected  if  sex  interacts  with  each 
variable. 

We  have  thus  seen  that  the  interaction  model 
can  be  written  in  a  format  that  contains  prod¬ 
uct  terms  involving  the  variable  being  strati¬ 
fied,  SEX,  being  multiplied  by  each  of  the 
predictors  not  being  stratified.  We  show  this 
model  involving  product  terms  again  here. 
We  will  use  this  model  to  describe  a  test  of  the 
no-interaction  assumption. 


Presentation:  IV.  The  No-Interaction  Assumption  and  How  to  Test  It  213 


EXAMPLE:  (continued) 


Testing  the  no- interaction  assumption: 

LR  =  -2  In  Lr  -  (-2  In  LF) 

R  =  reduced  (no-interaction)  model 
F  =  full  (interaction)  model 


LR  ~  under  H0:  no  interaction 
(2  df  because  two  product  terms  tested 
in  interaction  model) 


No  interaction  (reduced  model): 
iOutput:  -2  log  I:  115.1201 
-2  In  Lr 


Interaction  (full  model): 

iOutput:  -2  log X:  11 1.670 j 


-2  In  Lp 


LR  =  115.120  -  111.670  =  3.45 
(P  >  0.05  not  significant). 

Thus,  the  no-interaction  model  is 
acceptable. 


The  test  is  a  likelihood  ratio  ( LR )  test  which 
compares  log-likelihood  statistics  for  the  inter¬ 
action  model  and  the  no-interaction  model. 
That  is,  the  LR  test  statistic  is  of  the  form 
—2  In  Lr  minus  —2  In  LF,  where  R  denotes  the 
reduced  model,  which  in  this  case  is  the  no¬ 
interaction  model,  and  F  denotes  the  full 
model,  which  is  the  interaction  model. 

This  LR  test  statistic  has  approximately  a  chi- 
square  distribution  with  2  degrees  of  freedom 
under  the  null  hypothesis  that  the  no-interac- 
tion  model  is  correct.  The  degrees  of  freedom 
here  is  2  because  there  are  two  product  terms 
being  tested  in  the  interaction  model. 

The  log-likelihood  statistic  for  the  reduced 
model  comes  from  the  computer  output  for 
the  no-interaction  model  and  is  equal  to 
—2  times  —57.560,  or  115.120. 

The  log-likelihood  statistic  for  the  full  model 
comes  from  the  computer  results  for  the  inter¬ 
action  model  and  is  equal  to  -2  times  -55.835, 
or  111.670. 

The  LR  statistic  is  therefore  115.120  minus 
111.670,  which  equals  3.45.  This  value  is  not 
significant  at  the  0.05  level  for  2  degrees  of 
freedom.  Thus,  it  appears  that  despite  the 
numerical  difference  between  corresponding 
coefficients  in  the  female  and  male  models, 
there  is  no  statistically  significant  difference. 
We  can  therefore  conclude  for  these  data  that 
the  no-interaction  model  is  acceptable  (at  least 
at  the  0.05  level). 


Remission  data  example: 

•  described  no-interaction 
assumption 

•  evaluated  assumption  using  LR 
test 

•  provided  interaction  model  if 
needed 


Using  the  remission  data  example,  we  have 
described  the  no-interaction  assumption,  have 
shown  how  to  evaluate  this  assumption  using 
a  likelihood  ratio  test,  and  have  provided  the 
form  of  an  interaction  model  that  should  be 
used  in  case  the  no-interaction  assumption 
does  not  hold.  We  now  describe  this  process 
more  generally  for  any  stratified  Cox  analysis. 


Now,  we  generalize  this  process. 
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Recall  that  the  general  form  of  the  no-interaction 
model  for  the  stratified  Cox  procedure  is  given  as 
shown  here.  This  model  allows  for  several  vari¬ 
ables  being  stratified  through  the  use  of  a  newly 
defined  variable  called  Z* ,  whose  strata  consist 
of  combinations  of  categories  of  the  variables 
being  stratified. 

If,  in  contrast,  we  allow  for  interaction  of  the  Z* 
variable  with  the  X’s  in  the  model,  we  can  write 
the  model  as  shown  here.  Notice  that  in  this 
interaction  model,  each  regression  coefficient 
has  the  subscript  g,  which  denotes  the  gth  stra¬ 
tum  and  indicates  that  the  regression  coeffi¬ 
cients  are  different  for  different  strata  of  Z*. 

Alternative  SC  interaction  model:  An  alternative  way  to  write  the  interaction 

model  uses  product  terms  involving  the  vari- 

•  uses  product  terms  involving  Z*  able  Z*  with  each  of  the  predictors.  However, 

•  define  k*  —  1  dummy  variables  to  write  this  model  correctly,  we  need  to  use 

Zj,  Z|, . . . ,  Z*k,  _ , ,  from  Z*  k*  —  1  dummy  variables  to  distinguish  the  k* 

•  products  of  the  form  Z*  xXjt  categories  of  Z*;  also,  each  of  these  dummy 

where  i  =  1,. . .,  k*  —  1  and  variables,  which  we  denote  as  Zj ,  Z^ ,  ■  ■  ■  ZJ,  _ , , 

j  =  I .  needs  to  be  involved  in  a  product  term  with 

each  of  the  X’s. 

The  hazard  model  formula  alternative  model  is 
shown  here.  Notice  that  the  first  line  of  the 
formula  contains  the  X’s  by  themselves,  the 
next  line  contains  products  of  each  X;  with  Z\, 
the  third  line  contains  the  products  with  Z\, 
and  the  last  line  contains  products  with  Z*k,_v 
Note  also  that  the  subscript  g  occurs  only  with 
the  baseline  hazard  function  h0g(t),  and  is  not 
explicitly  used  in  the  (3  coefficients. 


hg(t,  X)  =  h0g{t)  exp  [jSjXj  4 - h  ppXp 

+  fill  {Z\  x  X[)  + - f  Ppi  (Zj  x  Xp) 

+  Pl2  (Z2  X  Xl)  T - b  Pp2{Z2  X  Xp) 

3 - b  Pi,k‘- 1  (Zj._ i  x  X,)  H - 

XPpje-i  (Zk,_ i  x  Xp)] 
g  =  1, 2, . . . ,  k* ,  strata  defined  from  Z* 


SC  model  allowing  interaction: 

hg(t,X)  =  /zogO)exp[plgXi 

+  P2gX2  +  .  .  .  +  PpgXp] 

g  =  1,  2,. . .,  k* ,  strata  defined 
from  Z* 


No-interaction  SC  model: 

hg(tX)  =  /zogWexptPtXj  +  p2X2 

+  •  •  •  +  PpXp] 

g  =  1,  2,. . .,  k* ,  strata  defined 
from  Z* 
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EXAMPLE:  (Remission  Data) 


Z*  =  sex,  fc*  =  2, 

Zj  =  sex(0,  1), 

Xx  =  log  WBC,  X2  =  Rx(p  =  2) 
hg(t,X)  =  h0g(t)  exp +  p2X2 
+  Pn(z;  xX,) 

+fe(zj  xX2)] 

=  h0g(t)  exp  [/?i  log  WBC 
4-  /J^RX  +  /^(sex  x  log  WBC) 
+/?4(sex  x  R.x)) 

g=l,2. 

Ai  =  Ai 7  P2  —  P2>  Pn  =  Pa  and  /hi  =  P4 


In  our  previous  example  involving  the  remission 
data,  the  stratification  variable  (Z*)  was  the 
variable  SEX,  and  k*  was  equal  to  2;  thus, 
we  have  only  one  dummy  variable  Z\,  which 
uses  a  (0,  1)  coding  to  indicate  sex,  and  we 
have  only  (p  equal  to)  two  predictors:  Xx  equal 
to  log  WBC  and  X2  equal  to  Rx.  The  interaction 
model  is  then  written  in  either  of  the  forms 
shown  here. 

The  latter  version  of  the  interaction  model  is 
what  we  previously  presented  for  the  remission 
data  example.  Because  the  two  versions  pre¬ 
sented  here  are  equivalent,  it  follows  that 

P*\  =  P\ ■  Pi  —  Pi’  Pn  =  Pi'  and  P21  =  Pi- 


We  have  thus  seen  that  the  interaction  model 
can  be  written  in  a  format  that  contains  product 
terms  involving  dummy  variables  (i.e.,  Z* )  for 
the  variable  being  stratified  being  multiplied  by 
each  of  the  predictors  (i.e.,  X,)  not  being  strati¬ 
fied.  We  will  use  this  model  to  describe  a  test  of 
the  no-interaction  assumption. 


Testing  the  no-interaction  assump¬ 
tion: 

LR  =  -2  In  Lr  -  (-2  In  LF) 

R  =  reduced  (no-interaction) 
model 

F  =  full  (interaction)  model 
contains  product  terms 

H0  : 

l  Pl,k*- 1  =  ’  ’  ’  =  Ppp-l  =  0 


P 11  —  •  •  •  —  Ppi  —  0 

Pl2  —  "  *  =  Ppi  —  0 


The  test  is  a  likelihood  ratio  (LR)  test  which 
compares  log  likelihood  statistics  for  the  inter¬ 
action  model  and  the  no-interaction  model. 
That  is,  the  LR  test  statistic  is  of  the  form 
—2  In  Lr  minus  —2  In  LF,  where  R  denotes  the 
reduced  model,  which  in  this  case  is  the  no¬ 
interaction  model,  and  F  denotes  the  full 
model,  which  is  the  interaction  model. 

The  no-interaction  model  differs  from  the 
interaction  model  in  that  the  latter  contains 
additional  product  terms.  Thus,  one  way  to 
state  the  null  hypothesis  of  no  interaction  is 
that  the  coefficients  of  each  of  these  product 
terms  are  all  zero. 


LF  ~  Xp(k>-i)  df 

under  Hq  :  no  interaction 


p(k*  —  1)  gives  number  of  product 
terms  being  tested  in  interaction 
model 


The  LR  test  statistic  has  approximately  a  chi- 
square  distribution  with  p(k*  -  1)  degrees  of 
freedom  under  the  null  hypothesis.  The 
degrees  of  freedom  here  is  p(k*  -  1)  because 
this  value  gives  the  number  of  product  terms 
that  are  being  tested  in  the  interaction  model. 
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EXAMPLE:  (Remission  Data) 


Z*  =  sex,  fc*  =  2, 

Zj  =  sex(0, 1) 

Xr  =  log  WBC,  X2  =  Rx  (p  =  2) 
p(fc*  -  1)  =  2,  so 

Zi?  ~  zldf under  H0:  no  interaction 


Returning  to  the  remission  data  example,  for 
which  p  =  2  and  k*  =  2,  the  value  of  p(k*  —  1)  is 
equal  to  two  times  (2  -  1),  which  equals  two. 
Thus,  to  test  whether  the  SEX  variable  inter¬ 
acts  with  the  log  WBC  and  Rx  predictors,  the 
degrees  of  freedom  for  the  LR  statistic  is  two, 
as  previously  described. 


V.  A  Second  Example 
Involving  Several 
Stratification  Variables 


EXAMPLE 


vets.dat:  survival  time  in  days,  n  =  137 

Veteran’s  Administration  Lung  Cancer  Trial 
Column  1 :  Treatment  (standard  =  1 ,  test  =  2) 
Column  2:  Cell  type  1  (large  =  1,  other  =  0) 
Column  3:  Cell  type  2  (adeno  =  1,  other  =  0) 
Column  4:  Cell  type  3  (small  =  1,  other  =  0) 
Column  5:  Cell  type  4  (squamous  =  1, 
other  =  0) 

Column  6:  Survival  time  (days) 

Column  7:  Performance  status 

(0  =  worst,. . 100  =  best) 

Column  8:  Disease  duration  (months) 
Column  9:  Age 

Column  10:  Prior  therapy  (none  =  0, 
some  =10) 

Column  1 1 :  Status  (0  =  censored,  1  =  died) 

Cox  regression 
Analysis  time_t:  survt 

Std!  Haz!  [95%  Conf.  P~ 

Coef.  Err.  p  >  |z|  Ratio  Interval]  (PH) 

Treatment  0.290  0.207  0.162  1.336  0.890  2.006  0.628 

Large  cell  0.400  0.283  0.157  1.491  0.857  2.594  0.033 

Adeno  cell  1.188  0.301  0.000  3.281  1.820  5.915  0.081 

Small  cell  0.856  0.275  0.002  2.355  1.374  4.037  0.078 

Perf.  Stat  -0.033  0.006  0.000  0.968  0.958  0.978  0.000 

Dis.  Durat.  0.000  0.009  0.992  1.000  0.982  1.018  0.919 

Age  -0.009  0.009  0.358  0.991  0.974  1.010  0.198 

Pr.  Therapy  0.007  0.023  0.755  1.007  0.962  1.054  0.145 

No.  of  subjects  =  137  Log  likelihood  =  -475.180 


Variables  not  satisfying  PH: 

•  cell  type  (3  dummy  variables) 

•  performance  status 

•  prior  therapy  (possibly) 

SC  model:  stratifies  on  cell  type  and 
performance  status 


The  dataset  “vets.dat”  considers  survival  times 
in  days  for  137  patients  from  the  Veteran’s 
Administration  Lung  Cancer  Trial  cited  by 
Kalbfleisch  and  Prentice  in  their  text  ( The  Sta¬ 
tistical  Analysis  of  Survival  Time  Data,  Wiley, 
pp.  223-224,  1980).  The  exposure  variable  of 
interest  is  treatment  status.  Other  variables  of 
interest  as  control  variables  are  cell  type  (four 
types,  defined  in  terms  of  dummy  variables), 
performance  status,  disease  duration,  age,  and 
prior  therapy  status.  Failure  status  is  defined 
by  the  status  variable.  A  complete  list  of  the 
variables  is  shown  here. 


Here  we  provide  computer  output  obtained 
from  fitting  a  Cox  PH  model  to  these  data. 
Using  the  P(PH)  information  in  the  last  col¬ 
umn,  we  can  see  that  at  least  four  of  the  vari¬ 
ables  listed  have  P(PH )  values  below  the  0.100 
level.  These  four  variables  are  labeled  in  the 
output  as  large  cell  (0.033),  adeno  cell  (0.081), 
small  cell  (0.078),  and  Perf.  Stat  (0.000).  Notice 
that  the  three  variables,  large  cell,  adeno  cell, 
and  small  cell,  are  dummy  variables  that  dis¬ 
tinguish  the  four  categories  of  cell  type. 

Thus,  it  appears  from  the  P(PH)  results  that  the 
variables  cell  type  (defined  using  dummy  vari¬ 
ables)  and  performance  status  do  not  satisfy 
the  PH  assumption. 

Based  on  the  conclusions  just  made  about  the 
PH  assumption,  we  now  describe  a  stratified 
Cox  analysis  that  stratifies  on  the  variables,  cell 
type  and  performance  status. 
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EXAMPLE:  (continued) 


Z*  given  by  combinations  of  categories: 

•  cell  type  (four  categories) 

•  performance  status  (interval) 
change  to 

•  PSbin  (two  categories) 

Z*  has  k*  =  4x2  =  8  categories 


Four  other  variables  considered  as  iVs: 

•  treatment  status 

•  disease  duration 

•  age 

•  prior  therapy 


Here,  we  use  treatment  status  and  age 
as  2fs 


Stratified  Cox  regression 
Analysis  time_t:  survt 

Std.  Haz.  [95%  Conf. 

Coef.  Err.  p>|z|  Ratio  Interval] 

Treatment  0.125  0.208  (0.548)(1.134  )  0.753  1.706 

Age  -0.001  0.010  0.807  0.999  0.979  1.019 

No.  of  subjects  =  137  Log  likelihood  =  —262.020  Stratified  by  Z* 

No-interaction  model 

HR=  1.134(P  =  0.548) 

Treatment  effect  (adjusted  for  age  and 
Z*)  is  nonsignificant 


No-interaction  model: 
hs(f,X) 

=  f*Og(f)exp[Pi  Treatment  +  (S2  Age] 
g  =  1,  2,. . 8  (=#  of  strata  defined 
from  Z*) 

Interaction  model: 

hs(t,X) 

=/?0g(f)exp[Pig  Treatment  +  (S2g  Age] 
g=l,2 . 8° 


Because  we  are  stratifying  on  two  variables,  we 
need  to  form  a  single  new  categorical  variable 
Z*  whose  categories  represent  combinations  of 
categories  of  the  two  variables.  The  cell  type 
variable  has  four  categories  by  definition.  The 
performance  status  variable,  however,  is  an 
interval  variable  ranging  between  0  for  worst 
to  100  for  best,  so  it  needs  to  be  categorized. 
We  categorize  this  variable  into  two  groups 
using  a  cutpoint  of  60,  and  we  denote  this 
binary  variable  as  PSbin.  Thus,  the  number  of 
categories  for  our  Z*  variable  is  4  x  2,  or  8;  that 
is,  k*  =  8. 


In  addition  to  the  two  stratification  variables, 
cell  type  and  performance  status,  there  are  four 
other  variables  to  be  considered  as  predictors  in 
the  stratified  Cox  model.  These  are  treatment 
status,  disease  duration,  age,  and  prior  therapy. 

For  illustrative  purposes  here,  we  use  only 
treatment  status  and  age  as  predictors.  The 
other  two  variables,  disease  duration  and 
prior  therapy,  are  considered  in  exercises  fol¬ 
lowing  this  presentation. 

Here  we  show  computer  output  from  fitting  a 
stratified  Cox  model  that  stratifies  on  cell  type 
and  performance  status  using  the  eight-category 
stratification  variable  Z*.  This  model  also 
includes  treatment  and  age  as  predictors.  These 
results  consider  a  no- interaction  model,  because 
only  one  regression  coefficient  is  provided  for 
the  treatment  and  age  predictors.  Notice  that 
the  estimated  hazard  ratio  is  1.134  for  the  effect 
of  the  treatment  variable  adjusted  for  age  and  Z* , 
the  latter  being  adjusted  by  stratification.  The  p- 
value  for  this  adjusted  treatment  effect  is  0.548, 
which  is  highly  nonsignificant. 

The  no- interaction  model  we  have  just  described 
has  the  hazard  function  formula  shown  here. 


To  evaluate  whether  the  no-interaction  model 
is  appropriate,  we  need  to  define  an  interaction 
model  that  allows  different  regression  coeffi¬ 
cients  for  different  strata.  One  way  to  write 
this  interaction  model  is  shown  here. 
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EXAMPLE:  (continued) 


Alternative  interaction  model: 

hg(t,X) 

=  hog(t)  exp[/lj  Treatment 
+  ft  Age 

+/Jj!  (Z*  x  Treatment)  +  •  •  • 

+  /J17(Z)  x  Treatment) 

+ftt  (Zi  x  Age)  •••:  f  ft7 (Z7*  x  Age)] 
g=  1,2, ...  ,8 

Another  version  of  interaction  model: 
Replace  Z), . . .  ,Z)  by 

Z(  =  large  cell  (binary) 

Z2  =  adeno  cell  (binary) 

Zj  =  small  cell  (binary) 

Z4  =  PSbin  (binary) 

Z5  =  Zj  x  Z4 
Zg  =  Z2  x  Z4 
Z)  =  Z3  x  Z4 

hg(t,X)  =  /!o?(f)  exp[/?!  Treatment  +  /i2  Age 

+  fti(tr  Zi)  +  @12 (tr  Z2)  +  fin (t1  zl) 

+  ^i4(trZ4)  +  /t15  (tr  Z)Z4) 

+  16  (b*  Z2Z4)  +  /^17  (tr  Z3Z4) 

+ft,  (AGE  Z\)  +  P22  (AGE  Z*) 

+/t>3  (age  z3*)  +  p24  (age  z*) 

+  p25  (AGE  Z(Z4)  +  /326  (AGE  Z^Z*) 

+ft7  (age  z)z4)] 


An  alternative  version  of  this  interaction  model 
that  involves  product  terms  is  shown  here.  This 
version  uses  seven  dummy  variables  denoted 
as  Z\,Z\  up  through  Z)  to  distinguish  the  eight 
categories  of  the  stratification  variable  Z* .  The 
model  contains  the  main  effects  of  treatment 
and  age  plus  interaction  terms  involving  pro¬ 
ducts  of  each  of  the  seven  dummy  variables 
with  each  of  the  two  predictors. 


Yet  another  version  of  the  interaction  model  is 
to  replace  the  seven  dummy  variables  Z\  to  Z) 
by  the  seven  variables  listed  here.  These  vari¬ 
ables  are  three  of  the  binary  variables  making 
up  the  cell  type  variable,  the  binary  variable  for 
performance  status,  plus  three  product  terms 
involving  each  of  the  cell  type  dummy  variables 
multiplied  by  the  PSbin  dummy  variable  (Z4). 


The  latter  interaction  model  is  shown  here. 
In  this  model,  the  variable  tr  Z\  denotes  the 
product  of  treatment  status  with  the  large  cell 
dummy  Z\,  the  variable  tr  Z?  denotes  the  prod¬ 
uct  of  treatment  status  with  the  adeno  cell  var¬ 
iable  Z*2,  and  so  on.  Also,  the  variable  tr  Z\Z\ 
denotes  the  triple  product  of  treatment  status 
times  the  large  cell  variable  Z\  times  the  PSbin 
variable  Z4,  and  so  on,  for  the  other  triple  prod¬ 
uct  terms  involving  treatment.  Similarly,  for 
the  terms  involving  age,  the  variable  Age  Z\ 
denotes  the  product  of  age  with  Z\,  and  the 
variable  Age  Z\Z\  denotes  the  triple  product  of 
age  times  Z\  times  Z\. 


Note  that  we  are  only  considering  the  interac¬ 
tion  between  the  stratified  variables  and  the 
predictors.  We  could  also  (but  do  not)  consider 
the  interaction  between  the  two  predictors, 
treatment,  and  age. 
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EXAMPLE:  (continued) 


Stratified  Cox  Regression  Analysis  on 
Variable:  Z* 

Response:  Surv.  Time 


Coef. 

Std. 

Err. 

p>M 

Haz. 

Ratio 

[95%  Conf. 
Interval] 

Treatment 

0.286 

0.664 

0.667 

1.331 

0.362 

4.893 

Age 

0.000 

0.030 

0.978 

0.999 

0.942 

1.060 

tr  Zj 

2.351 

1.772 

0.184 

10.495 

0.326 

337.989 

tr  Z\ 

-1.158 

0.957 

0.226 

0.314 

0.048 

2.047 

tr  Z3 

0.582 

0.855 

0.496 

1.790 

0.335 

9.562 

tr  Z4 

-1.033 

0.868 

0.234 

0.356 

0.065 

1.950 

tr  Z\2 

-0.794 

1.980 

0.688 

0.452 

0.009 

21.882 

tr  Z2Z4 

2.785 

1.316 

0.034 

16.204 

1.229 

213.589 

tr  Z3Z4 

0.462 

1.130 

0.683 

1.587 

0.173 

14.534 

Age  Z[ 

0.078 

0.064 

0.223 

1.081 

0.954 

1.225 

Age  Zj 

-0.047 

0.045 

0.295 

0.954 

0.873 

1.042 

Age  Z3 

-0.059 

0.042 

0.162 

0.943 

0.868 

1.024 

Age  Z4 

0.051 

0.048 

0.287 

1.053 

0.958 

1.157 

Age  Z\Z\ 

-0.167 

0.082 

0.042 

0.847 

0.721 

0.994 

Age  Z2Z4 

-0.045 

0.068 

0.511 

0.956 

0.838 

1.092 

Age  Z\Z\ 

0.041 

0.061 

0.499 

1.042 

0.924 

1.175 

No.  of  subjects  =  137  Log  likelihood  =  -249.972  Stratified  by  Z* 


Eight  possible  combinations  ofZj  toZ];: 

*=1:  Zj  =  Zt  =  Zj  =  Z4  =  0 

g  =  2:  Zj  =  1,  Zj  =  Z3  =  Z4  =  0 

g  =  3:  z*  =  i,  z;=z;=z;  =  o 

8  =  4:  z;  =  1,  Zj  =  Zj  =  Z4  =  0 

g  =  5  :  Zj  =  Zj  =  Z3  =  0,  Z4  =  1 

g  =  6 :  zj  =  1,  zj  =  z;  =  0,  Z*  =  1 

g  =  7 :  zj  =  1,  zj  =  z;  =  0,  Z*  =  1 

g  =  8 :  z;  =  1,  ZJ  =  Zj  =  0,  Z*  =  1 

8  —  1  :  Zj  =  Z2  =  Z3  =  Z4  =  0 

(Squamous  cell  type  and  PSbin  =  0) 

All  product  terms  are  zero: 

/ti(CX) 

=  /i0i(f)exp[Pi  Treatment  +  p2  Age], 
where  /ij  =  0.286, 

/)2  =  0.000,  so  that 

hi ( t ,  X)  =  /?oi  (f)  exp[(0.283)  Treatment] 


g  =  2:  Zj  =  1,  Zj  =  Zj  =Z*  =0 

(Large  cell  type  and  PSbin  =  0) 

Nonzero  product  terms  Coefficients 

Age  Zj  =  Age  P21 

tr  Zj  =  Treatment  Pn 


Here  we  provide  the  computer  results  from 
fitting  the  interaction  model  just  described. 
Notice  that  the  first  two  variables  listed  are 
the  main  effects  of  treatment  status  and  age. 
The  next  seven  variables  are  product  terms 
involving  the  interaction  of  treatment  status 
with  the  seven  categories  of  Z*.  The  final 
seven  variables  are  product  terms  involving 
the  interaction  of  age  with  the  seven  categories 
of  Z*.  As  defined  on  the  previous  page,  the 
seven  variables  used  to  define  Z*  consist  of 
three  dummy  variables  Zj .  Zj  and  Zj  for  cell 
type,  a  binary  variable  Z\  for  performance  sta¬ 
tus  and  products  of  Z\  with  each  of  Zj .  Zt , 
and  Z\.  Note  that  once  the  variables  Zj .  Z]  Z\, 
and  Z\  are  specified,  the  values  of  the  three 
product  terms  are  automatically  determined. 

We  can  use  these  results  to  show  that  the  inter¬ 
action  model  being  fit  yields  different  regres¬ 
sion  coefficients  for  each  of  the  eight 
categories  defined  by  the  subscript  g  for  the 
stratification  variable  Z*.  These  eight  cate¬ 
gories  represent  the  possible  combinations  of 
the  four  variables  Z\  to  Zjjj,  as  shown  here. 

Consider  the  hazard  function  when  the  vari¬ 
ables  Z\  through  Zjj  are  all  equal  to  zero.  This 
stratum  is  defined  by  the  combination  of  squa¬ 
mous  cell  type  and  a  binary  performance  status 
value  of  0.  In  this  case,  all  product  terms  are 
equal  to  zero  and  the  hazard  model  contains 
only  the  main  effect  terms  treatment  and  age. 
The  estimated  hazard  function  for  this  stratum 
uses  the  coefficients  0.286  for  treatment  and 
0.000  for  age,  yielding  the  expression  shown 
here.  Note  that  age  drops  out  of  the  expression 
because  its  coefficient  is  zero  to  three  decimal 
places. 

Now  consider  the  hazard  function  when 
the  variable  Z\  equals  1  and  Zj,  through  Z\  are 
equal  to  zero.  This  stratum  is  defined  by  the 
combination  of  large  cell  type  and  a  PSbin 
value  of  0.  In  this  case,  the  only  nonzero  prod¬ 
uct  terms  are  Age  Z\  and  tr  Z\,  whose  coeffi¬ 
cients  are  p2i  and  pn,  respectively. 
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EXAMPLE:  (continued) 


^2(TX)  =  ^o2(f)exp[(p1  +  Pn)  Treatment 
+  (P2  +  P21)  Age] 

Pi  =  0.286,  P2  =  0.000 
Pn  =2.351,  p2l  =0.078 

Hazard  functions  for  interaction  model: 
g=l:(ZJ=Z*=Z3*  =  Z*=0): 

/ii(f,X)  =/;oi(f)exp[(0.286)  Treatment] 
g  =  2:(Z]  =  l,Z*=Z3*=Z]  =  0): 

/;2(f,X)  =/jo2(f)exp[(2.637)  Treatment 
+  (0.078)Age] 

g  =  3:(Z*  =  l,Z]  =  Z3*=Z]  =  0): 

/?3  (t,  X)  = /103(f)  exp  [(— 0.872)  Treatment 
+  (— 0.047)Age] 

g=4:(z;  =  i,zT=z*=z;=o): 

/14  (r,  X)  = /104(f)  exp[(0.868)  Treatment 
+  (— 0.059)Age] 

g  =  5:(Z]  =  Z*=Z3*=0,Z*  =  l): 

/15  (f ,  X)  =/io5(f)exp[(0.747)  Treatment 
+  (— 0.051)Age] 

g  =  6:  {Z\  =  1,  Z*  =  Z3*  =0,  Z4*  =  1) : 

/ie(f,  X)  =/io6(f)exp[(0.810)  Treatment 
+  (— 0.038)Age] 

g  =  7:(Z*  =  l,Z]=Z3*=0,Z4*  =  l): 

/i7 (f , X)  =/io7(f)exp[(0.880)  Treatment 
+  (— 0.041)Age] 

g  =  8 :  (Z3*  =  1,  Z]  =  Z*  =0,  Z4*  =  1) : 

/i8(f,X)  =/i08(f)exp[(0.297)  Treatment 
+  (0.033)Age] 

LR  test  to  compare  no-interaction 
model  with  interaction  model: 


H0:  no-interaction  model  acceptable,  i.e., 
Treatment:  Pn  =  pi2  =  ■■■  =  p17  =  0 
and  Age:  p21  =  p22  =  •••  =  P27  =  0 


14  coefficients  =4-  df  =  14 
LR  =  -2  In  Lr  -  (2  In  LF) 

R  =  reduced  (no-interaction)  model 
F  =  full  (interaction)  model 


The  hazard  function  for  this  second  stratum  is 
shown  here.  Notice  that  the  coefficients  of  the 
treatment  and  age  variables  are  (pi  +  Pn)  and 
(P2  +  p2t)»  respectively.  The  estimated  values  of 
each  of  these  coefficients  are  given  here. 


The  corresponding  estimated  hazard  function 
for  the  second  stratum  (i.e.,  g  =  2)  is  shown 
here.  For  comparison,  we  repeat  the  estimated 
hazard  function  for  the  first  stratum. 

The  estimated  hazard  functions  for  the  remain¬ 
ing  strata  are  provided  here.  We  leave  it  up  to 
the  reader  to  verify  these  formulae.  Notice  that 
the  coefficients  of  treatment  are  all  different  in 
the  eight  strata,  and  the  coefficients  of  age  also 
are  all  different  in  the  eight  strata. 


We  have  presented  computer  results  for  both  the 
no-interaction  and  the  interaction  models.  To 
evaluate  whether  the  no-interaction  assumption 
is  satisfied,  we  need  to  carry  out  a  likelihood 
ratio  test  to  compare  these  two  models. 

The  null  hypothesis  being  tested  is  that  the  no¬ 
interaction  model  is  acceptable.  Equivalently, 
this  null  hypothesis  can  be  stated  by  setting  the 
coefficients  of  all  product  terms  in  the  interac¬ 
tion  model  to  zero.  That  is,  the  seven  coefficients 
of  product  terms  involving  treatment  and  the 
seven  coefficients  of  the  product  terms  involving 
age  are  set  equal  to  zero  as  shown  here. 

Because  the  null  hypothesis  involves  14  coeffi¬ 
cients,  the  degrees  of  freedom  of  the  LR  chi- 
square  statistic  is  14.  The  test  statistic  takes  the 
usual  form  involving  the  difference,  between  log- 
likelihood  statistics  for  the  reduced  and  full  mod¬ 
els,  where  the  reduced  model  is  the  no-interaction 
model  and  the  full  model  is  the  interaction  model. 
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EXAMPLE  (continued) 


LR  ~  Zi4df  under  H0:  no  interaction 
LR  =  (-2  x  -262.020)  -  (-2  x  -249.972) 
=  524.040  -  499.944  =  24.096 
P  =  0.045  (significant  at  0.05) 
Conclusion: 

Reject  H0:  interaction  model  is 
preferred. 


Might  use  further  testing  to  simplify 
interaction  model,  e.g.,  test  for  seven 
products  involving  treatment  or  test 
for  seven  products  involving  age. 


Thus,  under  the  null  hypothesis,  the  LR  statistic 
is  approximately  chi-square  with  14  degrees  of 
freedom. 

The  computer  results  for  the  no-interaction  and 
interaction  models  give  log-likelihood  values 
of  524.040  and  499.944,  respectively.  The  differ¬ 
ence  is  24.096.  A  chi-square  value  of  24.096  with 
14  degrees  of  freedom  yields  a  p-value  of  0.045, 
so  that  the  test  gives  a  significant  result  at  the 
0.05  level.  This  indicates  that  the  no-interaction 
model  is  not  acceptable  and  the  interaction 
model  is  preferred. 

Note,  however,  that  it  may  be  possible  from 
further  statistical  testing  to  simplify  the  inter¬ 
action  model  to  have  fewer  than  14  product 
terms.  For  example,  one  might  test  for  only 
the  seven  product  terms  involving  treatment 
or  only  the  seven  product  terms  involving  age. 


VI.  A  Graphical  View 
of  the  Stratified  Cox 
Approach 

a.  h(t)  =  hoMexpfpjRX  +  p2SEX) 
ln(-ln  S(t))  =  ln(— In  S0(t)) 

+  p!  RX  +  p2SEX 


In  this  section  we  examine  four  log-log  survival 
plots  illustrating  the  assumptions  underlying  a 
stratified  Cox  model  with  or  without  interac¬ 
tion.  Each  of  the  four  models  considers  two 
dichotomous  predictors:  treatment  (coded 
RX  =  1  for  placebo  and  RX  =  0  for  new  treat¬ 
ment)  and  SEX  (coded  0  for  females  and  1  for 
males).  The  four  models  are  as  follows  (see  left). 


males,  RX  =1  cl. 


This  model  assumes  the  PH  assumption  for 
both  RX  and  SEX  and  also  assumes  no 
interaction  between  RX  and  SEX.  Notice 
all  four  log-log  curves  are  parallel 
(PH  assumption)  and  the  effect  of 
treatment  is  the  same  for  females  and  males 
(no  interaction).  The  effect  of  treatment 
(controlling  for  SEX)  can  be  interpreted 
as  the  distance  between  the  log-log  curves 
from  RX  =  1  to  RX  =  0,  for  males  and  for 
females,  separately. 
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b.  h(t)  =  ho(t)  expCpj  RX  +  p2SEX  b. 
+  P3  RX  x  SEX) 
ln(-ln  S(t))  =  ln(— In  S0(t)) 

+  PiRX  +  p2SEX  +  p3RX  x  SEX 


c.  h(t)  =  h0g(t)exp(p1RX)  c. 

(g  =  1  for  males,  g  =  0  for  females) 
ln(-ln  S(t))  =  ln(-ln  S0g(t)) 

+  PiRX 


d.  h(t)  =  h0g(t)exp(p1RX  d. 

+  p2RX  x  SEX) 

(g  =  1  for  males,  g  =  0  for  females) 
ln(-ln  S(t))  =  ln(-ln  S0g(t)) 

+  PjRX  +  p2RX  x  SEX 

>  males,  RX  =  1 

ln(-lnS(t))  males,  RX  =  0 


This  model  assumes  the  PH  assumption 
for  both  RX  and  SEX  and  allows  for 
interaction  between  these  two  variables. 

All  four  log-log  curves  are  parallel 
(PH  assumption)  but  the  effect  of  treatment 
is  larger  for  males  than  females  as  the 
distance  from  RX  =  1  to  RX  =  0  is  greater 
for  males. 


This  is  a  stratified  Cox  model  in  which  the 
PH  assumption  is  not  assumed  for  SEX. 
Notice  the  curves  for  males  and  females 
are  not  parallel.  However,  the  curves  for 
RX  are  parallel  within  each  stratum  of 
SEX  indicating  that  the  PH  assumption  is 
satisfied  for  RX.  The  distance  between 
the  log-log  curves  from  RX  =  1  to  RX  = 

0  is  the  same  for  males  and  females 
indicating  no  interaction  between  RX  and 
SEX. 


This  is  a  stratified  Cox  model  allowing  for 
interaction  of  RX  and  SEX.  The  curves  for 
males  and  females  are  not  parallel  although 
the  PH  assumption  is  satisfied  for  RX 
within  each  stratum  of  SEX. The  distance 
between  the  log-log  curves  from  RX  =  1  to 
RX  =  0  is  greater  for  males  than  females 
indicating  interaction  between  RX  and  SEX. 


females,  RX  -  1 
females.  RX  -  0 
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VII.  The  Stratified  Cox 
Likelihood 

•  Cox  PH  Likelihood  (L)  described 
in  Chapter  3,  Section  VIII 

•  L  now  extended  for  SC  model 


At  the  end  of  the  presentation  from  Chapter  3 
(Section  VIII),  we  illustrated  the  Cox  likelihood 
using  the  dataset  shown  on  the  left.  In  this 
section,  we  extend  that  discussion  to  illustrate 
the  likelihood  for  a  stratified  Cox  model. 


ID  TIME  STATUS 


Barry  2  1 

Gary  3  1 

Harry  5  0 

Larry  8  1 


SMOKE 

1 

0 

0 

1 


TIME  =  Survival  time  (in  years) 
STATUS  =  1  for  event,  0  for 
censorship 

SMOKE  =  1  for  a  smoker,  0  for  a 
non-smoker 


To  review:  The  data  indicate  that  Barry  got  the 
event  at  TIME  =  2  years.  Gary  got  the  event  at 
3  years,  Harry  was  censored  at  5  years  and 
Larry  got  the  event  at  8  years.  Furthermore, 
Barry  and  Larry  were  smokers  while  Gary  and 
Harry  were  non-smokers. 


Cox  PH  model:  h(t)  =  ho(t)e^lSMOKE 
Cox  PH  Likelihood: 


In  Chapter  3,  we  constructed  the  Cox  likelihood 
with  one  predictor,  SMOKE,  in  the  model.  The 
model  and  the  likelihood  are  shown  on  the  left. 


h0(t)ep' 

h0(t)el!i  +  ho(t)e°  +  h0(t)e°  +  A0  ( 0 


' _ h0(t)e° _ 

ho(t)e°  +  ho(t)e°  +  ho(t)eP\ 


~h0(t)eP' 
h0{t)eP  i 


The  likelihood  is  a  product  of  three  terms,  one 
term  for  each  event  time  tf  (TIME  =  2,3,  and  8). 
The  denominator  of  each  term  is  the  sum  of  the 
hazards  from  the  subjects  still  in  the  risk  set  at 
time  tf,  including  the  censored  subject,  Harry. 
The  numerator  of  each  term  is  the  hazard  of 
the  subject  who  got  the  event  at  tf.  The  reader 
may  wish  to  reread  Section  VIII  of  Chapter  3. 


Stratified  Cox  model: 

hg{t)  =  h0g{t)ep'SMOKE 

g=  1  history  of  hypertension 
g=2  no  history  of  hypertension 


Now  consider  a  stratified  Cox  model,  in  which 
the  stratified  variable  is  a  dichotomous  indi¬ 
cator  of  whether  the  subject  has  or  does  not 
have  a  history  of  hypertension.  The  predictor 
SMOKE  remains  in  the  model  (shown  at  left). 


PH  assumption: 

•  Violated  overall 

•  Assumed  to  hold  within 
categories  of  stratified  variable 


The  model  allows  for  a  violation  of  the  PH 
assumption.  However,  within  each  category  of 
hypertension,  the  PH  assumption  is  assumed 
to  hold. 
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ID 

TIME  STATUS 

SMOKE 

HT 

Barry 

2  1 

1 

1 

Gary 

3  1 

0 

2 

Harry 

5  0 

0 

1 

Larry 

8  1 

1 

2 

HT  = 

History  of  hypertension 

(l=yes,  2=no) 

Formulate  likelihood 
Data  for  HT=1 

in  pieces: 

ID 

TIME  STATUS 

SMOKE 

HT 

Barry 

2  1 

1 

1 

Harry 

5  0 

0 

1 

Data  for  HT=2 

ID 

TIME  STATUS 

SMOKE 

HT 

Gary 

3  1 

0 

2 

Larry 

8  1 

1 

2 

Stratified  Cox  model: 
hg(t)  =  h0g(t)e^SMOKE 

g=  1  history  of  hypertension 
g=2  no  history  of  hypertension 

Among  HT=1 

Barry  has  only  event 

Barry  and  Harry  at  risk  at 

TIME=2 


Li  -  ,l0ieP' 

Ime^  +  hoie0 


Among  HT=2 

Gary  and  Larry  both  get  events 
Gary  and  Larry  at  risk  at 
TIME=3 

Larry  at  risk  at  TIME=8 

,  h02e°  h02efl 

/jmf0  +  lioieP'  h  02e/il 


The  data  are  shown  on  the  left.  The  additional 
variable  is  HT  which  is  the  variable  we  wish  to 
stratify  on.  Barry  and  Harry  have  a  history  of 
hypertension  (coded  HT=1)  while  Gary  and 
Larry  do  not  (coded  HT=2). 


The  stratified  Cox  likelihood  is  formulated  in 
pieces.  Each  piece  represents  a  stratified  cate¬ 
gory.  Within  each  piece,  the  likelihood  is  for¬ 
mulated  similarly  as  the  likelihood  formulated 
for  a  Cox  PH  model. 


The  first  piece  (stratified  category)  is  defined 
among  observations  in  which  HT=1.  The  sec¬ 
ond  piece  is  defined  among  observations  in 
which  HT=2. 


Here,  for  convenience,  we  again  show  you  the 
SC  model  for  these  data  on  the  left. 


For  the  first  stratum  (HT=1),  Barry  gets  an 
event  at  TIME=2.  Barry  and  Harry  are  in  the 
risk  set  when  Barry  gets  his  event.  Barry  is  the 
only  event  from  this  stratum  as  Harry  is  cen¬ 
sored  at  TIME=5.  Therefore  the  likelihood 
for  this  piece,  Lt  (shown  at  left),  contains  one 
term.  That  one  term  has  Barry’s  hazard  in 
the  numerator.  Barry’s  and  Harry’s  hazard  are 
summed  in  the  denominator. 

For  the  second  stratum  (HT=2),  Gary  gets  an 
event  at  TIME=3.  Gary  and  Larry  are  in  the 
risk  set  when  Gary  gets  his  event.  Larry  gets 
an  event  at  TIME=8  and  is  the  only  one  in  the 
risk  set  at  the  time  of  his  event.  Therefore,  the 
likelihood  for  this  piece,  L2  (shown  at  left),  is  a 
product  of  two  factors.  Each  of  the  two  factors 
corresponds  to  an  event. 
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L  =  Li  x  L.2 
h0leP' 

hQXe^  +  h0ie° 


*02  h  02  e^1 

hQ2e°  +  *02  hmeP\ 


The  stratified  Cox  likelihood  can  be  formulated 
by  taking  the  product  of  each  piece  (Lj  and  L2). 
Each  piece  was  formulated  within  a  stratum. 


The  baseline  hazard  cancels  in  L 


L  = 


eh 

eft  4-  e° 


x 


Notice  that  the  baseline  hazard  cancels  from 
the  likelihood.  As  with  the  Cox  PH  model,  the 
stratified  likelihood  is  just  determined  by  the 
order  of  events  (not  the  baseline  hazard). 


■  e0  ef1- 
e°  +  eft  X  eft 

\  / 

Same  (h  in  each  piece 

No  interaction  model: 

Effect  of  smoking  is  same  for 
those  with  or  without  hyper¬ 
tension 


L  = 


eft 


One  additional  point  about  this  model  is  that 
the  Pi  in  the  first  piece  of  the  likelihood  (Ld  is 
the  same  Pi  that  is  in  the  second  piece  of  the 
likelihood  (L2).  In  other  words,  this  is  a  no 
interaction  model.  The  effect  of  smoking 
(expressed  as  Pi)  does  not  depend  on  hyperten¬ 
sion  status. 


VIII.  Summary 


Stratified  Cox  (SC)  model: 

•  stratification  of  predictors  not 
satisfying  PH  assumption 

•  includes  predictors  satisfying 

PH 

•  does  not  include  stratified 
variables 


We  now  summarize  the  most  important  features 
of  the  stratified  Cox  (SC)  model  described  in  this 
presentation. 

The  SC  model  is  a  modification  of  the  Cox  PH 
model  to  allow  for  control  by  “stratification”  of 
predictors  not  satisfying  the  PH  assumption. 
Variables  that  are  assumed  to  satisfy  the 
assumption  are  included  in  the  model  as  pre¬ 
dictors;  the  stratified  variables  are  not  included 
in  the  model. 


Computer  Results 


Stratified  Cox  regression 
Analysis  time  _t:  survt 


Std.  Haz.  [95%  Conf. 

Coef.  Err.  p  >  |z|  Ratio  Interval] 

log 

WBC  1.390  0.338  0.000  4.016  2.072  7.783 

RX  0.931  0.472  0.048  2.537  1.006  6.396 

No.  of  Log  likelihood  Stratified 

subjects  =  42  =  —57.560  by  sex 


The  computer  results  for  a  SC  model  provides 
essentially  the  same  type  of  output  as  provided 
for  a  Cox  PH  model  without  stratification. 
An  example  of  SC  output  using  the  remission 
data  is  shown  here.  The  variables  included  as 
predictors  in  the  model  are  listed  in  the  first 
column  followed  by  their  estimated  coeffi¬ 
cients,  standard  errors,  p-values,  hazard  ratio 
values,  and  95%  confidence  limits.  Such  infor¬ 
mation  cannot  be  provided  for  the  variables 
being  stratified,  because  these  latter  variables 
are  not  explicitly  included  in  the  model. 
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Hazard  function  for  no-interaction 
stratified  Cox  model: 


hg  (t,X)  =  VWexptpA  +  P2^2 

+  •  ■  ■  +  PpXp] 

g  =  1,2,...,  k* ,  strata  defined 
from  Z* 

Z*  has  k*  categories 
,  X2t. . .,  Xp  satisfy  PH 

Stratification  variable  Z*\ 


•  identify  Z\,  Z2,. . .,  Z^  not 
satisfying  PH 

•  categorize  each  Z 

•  form  combinations  of 
categories  (strata) 

•  each  combination  is  a  stratum 
of  Z* 

No-interaction  model: 

Same  coefficients  p1(  p2,...,  p/;  for 
each  g,  i.e.,  Z*  does  not  interact 
with  the  Xs. 


Different 

baselines 


*01  (t)  => 

Sib) 

*02  (t)  => 

*O/t(0  => 

Sk'(t) 

Different 

survival 

curves 


HR  same  for  each  stratum 


(Partial)  likelihood  function: 
L  =  L\  x  L2  x  . . .  x  Lfc. 


Stratified  Cox  model  allowing 
interaction: 


hg(t,X)  =  h0g(t)ex  pfPigXj  +  p2gX2 

+  •  •  ■  +  PpgXp] 

g=  1,2  ,...,k*,  strata  defined  from  Z* . 


The  general  hazard  function  form  for  the  no¬ 
interaction  stratified  Cox  model  is  shown  here. 
This  formula  contains  a  subscript  g  that  indi¬ 
cates  the  gth  stratum,  where  the  strata  are  dif¬ 
ferent  categories  of  the  stratification  variable 
Z*  and  the  number  of  strata  equals  k*.  Notice 
that  the  baseline  hazard  functions  are  different 
in  each  stratum. 


The  variable  Z*  is  defined  by  first  identifying 
the  Z|  variables  not  satisfying  the  PH  assump¬ 
tion.  We  then  categorize  each  Z  and  form  com¬ 
binations  of  categories  of  each  of  the  Z’s.  Each 
combination  represents  a  different  stratum 
making  up  the  variable  Z*. 


The  above  model  is  designated  as  a  “no¬ 
interaction”  model  because  the  p’s  in  the  model 
are  the  same  for  each  subscript  g.  The  no¬ 
interaction  assumption  means  that  the  variables 
being  stratified  are  assumed  not  to  interact  with 
theWs  in  the  model. 


For  the  no-interaction  model,  the  fitted  SC 
model  will  yield  different  estimated  survival 
curves  for  each  stratum  because  the  baseline 
hazard  functions  are  different  for  each  stratum. 

However,  because  the  coefficients  of  the  X’s  are 
the  same  for  each  stratum,  estimates  of  hazard 
ratios  are  the  same  for  each  stratum. 

Regression  coefficients  in  the  SC  model  are 
estimated  by  maximizing  a  partial  likelihood 
function  that  is  obtained  by  multiplying  likeli¬ 
hood  functions  for  each  stratum. 

In  order  to  evaluate  the  no-interaction  assump¬ 
tion,  we  must  define  an  interaction  model  for 
comparison.  One  version  of  the  interaction 
model  is  shown  here.  This  version  shows  regres¬ 
sion  coefficients  with  different  subscripts  in  dif¬ 
ferent  strata;  that  is,  each  p  coefficient  has  a 
subscript  g. 
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Alternative  stratified  Cox  inter¬ 
action  model: 

•  uses  product  terms  involving  Z* 

•  define  k*  —  1  dummy  variables 
from  Z* 

•  products  of  the  form  Z*  x  Xj 


An  alternative  way  to  write  the  interaction 
model  uses  product  terms  involving  the  Z*  var¬ 
iable  with  each  predictor.  This  model  uses  k*  —  1 
dummy  variables  to  distinguish  the  k*  cate¬ 
gories  of  Z* .  Each  of  these  dummy  variables  is 
included  as  a  product  term  with  each  of  the  X’s. 


Testing  the  no-interaction  assump¬ 
tion: 


LR  =  -2  In  Lr  -  (-2  In  LF) 

R  =  reduced  (no-interaction)  model 
F  =  full  (interaction)  model 
contains  product  terms 
LR  ~  y2p(k>-v)dt  under  H0:  no 
interaction 


To  evaluate  the  no-interaction  assumption, 
we  can  perform  a  likelihood  ratio  test  that  com¬ 
pares  the  (reduced)  no-interaction  model  to 
the  (full)  interaction  model.  The  null  hypo¬ 
thesis  is  that  the  no-interaction  assumption  is 
satisfied.  The  test  statistic  is  given  by  the  dif¬ 
ference  between  the  log-likelihood  statistics  for 
the  no-interaction  and  interaction  models.  This 
statistic  is  approximately  chi-square  under 
the  null  hypothesis.  The  degrees  of  freedom  is 
p(k*  —  1)  where  p  denotes  the  number  of  X’s 
and  k*  is  the  number  of  categories  making  up  Z* . 


Presentation  Complete!  This  presentation  is  now  complete.  We  suggest 

that  the  reader  review  this  presentation  using 
Chapter'S  the  detailed  outline  that  follows.  Thenanswerthe 

practice  exercises  and  the  test  that  follow. 

The  next  Chapter  (6)  is  entitled  “Extension 
of  the  Cox  PH  Model  for  Time-Dependent 
Variables.”  There  we  show  how  an  “extended” 
Cox  model  can  be  used  as  an  alternative  to  the 
stratified  Cox  model  when  one  or  more  pre¬ 
dictors  do  not  satisfy  the  PH  assumption.  We 
also  discuss  more  generally  what  is  a  time- 
dependent  variable,  and  show  how  such  a  vari¬ 
able  can  be  evaluated  using  an  extended  Cox 
model. 

6.  Extension  of  the  Cox 

Proportional  Hazards  Model 
for  Time-Dependent  Variables 


1 .  Introduction  to  Survival 
Analysis 

2.  Kaplan-Meier  Survival  Curves 
and  the  Log-Rank  Test 

3.  The  Cox  Proportional  Hazards 
Model  and  Its  Characteristics 

4.  Evaluating  the  Proportional 

Hazards  Assumption _ 

V5.  (The  Stratified  Cox  Procedure) 

Next: 
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Detailed 

Outline 


I.  Preview  (page  204) 

A.  Focus  on  how  stratified  Cox  (SC)  procedure  is 
carried  out: 

•  analysis  of  computer  results  from  SC 
procedure; 

•  hazard  function  for  SC  model; 

•  stratifying  on  a  single  predictor  versus  two 
or  more  predictors; 

•  no-interaction  versus  interaction  models. 


II.  An  Example  (pages  204-208) 

A.  Cox  PH  results  for  remission  data  yield  P{PH)  = 
0.031  for  SEX. 

B.  SC  model  used:  control  for  SEX  (stratified); 
include  log  WBC  and  Rx  in  model. 

C.  Analysis  of  Rx  effect  from  stratified  Cox  results: 
HR  =  2.537;  95%  Cl:  (1.006,6.396);  LR  and  Wald 
tests:  P  <  0.05. 

D.  Hazard  model:  hg(t,X )  =  /j0g(f)exp[p1  log  WBC 

+  P  2Rx],g=  1,2 

•  different  baseline  hazard  functions  and 
survival  curves  for  females  and  males; 

•  same  coefficients  [S,  and  p2  for  both  females 
and  males  (no-interaction  assumption); 

•  obtain  estimates  by  maximizing  partial 
likelihood  L  =  Lx  x  L2. 

E.  Graph  of  four  adjusted  survival  curves  for  Rx 
(adjusted  for  log  WBC). 


III.  The  General  Stratified  Cox  (SC)  Model 

(pages  208-209) 


A. 


hg(t,X)  —  /i0g(t)exp[PiXi  +  P2-X2  +  . . .  +  PpXp], 

g=  1,2,...,** 


where  the  strata  are  defined  from  the 
stratification  variable  Z* . 

B.  Z*  defined  from  Z\,Z2t. .  .,Z^  variables  that  do 
not  satisfy  PH: 

•  categorize  each  Z: 

•  form  combinations  of  categories 

•  each  combination  is  a  stratum  of  Z* 


C.  Different  baseline  hazard  functions  and 
survival  curves  for  each  stratum. 


D.  Assumes  no  interaction:  same  coefficients 
Pi,p2,. .  .,Pp  for  each  g;  i.e.,  Z*  does  not  interact 
with  the  X's;  i.e.,  estimated  HR  is  same  for  each 
stratum. 


E.  Obtain  estimates  by  maximizing  partial 

likelihood  L  =  L\  x  L2  x  . . .  x  L where  is 
likelihood  for  zth  stratum. 
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IV.  The  No-Interaction  Assumption  and  How  to 
Test  It  (pages  210-216) 

A.  Assumes  same  coefficients  Pi,p2,.  .  .,Pp  for  each  g. 

B.  Interaction  model: 

hg (t,X)  =  /zogOOexpfpigA-!  +  p2gV2  +  . . .  +  ppgXp], 

g  =  1,  2,. .  ,,k*  strata  defined  from  Z*. 

C.  Alternative  stratified  Cox  interaction  model: 

•  uses  product  terms  involving  Z* 

•  define  k*  —  \  dummy  variables 
Zj ,  Zj , . . .  ,Z|„_,  from  Z* 

•  products  of  the  form  Z*  x  Xj,  where 
i  —  1,...,  k*  —  1;  /  =  1 

•  hazard  function:  g  =  1,2 k*  strata 
defined  from  Z* 

hg(t,X)  =  h0g(t)  exp[/?jX|  + - h  fSpXp  +  /Jn(ZJ  x  X|) 

H - +  Ppi  (Z)  x  Xp)  +  /?12(Z)  x  X|)  + - f  x  Xp) 

“I - h  /h,e-i  (ze-i  x  zi)  H - (ZJ.-1  x  Xp)] 


D.  Testing  the  no-interaction  assumption:  use  LR 
statistic  given  by  LR  =  —2  In  LR  -  (—2  In  LF) 
where  W  =  reduced  (no  interaction)  model  and 
F  =  full  (interaction)  model 
LR  ~  xl(k*-i)d f  uncler  no  interaction,  i.e., 

Pll  =  Pl2  =  •  •  •  =  Pp,fc*-1  =  0 

V.  A  Second  Example  Involving  Several 

Stratification  Variables  (pages  216-221) 

A.  Dataset  “vets.dat”  from  Veteran’s 
Administration  Lung  Cancer  Trial;  n  =  137; 
survival  time  in  days. 

B.  Variables  are:  treatment  status,  cell  type  (four 
types),  performance  status,  disease  duration, 
age,  and  prior  therapy  status. 

C.  Cox  PH  results  indicate  [using  P(PH )]  that  cell 
type  and  performance  status  do  not  satisfy  PH 
assumption. 

D.  Example  stratifies  on  cell  type  and  performance 
status  using  four  categories  of  cell  type  and  two 
categories  of  performance  status,  so  that  Z*  has 
k*  =  8  strata. 

E.  X’s  considered  in  model  are  treatment  status 
and  age. 

F.  Computer  results  for  no-interaction  model: 
estimated  HR  for  effect  of  treatment  adjusted 
for  age  and  Z*  is  1.134  (P  =  0.548);  not 
significant. 
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G.  Hazard  function  for  no-interaction  model: 

/ig(f,X)  =  /-ZogWexpfPi  Treatment  +  p2  Age], 
g  -  1.2 . 8 

H.  Hazard  function  for  interaction  model: 
hs(t,X )  =  /zog(t)exp[plg  Treatment  +  p2g  Age], 

g  =  1,  2,. . 8 

I.  Alternative  version  of  interaction  model: 

hg  (t,X)  =  /zogftfexpfp!  Treatment  +  p2  Age 
+Pu  (Z,\  x  Treatment)  +  •  •  •  +  /?17(Zy  x  Treatment) 

+P21  (Zj  x  Age)  H - +  PniZq  x  Age)] , 

g  =  1,2,  ...8 

where  Z\  =  large  cell  (binary),  Z\  =  adeno  cell 
(binary),  Z\  =  small  cell  (binary),  Z\  =  PSbin 
(binary), Z5  =  Z\  x  Z\,  Z*6  =  Z2  x  Z^Z)  =  Z3  x  Z4 

J.  Demonstration  that  alternative  interaction 
version  (in  item  I)  is  equivalent  to  original 
interaction  formulation  (in  item  H)  using 
computer  results  for  the  alternative  version. 

K.  Test  of  no-interaction  assumption: 

•  null  hypothesis:  Pi  1  =  Pi2  =  . . .  =  P17  =  0  and 
P21  =  P22  =  •  •  •  =  P27  =  0 

•  LR  ~  /]4df  under  H0:  no  interaction 

•  LR  =  524.040  -  499.944  =  24.096 
(P  =  0.045) 

Conclusion:  Reject  null  hypothesis; 
interaction  model  is  preferred. 

VI.  A  Graphical  View  of  the  Stratified  Cox 
Approach  (pages  221-222) 

Comparison  of  log-log  survival  curves 

1 .  Describe  interaction  of  Rx  and  Sex. 

2.  Describe  violation  of  PH  assumption  for  Sex. 

VII.  The  Stratified  Cox  Likelihood  (pages  223-225) 
VIII.  Summary  (pages  225-227) 


Practice  Exercises  231 


Practice  The  following  questions  derive  from  the  dataset  vets.dat 

Exercises  concerning  the  Veteran’s  Administration  Lung  Cancer  Trial 

that  we  previously  considered  in  this  chapter.  Recall  that 
survival  times  are  in  days  and  that  the  study  size  contains 
137  patients.  The  exposure  variable  of  interest  is  treatment 
status  (standard  =  1 ,  test  =  2) .  Other  variables  of  interest  as 
control  variables  are  cell  type  (four  types,  defined  in  terms 
of  dummy  variables),  performance  status,  disease  dura¬ 
tion,  age,  and  prior  therapy  status.  Failure  status  is  defined 
by  the  status  variable  (0  =  censored,  1  =  died). 

1 .  Consider  the  following  two  edited  printouts  obtained 
from  fitting  a  Cox  PH  model  to  these  data. 


Cox  regression 
Analysis  time  _t: 
survt 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

P(PH) 

Treatment 

0.290 

0.207 

0.162 

1.336 

0.890 

2.006 

0.628 

Large  cell 

0.400 

0.283 

0.157 

1.491 

0.857 

2.594 

0.033 

Adeno  cell 

1.188 

0.301 

0.000 

3.281 

1.820 

5.915 

0.081 

Small  cell 

0.856 

0.275 

0.002 

2.355 

1.374 

4.037 

0.078 

Perf.Stat 

-0.033 

0.006 

0.000 

0.968 

0.958 

0.978 

0.000 

Dis.Durat. 

0.000 

0.009 

0.992 

1.000 

0.982 

1.018 

0.919 

Age 

-0.009 

0.009 

0.358 

0.991 

0.974 

1.010 

0.198 

Pr.  Therapy 

0.007 

0.023 

0.755 

1.007 

0.962 

1.054 

0.145 

No.  of  subjects  = 

137 

Log  likelihood  = 

:  -475.180 

Cox  regression 
Analysis  time  _t: 
survt 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

P(PH) 

Treatment 

0.298 

0.197 

0.130 

1.347 

0.916 

1.981 

0.739 

Small  cell 

0.392 

0.210 

0.062 

1.481 

0.981 

2.235 

0.382 

Perf.Stat 

-0.033 

0.005 

0.000 

0.968 

0.958 

0.978 

0.000 

Dis.Durat. 

-0.001 

0.009 

0.887 

0.999 

0.981 

1.017 

0.926 

Age 

-0.006 

0.009 

0.511 

0.994 

0.976 

1.012 

0.211 

Pr.  Therapy 

-0.003 

0.023 

0.884 

0.997 

0.954 

1.042 

0.146 

No.  of  subjects  = 

137 

Log  likelihood  = 

:  -487.770 

How  do  the  printouts  differ  in  terms  of  what  the 
P(PH )  information  says  about  which  variables  do 
not  satisfy  the  PH  assumption? 

2.  Based  on  the  above  information,  if  you  were  going  to 
stratify  on  the  cell  type  variable,  how  would  you  define 
the  strata?  Explain. 
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3.  Consider  a  stratified  analysis  that  stratifies  on  the 
variables  Z\  =  “small  cell”  and  Z2  =  “performance 
status.”  The  small  cell  variable  is  one  of  the  dummy 
variables  for  cell  type  defined  above.  The  performance 
status  variable  is  dichotomized  into  high  (60  or  above) 
and  low  (below  60)  and  is  denoted  as  PSbin.  The 
stratification  variable  which  combines  categories 
from  Z|  and  Z2  is  denoted  as  SZ*  and  consists  of  four 
categories.  The  predictors  included  (but  not  stratified) 
in  the  analysis  are  treatment  status,  disease  duration, 
age,  and  prior  therapy.  The  computer  results  are  as 
follows: 


Stratified  Cox 
regression 
Analysis  time  _t: 


survt 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Treatment 

0.090 

0.197 

0.647 

1.095 

0.744 

1.611 

Dis.Durat. 

0.000 

0.010 

0.964 

1.000 

0.982 

1.019 

Age 

0.002 

0.010 

0.873 

1.002 

0.983 

1.021 

Pr.  Therapy 

-0.010 

0.023 

0.656 

0.990 

0.947 

1.035 

No.  of  subjects  =137  Log  likelihood  =  —344.848  Stratified  by  SZ* 

Based  on  these  results,  describe  the  point  and  interval 
estimates  for  the  hazard  ratio  for  the  treatment  effect 
adjusted  for  the  other  variables,  including  SZ*.  Is  this 
hazard  ratio  meaningfully  and/or  statistically  signifi¬ 
cant?  Explain. 

4.  State  the  form  of  the  hazard  function  for  the  model 
being  fit  in  question  3.  Why  does  this  model  assume 
no  interaction  between  the  stratified  variables  and  the 
predictors  in  the  model? 

5.  State  two  alternative  ways  to  write  the  hazard  function 
for  an  “interaction  model”  that  allows  for  the  interac¬ 
tion  of  the  stratified  variables  with  the  treatment  status 
variable,  but  assumes  no  other  type  of  interaction. 

6.  State  two  alternative  versions  of  the  hazard  function 
for  an  interaction  model  that  allows  for  the  inter¬ 
action  of  the  stratified  variables  (small  cell  and  perfor¬ 
mance  status)  with  each  of  the  predictors  treatment 
status,  disease  duration,  age,  and  prior  therapy. 

7.  For  the  interaction  model  described  in  question  6, 
what  is  the  formula  for  the  hazard  ratio  for  the  effect 
of  treatment  adjusted  for  the  other  variables?  Does 
this  formula  give  a  different  hazard  ratio  for  different 
strata?  Explain. 
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8.  State  two  alternative  versions  of  the  null  hypothesis 
for  testing  whether  the  no-interaction  assumption  is 
satisfied  for  the  stratified  Cox  model.  Note  that  one 
of  these  versions  should  involve  a  set  of  regression 
coefficients  being  set  equal  to  zero. 

9.  State  the  form  of  the  likelihood  ratio  statistic  for 
evaluating  the  no-interaction  assumption.  How  is 
this  statistic  distributed  under  the  null  hypothesis, 
and  with  what  degrees  of  freedom? 

10.  Provided  below  are  computer  results  for  fitting  the 
interaction  model  described  in  question  6.  In  this  print¬ 
out  the  variable  Z\  denotes  the  small  cell  variable  and 
the  variable  Z\  denotes  the  PSbin  variable.  The  variable 
DDZ]  denotes  the  product  of  Z\  with  disease  duration, 
and  other  product  terms  are  defined  similarly. 


Stratified  Cox 
regression 
Analysis  time  _t: 


survt 

Coef. 

Std.  Err. 

p  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Treatment 

0.381 

0.428 

0.374 

1.464 

0.632 

3.389 

Dis.Durat. 

0.015 

0.021 

0.469 

1.015 

0.975 

1.057 

Age 

0.000 

0.017 

0.994 

1.000 

0.968 

1.033 

Pr.  Therapy 

0.023 

0.041 

0.571 

1.023 

0.944 

1.109 

DDZ* 

-0.029 

0.024 

0.234 

0.971 

0.926 

1.019 

AgeZj 

-0.055 

0.037 

0.135 

0.946 

0.880 

1.018 

P'i'z; 

0.043 

0.075 

0.564 

1.044 

0.901 

1.211 

ddz*2 

0.025 

0.032 

0.425 

1.026 

0.964 

1.092 

AgeZ* 

0.001 

0.024 

0.956 

1.001 

0.956 

1.049 

PTZ* 

-0.078 

0.054 

0.152 

0.925 

0.831 

1.029 

DDZi  Z*2 

-0.071 

0.059 

0.225 

0.931 

0.830 

1.045 

AgeZjZ; 

0.084 

0.049 

0.084 

1.088 

0.989 

1.196 

PTZiZt 

-0.005 

0.117 

0.963 

0.995 

0.791 

1.250 

ti-z; 

0.560 

0.732 

0.444 

1.751 

0.417 

7.351 

f-z; 

-0.591 

0.523 

0.258 

0.554 

0.199 

1.543 

trZiZ, 

-0.324 

0.942 

0.731 

0.723 

0.114 

4.583 

No.  of  subjects  = 

137 

Log  likelihood  =  - 

-335.591 

Stratified  by  SZ* 

Use  the  above  computer  results  to  state  the  form  of  the 
estimated  hazard  model  for  each  of  the  four  strata  of 
the  stratification  variable  SZ*.  Also,  for  each  strata, 
compute  the  hazard  ratio  for  the  treatment  effect 
adjusted  for  disease  duration,  age,  and  prior  therapy. 
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Test 


11.  Carry  out  the  likelihood  ratio  test  to  evaluate  the 
no-interaction  model  described  in  question  4.  In  car¬ 
rying  out  this  test,  make  sure  to  state  the  null  hypoth¬ 
esis  in  terms  of  regression  coefficients  being  set  equal 
to  zero  in  the  interaction  model  fitted  in  question  10. 
Also,  determine  the  p-value  for  this  test  and  state  your 
conclusions  about  significance  as  well  as  which  model 
you  prefer,  the  no-interaction  model  or  the  interaction 
model. 

12.  The  adjusted  log-log  survival  curves  for  each  of  the 
four  strata  defined  by  the  stratification  variable  SZ* 
(adjusted  for  treatment  status,  disease  duration,  age, 
and  prior  therapy)  are  presented  below. 


Using  this  graph,  what  can  you  conclude  about  whether 
the  PH  assumption  is  satisfied  for  the  variables,  small 
cell  type  and  PSbin? 

1 3 .  Comment  on  what  you  think  can  be  learned  by  graph¬ 
ing  adjusted  survival  curves  that  compare  the  two 
treatment  groups  for  each  of  the  four  strata  of  SZ*. 


The  following  questions  consider  a  dataset  from  a  study  by 
Caplehorn  et  al.  (“Methadone  Dosage  and  Retention  of 
Patients  in  Maintenance  Treatment,”  Med.  J.  Aust.,  1991). 
These  data  comprise  the  times  in  days  spent  by  heroin 
addicts  from  entry  to  departure  from  one  of  two  metha¬ 
done  clinics.  Two  other  covariates,  namely,  prison  record 
and  maximum  methadone  dose,  are  believed  to  affect  the 
survival  times.  The  dataset  name  is  addicts.dat.  A  listing  of 
the  variables  is  given  below: 

Column  1:  Subject  ID 

Column  2:  Clinic  (1  or  2) 
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Column  3:  Survival  status  (0  =  censored,  1  = 
departed  from  clinic) 

Column  4:  Survival  time  in  days 

Column  5:  Prison  record  (0  =  none,  1  =  any) 

Column  6:  Maximum  methadone  dose  (mg/day) 


1 .  The  following  edited  printout  was  obtained  from  fitting 
a  Cox  PH  model  to  these  data: 


Cox  regression 
Analysis  time  _t: 


survt 

Coef. 

Std.  Err.  p  >  |z| 

Haz.  Ratio 

[95%  Conf.  Interval] 

P(PH) 

Clinic 

-1.009 

0.215  0.000 

0.365 

0.239 

0.556 

0.001 

Prison 

0.327 

0.167  0.051 

1.386 

0.999 

1.924 

0.332 

Dose 

-0.035 

0.006  0.000 

0.965 

0.953 

0.977 

0.341 

No.  of  subjects  =  238  Log  likelihood  =  —673.403 

Based  on  the  P(PH )  information  in  the  above  printout, 
it  appears  that  clinic  does  not  satisfy  the  PH  assump¬ 
tion;  this  conclusion  is  also  supported  by  comparing 
log-log  curves  for  the  two  clinics  and  noticing  strong 
nonparallelism.  What  might  we  learn  from  fitting  a 
stratified  Cox  (SC)  model  stratifying  on  the  clinic  vari¬ 
able?  What  is  a  drawback  to  using  a  SC  procedure  that 
stratifies  on  the  clinic  variable? 

2.  The  following  printout  was  obtained  from  fitting  a  SC 
PH  model  to  these  data,  where  the  variable  being  stra¬ 
tified  is  clinic: 

Stratified  Cox 
regression 
Analysis  time  _t: 


survt 

Coef. 

Std.  Err. 

P  >  M 

Haz.  Ratio 

[95%  Conf.  Interval] 

Prison 

Dose 

0.389 

-0.035 

0.169 

0.006 

0.021 

0.000 

1.475 

0.965 

1.059  2.054 

0.953  0.978 

No.  of  subjects  =  238  Log  likelihood  =  —597.714  Stratified  by  clinic 

Using  the  above  fitted  model,  we  can  obtain  the 
adjusted  curves  below  that  compare  the  adjusted  sur¬ 
vival  probabilities  for  each  clinic  (i.e.,  stratified  by 
clinic)  adjusted  for  the  variables,  prison  and  maximum 
methadone  dose. 
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Based  on  these  adjusted  survival  curves,  what  conclu¬ 
sions  can  you  draw  about  whether  the  survival  experi¬ 
ence  is  different  between  the  two  clinics?  Explain. 

3.  State  the  hazard  function  model  being  estimated  in  the 
above  computer  results.  Why  is  this  model  a  no-inter¬ 
action  model? 

4.  Using  the  above  computer  results,  provide  point  and 
interval  estimates  for  the  effect  of  prison  adjusted  for 
clinic  and  dose.  Is  this  adjusted  prison  effect  signifi¬ 
cant?  Explain. 

5.  The  following  computer  results  consider  a  SC  model 
that  allows  for  interaction  of  the  stratified  variable 
clinic  with  each  of  the  predictors,  prison  and  dose. 
Product  terms  in  the  model  are  denoted  as  clinpr  = 
clinic  x  prison  and  clindos  =  clinic  x  dose. 


Stratified  Cox 
regression 
Analysis  time  _t: 


survt 

Coef. 

Std.  Err. 

P  >  |z| 

Haz.  Ratio 

[95%  Conf.  Interval] 

prison 

1.087 

0.539 

0.044 

2.966 

1.032 

8.523 

dose 

-0.035 

0.020 

0.079 

0.966 

0.929 

1.004 

clinpr 

-0.585 

0.428 

0.172 

0.557 

0.241 

1.290 

clindos 

-0.001 

0.015 

0.942 

0.999 

0.971 

1.028 

No.  of  subjects  =  238  Log  likelihood  =  —596.779  Stratified  by  clinic 

State  two  alternative  versions  of  the  interaction  model 
being  estimated  by  the  above  printout,  where  one  of 
these  versions  should  involve  the  product  terms  used  in 
the  above  printout. 

6.  Using  the  computer  results  above,  determine  the  esti¬ 
mated  hazard  models  for  each  clinic.  (Note  that  the 
clinics  are  coded  as  1  or  2.) 
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7.  below  are  the  adjusted  survival  curves  for  each  clinic 
based  on  the  interaction  model  results  above.  These 
curves  are  adjusted  for  the  prison  and  dose  variables. 


Compare  the  survival  curves  by  clinic  obtained  for  the 
interaction  model  with  the  corresponding  curves  previ¬ 
ously  shown  for  the  no-interaction  model.  Do  both 
graphs  indicate  the  similar  conclusions  about  the  clinic 
effect?  Explain. 

8.  Carry  out  a  likelihood  ratio  test  to  determine  whether 
the  no-interaction  model  is  appropriate.  In  doing  so, 
make  use  of  the  computer  information  described 
above,  state  the  null  hypothesis,  state  the  form  of  the 
likelihood  statistic  and  its  distribution  under  the  null 
hypothesis,  and  compute  the  value  of  the  likelihood 
statistic  and  evaluate  its  significance.  What  are  your 
conclusions? 


1.  The  first  printout  indicates  that  the  variables  large 
cell,  adeno  cell,  small  cell,  and  performance  status  do 
not  satisfy  the  PH  assumption  at  the  0.10  level.  The 
second  printout  considers  a  different  model  that  does 
not  contain  the  large  cell  and  adeno  cell  variables. 
This  latter  printout  indicates  that  small  cell  satisfies 
the  PH  assumption,  in  contrast  to  the  first  printout. 
The  performance  status  variable,  however,  does  not 
satisfy  the  PH  assumption  as  in  the  first  printout. 

2.  The  cell  type  variable  is  defined  to  have  four  cate¬ 
gories,  as  represented  by  the  three  dummy  variables 
in  the  first  printout.  The  “small  cell”  variable  dic¬ 
hotomizes  the  cell  type  variable  into  the  categories 
small  cell  type  versus  the  rest.  From  the  second  print¬ 
out,  the  small  cell  variable  does  not  appear  by  itself  to 
violate  the  PH  assumption.  This  result  conflicts  with 
the  results  of  the  first  printout,  for  which  the  cell  type 
variable  considered  in  four  categories  does  not  satisfy 
the  PH  assumption  at  the  0.10  level  of  significance. 
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We  therefore  think  it  is  more  appropriate  to  use  a  SC 
procedure  only  if  four  strata  are  to  be  used.  A  draw¬ 
back  to  using  four  strata,  however,  is  that  the  number 
of  survival  curves  to  be  plotted  is  larger  than  for  two 
strata;  consequently,  a  large  number  of  curves  is  more 
difficult  to  interpret  graphically  than  when  there  are 
only  two  curves.  Thus,  for  convenience  of  interpreta¬ 
tion,  we  may  choose  to  dichotomize  the  cell  type  vari¬ 
able  instead  of  considering  four  strata.  We  may  also 
consider  dichotomies  other  than  those  defined  by  the 
small  cell  variable.  For  instance,  we  might  consider 
dichotomizing  on  either  the  adeno  or  large  cell  vari¬ 
ables  instead  of  the  small  cell  variable.  Alternatively, 
we  may  combine  categories  so  as  to  compare,  say, 
large  and  adeno  cell  types  with  small  and  squamous 
types.  However,  a  decision  to  combine  categories 
should  not  be  just  a  statistical  decision,  but  should 
also  be  based  on  biologic  considerations. 

3.  HRadi  =  1.095,  95%  Cl:  (0.744,1.611),  two-tailed  p-value 
is  0.647,  not  significant.  The  estimated  hazard  ratio  for 
treatment  is  neither  meaningfully  nor  statistically  sig¬ 
nificant.  The  point  estimate  is  essentially  1 ,  which  says 
that  there  is  no  meaningful  effect  of  treatment  adjusted 
for  the  predictors  in  the  model  and  for  the  stratified 
predictor  SZ*. 

4.  /zg(t,X)  =  /z0g(t)exp[Pi  Treatment  +  p 2DD  +  p3  Age  + 
P4PT],  g  =  1,. . .,  4,  where  the  strata  are  defined  from 
the  stratification  variable  SZ*,  DD  =  disease  duration, 
and  PT  =  prior  therapy.  This  model  assumes  no  inter¬ 
action  because  the  coefficient  of  each  predictor  in  the 
model  is  not  subscripted  by  g,  i.e.,  the  regression  coef¬ 
ficients  are  the  same  for  each  stratum. 

5.  Version  1:  hg  (f,X)  =  /z0g(t)exp[plg  Treatment  +  p2  DD 
+  p3  Age  +  p 4PT],  g  =  1,. . .,  4. 

Version  2:  hg  (/ ,X)  =  /? )exp[ p ,  Treatment  +  p2  DD 
+  /?3Age  +  +  /)5(Zj  x  Treatment) 

+/16(Z2  x  Treatment)  +  /)7(Zj  xZj  x  Treatment)], 
where  Z\  =  small  cell  type  (0,  1),  ZJ  =  Psbin  (0,  1), 
andg  =  1,. . .,  4. 
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6.  Version  1:  hs(t,X)  =  /j0g(f)exp[plg  Treatment  +  p2g  DD 
+  P3g  Age  +  p4g  PT],  g  =  1,. . 4. 

Version  2:  hg(t,  X)  =  hog(t)  exp[pl  Treatment  +  /?2DD 
+  [i3  Age  +  /i4  PT  +  (Zj  x  Treatment)  +  p6  [Z\  x  DD) 

+  /i7(Zj  x  Age)  +  jS8(Zj  x  PT)  +  /?9(Z2  x  Treatment) 

+  j810 (Z*  x  DD)  +  Pn  (Z*2  x  Age)  +  pn(Z*2  x  PT) 

+  /tl3(z;  x  Z*  x  Treatment)  +  £14(Z*  x  Z*2  x  DD) 

+y515(Zt  X  z;  x  Age)  +  plb(z\  x?x  PT)]  > 
g  =  1, ...  ,4. 

7.  //Pg  =  exp(plg),  using  version  1  model  form.  Yes,  this 
formula  gives  different  hazard  ratios  for  different 
strata  because  the  value  of  the  hazard  ratio  changes 
with  the  subscript  g. 

8.  H0:  No  interaction  assumption  is  satisfied. 

H0:  Pll  =  Pl2  =  Pl3  =  Pl4)  p21  =  P22  =  P23  =  P24> 

P31  =  P32  =  p33  =  P34>  P41  =  P42  =  p43  =  P44 
from  version  1 . 

H0:  P5  =  P6  =  P7  =  Ps  =  P9  =  P10  =  Pll  =  P12 

=  P13  =  Pi4  =  P15  =  Pi6  =  0  from  version  2. 

9.  LR  =  —2  In  Lr  — (— 2  In  LF),  where  R  denotes  the 
reduced  (no-interaction)  model  and  F  denotes  the  full 
(interaction)  model.  Under  the  null  hypothesis,  LR  is 
approximately  a  chi-square  with  1 2  degrees  of  freedom. 

10.  Estimated  hazard  models  for  each  stratum: 

g=  i;  z*  =  z;  =  0: 

h  1  ,X)  =  /;m(t)exp[(0.381)Treatment  +  (0.015)DD  + 
(O.OOO)Age  +  (0.023)PP] 

g  =  2;  Zj  =  1 ,  Z2  =  0  : 

/t2(/,X)  =  /)02(t)exp[(0.941)Treatment  +  (-0.014)DD  + 
(— 0.055)Age  +  (0.066)PP] 

g  =  3;  Z\  =  0,  Z)  =1  : 

=  /)03(t)exp[(-0.210)Treatment  +  (0.040)DD  + 
(O.OOl)Age  +  (— 0.055)PP] 

g  =  4;  Z,  =  1,  Z*  =  1  : 

/?4(t,X)  =  /*o4(t)exp[(0.026)Treatment  +  (-0.060)DD  + 
(0.030)Age  +  (— 0.017)PP] 
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Estimated  hazard  ratios  for  treatment  effect  adjusted 
for  DD,  Age,  and  PT: 

g  =  1  :  HRi  =  exp(0.381)  =  1.464 
g  =  2:HR2  =  exp(0.941)  =  2.563 
g  =  3  :  HRi  =  exp(— 0.210)  =  0.811 
g  =  4  :  HR4  =  exp(0.026)  =  1.026 

11.  Hq.  P5  =  p6  =  p7  =  pg  =  p9  =  P10  =  Pn  =  P12  =  P13 
=  P14  =  P15  =  Pi6  =  0 

LR  =  689.696  —  671.182  =  18.514,  which  is  approxi¬ 
mately  chi-square  with  12  df. 

P  =  0.101,  which  is  not  significant  below  the  .05  level. 
Conclusion :  Accept  the  null  hypothesis  and  conclude 
that  the  no-interaction  model  is  preferable  to  the 
interaction  model. 

12.  The  three  curves  at  the  bottom  of  the  graph  appear  to 
be  quite  non-parallel.  Thus,  the  PH  assumption  is  not 
satisfied  for  one  or  both  of  the  variables,  small  cell 
type  and  PSbin.  Note,  however,  that  because  both 
these  variables  have  been  stratified  together,  it  is 
not  clear  from  the  graph  whether  only  one  of  these 
variables  fails  to  satisfy  the  PH  assumption. 

13.  If  we  graph  adjusted  survival  curves  that  compare  the 
two  treatment  groups  for  each  of  the  four  strata,  we 
will  be  able  to  see  graphically  how  the  treatment 
effect,  if  any,  varies  over  time  within  each  strata.  The 
difficulty  with  this  approach,  however,  is  that  eight 
adjusted  survival  curves  will  be  produced,  so  that  if  all 
eight  curves  are  put  on  the  same  graph,  it  may  be 
difficult  to  see  what  is  going  on. 
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Introduction 


Abbreviated 

Outline 


We  begin  by  defining  a  time-dependent  variable  and 
providing  some  examples  of  such  a  variable.  We  also  state 
the  general  formula  for  a  Cox  model  that  is  extended  to 
allow  time  dependent  variables,  followed  by  a  discussion  of 
the  characteristics  of  this  model,  including  a  description  of 
the  hazard  ratio. 

In  the  remainder  of  the  presentation,  we  give  examples  of 
models  with  time-dependent  variables,  including  models 
that  allow  for  checking  the  PH  assumption  for  time- 
independent  variables.  In  particular,  we  describe  a  method 
that  uses  “heaviside  functions”  to  evaluate  the  PH  assumption 
for  time-independent  variables.  We  also  describe  two 
computer  applications  of  the  extended  Cox  model,  one  con¬ 
cerning  a  study  on  the  treatment  of  heroin  addiction  and  the 
other  concerning  the  Stanford  heart  transplant  study. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Preview  (page  244) 

II.  Review  of  the  Cox  PH  Model  (pages  244-246) 

III.  Definition  and  Examples  of  Time-Dependent 
Variables  (pages  246-249) 

IV.  The  Extended  Cox  Model  for  Time-Dependent 
Variables  (pages  249-251) 

V.  The  Hazard  Ratio  Formula  for  the  Extended  Cox 
Model  (pages  251-253) 

VI.  Assessing  Time-Independent  Variables  That  Do 
Not  Satisfy  the  PH  Assumption  (pages  254-259) 

VII.  An  Application  of  the  Extended  Cox  Model  to  An 
Epidemiologic  Study  on  the  Treatment  of  Heroin 
Addiction  (pages  260-264) 

VIII.  An  Application  of  the  Extended  Cox  Model  to  the 
Analysis  of  the  Stanford  Heart  Transplant  Data 

(pages  265-269) 

IX.  The  Extended  Cox  Likelihood  (pages  269-274) 

X.  Summary  (pages  274-277) 
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Objectives 


Upon  completing  the  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  the  general  form  of  the  Cox  model 
extended  for  time-dependent  variables. 

2.  State  the  specific  form  of  an  extended  Cox  model 
appropriate  for  the  analysis,  given  a  survival  analysis 
scenario  involving  one  or  more  time-dependent 
variables. 

3.  State  the  formula  for  a  designated  hazard  ratio  of 
interest,  given  a  scenario  describing  a  survival  analysis 
using  an  extended  Cox  model. 

4.  State  the  formula  for  an  extended  Cox  model  that 
provides  a  method  for  checking  the  PH  assumption  for 
one  more  of  the  time-independent  variables  in  the 
model,  given  a  scenario  describing  a  survival  analysis 
involving  time-independent  variables. 

5.  State  the  formula  for  an  extended  Cox  model  that  uses 
one  or  more  heaviside  functions  to  check  the  PH 
assumption  for  one  more  of  the  time-independent 
variables  in  the  model,  given  a  scenario  describing  a 
survival  analysis  involving  time-independent  variables. 

6.  State  the  formula  for  the  hazard  ratio  during  different 
time  interval  categories  specified  by  heaviside 
function(s)  that  are  contained  in  an  extended  cox  model. 

7.  Carry  out  an  appropriate  analysis  of  the  data  to 
evaluate  the  effect  of  one  or  more  of  the  explanatory 
variables  in  the  model(s)  being  used,  given  computer 
results  for  a  survival  analysis  involving  time-dependent 
variables.  Such  an  analysis  will  involve: 

•  computing  and  interpreting  any  hazard  ratio(s) 
of  interest; 

•  carrying  out  and  interpreting  appropriate  test(s) 
of  hypotheses  for  effects  of  interest; 

•  obtaining  confidence  intervals  for  hazard  ratios  of 
interest; 

•  evaluating  interaction  and  confounding  involving 
one  or  more  covariates. 
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Presentation 


I.  Preview 


This  presentation  describes  how  the  Cox 
proportional  hazards  (PH)  model  can  be 
extended  to  allow  time-dependent  variables  as 
predictors.  Here,  we  focus  on  the  model  form, 
characteristics  of  this  model,  the  formula  for 
and  interpretation  of  the  hazard  ratio,  and 
examples  of  the  extended  Cox  model.  We  also 
show  how  the  extended  Cox  model  can  be 
used  to  check  the  PH  assumption  for  time- 
independent  variables,  and  we  provide  com¬ 
puter  applications  to  illustrate  different  types 
of  time-dependent  variables.  Finally,  we  des¬ 
cribe  the  extended  cox  likelihood  and  how  it 
contrasts  with  the  Cox  PH  likelihood  function. 


1 1 .  Review  of  the  Cox  The  general  form  of  the  Cox  PH  model  is  shown 

PH  Model  here.  This  model  gives  an  expression  for  the 

hazard  at  time  t  for  an  individual  with  a  given 
specification  of  a  set  of  explanatory  variables 
denoted  by  the  bold  X.  That  is,  the  bold  X 
represents  a  collection  (sometimes  called  a 
"vector”)  of  predictor  variables  that  is  being 
modeled  to  predict  an  individual’s  hazard. 

x  =  (x1(x2,  ...,xp) 

Explanatory/predictor  variables 


h(t,X)  =  h0(t)  exp 


i=i 


ho(t)  x  exp 


Baseline  hazard 

Involves  t  but 
not  Xs 


E^- 


Exponential 

Involves  X’s  but 
not  t  (Xs  are 
time- 

independent) 


The  Cox  model  formula  says  that  the  hazard  at 
time  t  is  the  product  of  two  quantities.  The  first 
of  these,  hQ(t),  is  called  the  baseline  hazard 
function.  The  second  quantity  is  the  exponen¬ 
tial  expression  e  to  the  linear  sum  of  (!,  X,,  where 
the  sum  is  over  the  p  explanatory  X  variables. 

An  important  feature  of  this  formula,  which 
concerns  the  proportional  hazards  (PH) 
assumption,  is  that  the  baseline  hazard  is  a 
function  of  t  but  does  not  involve  the  Xs, 
whereas  the  exponential  expression  involves 
the  X’s  but  does  not  involve  t.  The  Xs  here  are 
called  time-independent  X’s. 


Presentation:  II.  Review  of  the  Cox  PH  Model  245 


X’s  involving  t:  time-dependent 

Requires  extended  Cox  model 
(no  PH) 


It  is  possible,  nevertheless,  to  consider  X’s 
that  do  involve  t.  Such  X’s  are  called  time- 
dependent  variables.  If  time-dependent  vari¬ 
ables  are  considered,  the  Cox  model  form  may 
still  be  used,  but  such  a  model  no  longer  satisfies 
the  PH  assumption  and  is  called  the  extended 
Cox  model.  We  will  discuss  time-dependent 
variables  and  the  corresponding  extended  Cox 
model  beginning  in  the  next  section. 


Hazard  ratio  formula: 


HR  =  exp 


Ear 


-x 


From  the  Cox  PH  model,  we  can  obtain  a 
general  formula,  shown  here,  for  estimating  a 
hazard  ratio  that  compares  two  specifications 
of  the  X’s,  defined  as  X*  and  X. 


where  X*  =  (x|,  X*,...  ,X*) and 
X  =  (Xu  X2,  . . .,  xj)  denote  the  two 
sets  of  X’s. 


PH  assumption: 

h(t,X*)  - 

— - =  0  (a  constant  over  t) 

h(t,  X) 

i.e.,  h(t,X*)  =  Qh(t,X) 


The  (PH)  assumption  underlying  the  Cox  PH 
model  is  that  the  hazard  ratio  comparing  any 
two  specifications  of  X  predictors  is  constant 
over  time.  Equivalently,  this  means  that  the 
hazard  for  one  individual  is  proportional  to  the 
hazard  for  any  other  individual,  where  the  pro¬ 
portionality  constant  is  independent  of  time. 


Hazards  cross  =>  PH  not  met  An  example  of  when  the  PH  assumption  is  not 

met  is  given  by  any  study  situation  in  which  the 
Hazards  don’t  cross  X  PH  met  hazards  for  two  or  more  groups  cross  when 

graphed  against  time.  However,  even  if  the 
hazard  functions  do  not  cross,  it  is  possible 
that  the  PH  assumption  is  not  met. 


Three  approaches: 

•  graphical 

•  time-dependent  variables 

•  goodness-of-fit  test 


As  described  in  more  detail  in  Chapter  4,  there 
are  three  general  approaches  for  assessing  the 
PH  assumption.  These  are 

•  a  graphical  approach; 

•  the  use  of  time-dependent  variables  in  an 
extended  Cox  model;  and 

•  the  use  of  a  goodness-of-fit  test. 


Time-dependent  covariates  used  to 
assess  PH  for  time-indep.  X’s 

If 

Extend  Cox  model:  add  product 
term(s)  involving  function  of  t 


When  time-dependent  variables  are  used  to 
assess  the  PH  assumption  for  a  time- 
independent  variable,  the  Cox  model  is 
extended  to  contain  product  (i.e.,  interaction) 
terms  involving  the  time-independent  variable 
being  assessed  and  some  function  of  time. 
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EXAMPLE 


h(t,X. )  =  /20(f)  expfPisex  +  p2(sex  x  f)] 
H0:  p2  =  0  =>  PH  assumption  satisfied 


Options  when  PH  assumption  not 
satisfied: 

•  Use  a  stratified  Cox  (SC)  model. 

•  Use  time-dependent  variables. 


Time-dependent  variables  may  be: 

•  inherently  time-dependent 

•  defined  to  analyze  a  time- 
independent  predictor  not 
satisfying  the  PH  assumption. 


For  example,  if  the  PH  assumption  is  being 
assessed  for  sex,  a  Cox  model  might  be 
extended  to  include  the  variable  sex  x  /  in  addi¬ 
tion  to  sex.  If  the  coefficient  of  the  product 
term  turns  out  to  be  non-significant,  we  can 
conclude  that  the  PH  assumption  is  satisfied 
for  sex  provided  that  the  variable  sex  x  t  is  an 
appropriate  choice  of  time-dependent  variable. 

There  are  two  options  to  consider  if  the  PH 
assumption  is  not  satisfied  for  one  or  more  of 
the  predictors  in  the  model.  In  Chapter  5,  we 
described  the  option  of  using  a  stratified  Cox 
(SC)  model,  which  stratifies  on  the  predictor(s) 
not  satisfying  the  PH  assumption,  while 
keeping  in  the  model  those  predictors  that 
satisfy  the  PH  assumption.  In  this  chapter,  we 
describe  the  other  option,  which  involves  using 
time-dependent  variables. 

Note  that  a  given  study  may  consider  predictors 
that  are  inherently  defined  as  time-dependent, 
as  we  will  illustrate  in  the  next  section.  Thus, 
in  addition  to  considering  time-dependent 
variables  as  an  option  for  analyzing  a  time- 
independent  variable  not  satisfying  the  PH 
assumption,  we  also  discuss  predictors  which 
are  inherently  defined  as  time-dependent. 


III.  Definition  and 
Examples  of  Time- 
Dependent  Variables 

Definition: 


A  time-dependent  variable  is  defined  as 
any  variable  whose  value  for  a  given  subject 
may  differ  over  time  (t).  In  contrast,  a  time- 
independent  variable  is  a  variable  whose  value 
for  a  given  subject  remains  constant  over  time. 


Time- 

dependent 

Time- 

independent 

Value  of  variable 

Value  of  variable 

differs  over  time 

is  constant  over 

time 

Example: 

Race  x  t 

Race 

As  a  simple  example,  the  variable  RACE  is  a 
time-independent  variable,  whereas  the  vari¬ 
able  RACE  x  time  is  a  time-dependent  variable. 
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EXAMPLE  OF  DEFINED  VARIABLES 


Defined  variable:  RACE  x  t 


Time-independent 

Race  =  1  =>  Race  x  t  =  t 

Race  =  0  =>  Race  x  t  =  0  (at  any  t) 


E  x  (log  t  -  3) 

Function  of  t 

fE  denotes  a  (0,1)  exposure  variable]. 


Heaviside  function 


1 

0 


if  t  >  ?0 
if  t  <  t0 


■p 

t  >  to  ■  E  x  g(t)  =  E 

m 

t  <  to  :  E  x  g{t)  =  0 


Heaviside  functions  used  when  PH 
assumptions  not  met. 


The  variable  RACE  x  time  is  an  example  of 
what  is  called  a  “defined”  time-dependent  vari¬ 
able.  Most  defined  variables  are  of  the  form  of 
the  product  of  a  time-independent  variable 
(e.g.,  RACE)  multiplied  by  time  or  some  func¬ 
tion  of  time.  Note  that  after  RACE  is  deter¬ 
mined  for  a  given  subject,  all  the  values  of  the 
RACE  x  time  variable  are  completely  defined 
over  a  specified  time  interval  of  study. 


A  second  example  of  a  defined  variable  is  given 
by  £  x  (log  t  -  3),  where  £  denotes,  say,  a  (0,1) 
exposure  status  variable  determined  at  one’s 
entry  into  the  study.  Notice  that  here  we  have 
used  a  function  of  time,  log  t  —  3,  rather  than 
time  t  alone. 


Yet  another  example  of  a  defined  variable, 
which  also  involves  a  function  of  time,  is 
given  by  £  x  g(t),  where  g(t)  is  defined  to  take 
on  the  value  1  if  t  is  greater  than  or  equal  to 
some  specified  value  of  t,  called  t0,  and  takes  on 
the  value  0  if  t  is  less  than  t0. 

The  function  g(t)  is  called  a  “heaviside”  func¬ 
tion.  Note  that  whenever  t  is  greater  than  or 
equal  to  t0,  g(t)  equals  1,  so  £  x  g(t)  =  E;  how¬ 
ever,  whenever  t  is  less  than  t0,  g(t)  =  0,  so  the 
value  of  E  x  g(t)  is  always  0.  We  will  later  return 
to  illustrate  how  heaviside  functions  may  be 
used  as  one  method  for  the  analysis  when  a 
time-independent  variable  like  £  does  not  sat¬ 
isfy  the  proportional  hazards  assumption. 


Internal  variable:  Another  type  of  time-dependent  variable  is 

called  an  “internal”  variable.  Examples  of 
such  a  variable  include  exposure  level  £  at 
time  t,  employment  status  ( EMP )  at  time  t, 
smoking  status  ( SMK )  at  time  t,  and  obesity 
level  ( OBS )  at  time  t. 


EXAMPLES  OF  INTERNAL 
VARIABLES 


E(t),  EMP(t),  SMK(t),  OBS(t), 


Values  change  because  of  “inter¬ 
nal”  characteristics  or  behavior  of 
the  individual. 


All  these  examples  consider  variables  whose 
values  may  change  over  time  for  any  subject 
under  study;  moreover,  for  internal  variables, 
the  reason  for  a  change  in  value  depends  on 
"internal”  characteristics  or  behavior  specific 
to  the  individual. 
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"Ancillary”  variable: 

Value  changes  because  of  “external” 
characteristics. 


EXAMPLES  OF  ANCILLARY 
VARIABLES 


Air  pollution  index  at  time  t;  EMP{t) 


In  contrast,  a  variable  is  called  an  “ancillary” 
variable  if  its  value  changes  primarily  because 
of  “external”  characteristics  of  the  environ¬ 
ment  that  may  affect  several  individuals  simul¬ 
taneously.  An  example  of  an  ancillary  variable 
is  air  pollution  index  at  time  t  for  a  particular 
geographical  area.  Another  example  is  employ¬ 
ment  status  ( EMP )  at  time  t,  if  the  primary 
reason  for  whether  someone  is  employed  or 
not  depends  more  on  general  economic  cir¬ 
cumstances  than  on  individual  characteristics. 


ANOTHER  EXAMPLE 


Heart  transplant  status  at  time  f: 


HT(t)  = 


1  if  received  transplant  at  some  time  tQ<t 
0  if  did  not  receive  transplant  by  time  t 


Transplant  HT(t):  0000.  ..0  111111111 

i  ~  ,l 


No  transplant  HT(t ):  0000. ..00000 


Heart  transplant  status  =  HT(t ) 


Internal:  Ancillary: 

Status  determined  Status  determined 

from  individual  from  external 

traits  availability  of  a 

donor 


As  another  example,  which  may  be  part  inter¬ 
nal  and  part  ancillary,  we  consider  heart  trans¬ 
plant  status  (. HT )  at  time  t  for  a  person 
identified  to  have  a  serious  heart  condition, 
making  him  or  her  eligible  for  a  transplant. 
The  value  of  this  variable  HT  at  time  /  is  1  if 
the  person  has  already  received  a  transplant  at 
some  time,  say  t0,  prior  to  time  t.  The  value  of 
HT  is  0  at  time  t  if  the  person  has  not  yet 
received  a  transplant  by  time  t. 


Note  that  once  a  person  receives  a  transplant, 
at  time  t0,  the  value  of  HT  remains  at  1  for  all 
subsequent  times.  Thus,  for  a  person  receiving 
a  transplant,  the  value  of  HT  is  0  up  to  the  time 
of  transplant,  and  then  remains  at  1  thereafter. 
In  contrast,  a  person  who  never  receives  a 
transplant  has  HT  equal  to  0  for  all  times  dur¬ 
ing  the  period  he  or  she  is  in  the  study. 


The  variable  "heart  transplant  status,”  HT(t), 
can  be  considered  essentially  an  internal  vari¬ 
able,  because  individual  traits  of  an  eligible 
transplant  recipient  are  important  determinants 
of  the  decision  to  cany  out  transplant  surgery. 
Nevertheless,  the  availability  of  a  donor  heart 
prior  to  tissue  and  other  matching  with  an  eligi¬ 
ble  recipient  can  be  considered  an  “ancillary” 
characteristic  external  to  the  recipient. 
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Computer  commands  differ  for 
defined  vs.  internal  vs.  ancillary. 

But,  the  form  of  extended  Cox 
model  and  procedures  for  analysis 
are  the  same  regardless  of  variable 
type. 


The  primary  reason  for  distinguishing  among 
defined,  internal,  or  ancillary  variables  is  that 
the  computer  commands  required  to  define  the 
variables  for  use  in  an  extended  Cox  model  are 
somewhat  different  for  the  different  variable 
types,  depending  on  the  computer  program 
used.  Nevertheless,  the  form  of  the  extended 
Cox  model  is  the  same  regardless  of  variable 
type,  and  the  procedures  for  obtaining  esti¬ 
mates  of  regression  coefficients  and  other 
parameters,  as  well  as  for  carrying  out  statisti¬ 
cal  inferences,  are  also  the  same. 


IV.  The  Extended  Cox 
Model  for  Time- 
Dependent  Variables 


h(t,X(t))  =ho(t)  exp 


P2 

+EW) 


j= i 


X(0=  (xux2,...xPl, 

s - v - ' 

Time— independent 

X1(t),X2(t),...XP2(t)) 

" - - V - ' 

Time— dependent 


Given  a  survival  analysis  situation  involving 
both  time-independent  and  time-dependent 
predictor  variables,  we  can  write  the  extended 
Cox  model  that  incorporates  both  types  as 
shown  here  at  the  left.  As  with  the  Cox  PH 
model,  the  extended  model  contains  a  baseline 
hazard  function  hQ{t)  which  is  multiplied  by 
an  exponential  function.  However,  in  the 
extended  model,  the  exponential  part  contains 
both  time-independent  predictors,  as  denoted 
by  the  X-,  variables,  and  time-dependent  predic¬ 
tors,  as  denoted  by  the  Xj(t)  variables.  The 
entire  collection  of  predictors  at  time  t  is 
denoted  by  the  bold  X(t). 


EXAMPLE 


h(t,X.(t))  =  /z0(f)exp[|lE  +  5 (E  x  f)], 
Pi  =  l,P2  =  1, 

X(f)  =  (X1  =  E,  Ai(f)  =Ext ) 


As  a  simple  example  of  an  extended  Cox  model, 
we  show  here  a  model  with  one  time-indepen- 
dent  variable  and  one  time-dependent  variable. 
The  time-independent  variable  is  exposure  sta¬ 
tus  E,  say  a  (0,1)  variable,  and  the  time-depen- 
dent  variable  is  the  product  term  Ext. 


Estimating  regression 
coefficients: 

ML  procedure: 

Maximize  (partial)  L. 

Risk  sets  more  complicated  than 
for  PH  model. 


As  with  the  simpler  Cox  PH  model,  the  regres¬ 
sion  coefficients  in  the  extended  Cox  model  are 
estimated  using  a  maximum  likelihood  (ML) 
procedure.  ML  estimates  are  obtained  by  max¬ 
imizing  a  (partial)  likelihood  function  L.  How¬ 
ever,  the  computations  for  the  extended  Cox 
model  are  more  complicated  than  for  the  Cox 
PH  model,  because  the  risk  sets  used  to  form  the 
likelihood  function  are  more  complicated  with 
time-dependent  variables.  The  extended  Cox 
likelihood  is  described  later  in  this  chapter. 
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Computer  packages  that  include  programs  for 
fitting  the  extended  Cox  model  include  Stata, 
SAS,  SPSS,  and  R.  See  the  Computer  Appendix 
at  the  end  of  this  text  for  a  comparison  of  the 
Stata,  SAS,  SPSS,  and  R  procedures  applied  to 
the  same  dataset. 

Statistical  inferences:  Methods  for  making  statistical  inferences  are 

Wald  and/or  LR  tests  essentially  the  same  as  for  the  PH  model.  That  is, 

Large  sample  confidence  intervals  one  can  use  Wald  and/or  likelihood  ratio  ( LR )  tests 

and  large  sample  confidence  interval  methods. 

Assumption  of  the  model:  An  important  assumption  of  the  extended  Cox 

The  hazard  at  time  t  depends  on  model  is  that  the  effect  of  a  time-dependent 
the  value  of  Xj(t)  at  that  same  time,  variable  Xj(t)  on  the  survival  probability  at 

time  t  depends  on  the  value  of  this  variable  at 
that  same  time  /,  and  not  on  the  value  at  an 
earlier  or  later  time. 

Note  that  even  though  the  values  of  the  variable 
Xj{t)  may  change  over  time,  the  hazard  model 
provides  only  one  coefficient  for  each  time- 
dependent  variable  in  the  model.  Thus,  at 
time  t,  there  is  only  one  value  of  the  variable 
Xj(t)  that  has  an  effect  on  the  hazard,  that  value 
being  measured  at  time  /. 


Can  modify  for  lag-time  effect  It  is  possible,  nevertheless,  to  modify  the  defi¬ 

nition  of  the  time-dependent  variable  to  allow 
for  a  “lag-time”  effect. 

Lag-time  effect:  To  illustrate  the  idea  of  a  lag-time  effect, 

suppose,  for  example,  that  employment  status, 
measured  weekly  and  denoted  as  EMP(t),  is 
the  time-dependent  variable  being  considered. 
Then,  an  extended  Cox  model  that  does 
not  consider  lag-time  assumes  that  the  effect 
of  employment  status  on  the  probability  of 
survival  at  week  t  depends  on  the  observed 
value  of  this  variable  at  the  same  week  t,  and 
not,  for  example,  at  an  earlier  week. 

However,  to  allow  for,  say,  a  time-lag  of  one 
week,  the  employment  status  variable  may  be 
modified  so  that  the  hazard  model  at  time 
t  is  predicted  by  the  employment  status  at 
week  Z  —  1 .  Thus,  the  variable  EMP(t )  is  replaced 
in  the  model  by  the  variable  EMP(t  —  1). 


EXAMPLE 


EMP(t)  =  employment  status  at  week  t 

Model  without  lag-time: 
h(t,X(t))  =  h0(t)exp[8EMP(t)\ 

Model  with  1-week  lag-time: 
h(t,X(t))  =  h0(t)exp[S*EMP(t  -  1)] 

/ 

One-week  earlier 


"  Pi 

h(t,X(t))  =  h0(t)  exp 


+  X!  vow 

i=,t 

One  coefficient  for  X,(t) 


Computer  programs  for  the 
extended  Cox  model: 

Stata  (Stcox) 

SAS  (PHREG)  Computer 
SPSS  (COXREG)  Appendix 
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General  lag-time  extended  model: 


h(t,X(t))  =  ho(t)  exp 


E^ 


;=  1 


j=  1  yf  J 

Xj(t  —  Lj)  replaces  Xj(t) 


More  generally,  the  extended  Cox  model  may 
be  alternatively  written  to  allow  for  a  lag-time 
modification  of  any  time-dependent  variable  of 
interest.  If  we  let  L,  denote  the  lag-time  speci¬ 
fied  for  time-dependent  variable  ;,  then  the 
general  “lag-time  extended  model”  can  be  writ¬ 
ten  as  shown  here.  Note  that  the  variable  X,(t) 
in  the  earlier  version  of  the  extended  model  is 
now  replaced  by  the  variable  Xj(t  -  Lj). 


V.  The  Hazard  Ratio  We  now  describe  the  formula  for  the  hazard 

Formula  for  the  ratio  that  derives  from  the  extended  Cox 

Extended  Cox  Model  model.  The  most  important  feature  of  this  for¬ 
mula  is  that  the  proportional  hazards  assump- 
PH  assumption  is  not  satisfied  for  tion  is  no  longer  satisfied  when  using  the 
the  extended  Cox  model.  extended  Cox  model. 


HR{t)  = 


h(t,X(t)) 


=  exp 


+E8i 

j= i 


Two  sets  of  predictors: 

X*(0  =  (x*l,x*2,---,x;i,x*l(t)i 

X(t)  =  (XuX2,...,Xpi,X1(t), 
X2(t),...,XP2(t)) 


The  general  hazard  ratio  formula  for  the 
extended  Cox  model  is  shown  here.  This  for¬ 
mula  describes  the  ratio  of  hazards  at  a  partic¬ 
ular  time  t,  and  requires  the  specification  of 
two  sets  of  predictors  at  time  t.  These  two  sets 
are  denoted  as  bold  X*(f)  and  bold  X(f). 


The  two  sets  of  predictors,  X*(f)  and  X(f),  iden¬ 
tify  two  specifications  at  time  t  for  the  combined 
set  of  predictors  containing  both  time-indepen- 
dent  and  time-dependent  variables.  The  individ¬ 
ual  components  for  each  set  of  predictors  are 
shown  here. 
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EXAMPLE 


h(t,X(t))  =  /20(f)  exp[(lE  +  5 (E  x  f)] 


{1  if  exposed 
0  if  unexposed 


X*(f)  =  {E=  l,E  x  t  =  t) 
X(?)  =(£  =  0,£x/  =  0) 


HR{t)  = 


h(t,E  =  1) 


h(t,E  =  0) 

=  exp  [p(l  —  0)  +  5((1  xt)-(Oxt)) 


=  exp  [p  +  Sf| 

5o  =>HR(t)  f  as  ff 

PH  assumption  not  satisfied 


As  a  simple  example,  suppose  the  model  con¬ 
tains  only  one  time-independent  predictor, 
namely,  exposure  status  E,  a  (0,1)  variable, 
and  one  time-dependent  predictor,  namely,  E 
x  t.  Then,  to  compare  exposed  persons,  for 
whom  E  =  1,  with  unexposed  persons,  for 
whom  E  =  0,  at  time  t,  the  bold  X*(f)  set  of 
predictors  has  as  its  two  components  E  =  1 
and  E  x  t  =  f,  the  bold  X(f)  set  has  as  its  two 
components  E  =  0  and  E  x  t  =  0. 

If  we  now  calculate  the  estimated  hazard  ratio 
that  compares  exposed  to  unexposed  persons 
at  time  t,  we  obtain  the  formula  shown  here; 
that  is,  HR  "hat”  equals  the  exponential  of  () 
"hat”  plus  8  “hat”  times  t.  This  formula  says 
that  the  hazard  ratio  is  a  function  of  time;  in 
particular,  if  8  "hat”  is  positive,  then  the  hazard 
ratio  increases  with  increasing  time.  Thus,  the 
hazard  ratio  in  this  example  is  certainly  not 
constant,  so  that  the  PH  assumption  is  not 
satisfied  for  this  model. 


HR(t)  =  exp 


[*,*-*>] 


i=  1 


•  -V .  1  / 1 1 

;'= 1 


A  function  of  time 


More  generally,  because  the  general  hazard 
ratio  formula  involves  differences  in  the  values 
of  the  time-dependent  variables  at  time  t,  this 
hazard  ratio  is  a  function  of  time.  Thus,  in  gen¬ 
eral,  the  extended  Cox  model  does  not  satisfy 
the  PH  assumption  if  any  8 y  is  not  equal  to  zero. 


In  general,  PH  assumption  not 
satisfied  for  extended  Cox  model. 

8 i;-  is  not  time-dependent. 

8;-  represents  “overall”  effect  of 


Note  that,  in  the  hazard  ratio  formula,  the  coef¬ 
ficient  8 j  "hat”  of  the  difference  in  values  of  the 
;th  time-dependent  variable  is  itself  not  time- 
dependent.  Thus,  this  coefficient  represents 
the  “overall”  effect  of  the  corresponding  time- 
dependent  variable,  considering  all  times  at 
which  this  variable  has  been  measured  in  the 
study. 
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As  another  example  to  illustrate  the  formula 
for  the  hazard  ratio,  consider  an  extended  Cox 
model  containing  only  one  variable,  say  a 
weekly  measure  of  chemical  exposure  status 
at  time  t.  Suppose  this  variable,  denoted  as 
E(t),  can  take  one  of  two  values,  0  or  1 ,  depend¬ 
ing  on  whether  a  person  is  unexposed  or 
exposed,  respectively,  at  a  given  weekly 
measurement. 

As  defined,  the  variable  E{t)  can  take  on  differ¬ 
ent  patterns  of  values  for  different  subjects.  For 
example,  for  a  5-week  period,  subject  A’s  values 
may  be  0101 1,  whereas  subject  B’s  values  may 
be  11010. 

Note  that  in  this  example,  we  do  not  consider 
two  separate  groups  of  subjects,  with  one  group 
always  exposed  and  the  other  group  always 
unexposed  throughout  the  study.  This  latter  sit¬ 
uation  would  require  a  (0,1)  time-independent 
variable  for  exposure,  whereas  our  example 
involves  a  time-dependent  exposure  variable. 

The  extended  Cox  model  that  includes  only  the 
variable  E(t)  is  shown  here.  In  this  model,  the 
values  of  the  exposure  variable  may  change 
over  time  for  different  subjects,  but  there  is 
only  one  coefficient,  §,  corresponding  to  the 
one  variable  in  the  model.  Thus,  §  represents 
the  overall  effect  on  survival  time  of  the  time- 
dependent  variable  E(t). 

Notice,  also,  that  the  hazard  ratio  formula, 
which  compares  an  exposed  person  to  an  unex¬ 
posed  person  at  time  t,  yields  the  expression 
e  to  the  8  “hat.” 

Although  this  result  is  a  fixed  number,  the  PH 
assumption  is  not  satisfied.  The  fixed  number 
gives  the  hazard  ratio  at  a  given  time,  assuming 
that  the  exposure  status  at  that  time  is  1  in  the 
numerator  and  is  0  denominator.  Thus,  the 
hazard  ratio  is  time-dependent,  because  expo¬ 
sure  status  is  time-dependent,  even  though  the 
formula  yields  a  single  fixed  number. 


EXAMPLE 


E(t)  =  chemical  exposure  status  at 
time  t  (weekly) 

{0  if  unexposed  at  time  t 
1  if  exposed  at  time  t 


h(t,X(t))  =  h0(t)  exp[8£(f)] 


One  coefficient 
8  represents  the  overall  effect  of  £(f). 


m)=kt’m  =  1) 

h(t,E(t)  =  0) 

=  exp|^8[l  -  0]J 
=  e  ,  a  fixed  number 

But,  PH  is  not  satisfied: 

HR(t)  is  time-dependent  because  £(f) 
is  time-dependent. 
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VI.  Assessing  Time- 

Independent  Variables 
That  Do  Not  Satisfy 
the  PH  Assumption 


Use  an  extended  Cox  model  to 

•  check  PH  assumption; 

•  assess  effect  of  variable  not 
satisfying  PH  assumption. 


We  now  discuss  how  to  use  an  extended  Cox 
model  to  check  the  PH  assumption  for  time- 
independent  variables  and  to  assess  the  effect 
of  a  variable  that  does  not  satisfy  the  PH 
assumption. 


Three  methods  for  checking  PH 
assumption: 

1.  graphical _ 

2.  ( extended  Cox  model ) 

3.  GOF  test 


As  described  previously  (see  Chapter  4),  there 
are  three  methods  commonly  used  to  assess  the 
PH  assumption:  (1)  graphical,  using,  say, 
log-log  survival  curves;  (2)  using  an  extended 
Cox  model;  and  (3)  using  a  goodness-of-fit 
(GOF)  test.  We  have  previously  (in  Chapter  4) 
discussed  items  1  and  3,  but  only  briefly 
described  item  2,  which  we  focus  on  here. 


Cox  PH  model  for  p  time-indepen¬ 
dent  A’s: 


h(t,X) 


h0(t )  exp 


Ep^ 


If  the  dataset  for  our  study  contains  several, 
say  p,  time-independent  variables,  we  might 
wish  to  fit  a  Cox  PH  model  containing  each  of 
these  variables,  as  shown  here. 


Extended  Cox  model: 

Add  product  terms  of  the  form: 
Xi  x  gi(t) 


h(t,X(t))  =  h0(t)  exp 


EP'*'- 


+ E 

i=i 


However,  to  assess  whether  such  a  PH  model  is 
appropriate,  we  can  extend  this  model  by 
defining  several  product  terms  involving  each 
time-independent  variable  with  some  function 
of  time.  That  is,  if  the  zth  time-independent 
variable  is  denoted  as  A,-,  then  we  can  define 
the  zth  product  term  as  A,  x  g\(t)  where  g,(f)  is 
some  function  of  time  for  the  zth  variable. 

The  extended  Cox  model  that  simultaneously 
considers  all  time-independent  variables  of 
interest  is  shown  here. 
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In  using  this  extended  model,  the  crucial 
decision  is  the  form  that  the  functions  g,-(f) 
should  take.  The  simplest  form  for  g,(f)  is  that 
all  gi(t)  are  identically  0  at  any  time;  this  is 
another  way  of  stating  the  original  PH  model, 
containing  no  time-dependent  terms. 

Another  choice  for  theg,(t)  is  to  letg,(z)  =  t.  This 
implies  that  for  each  A,  in  the  model  as  a  main 
effect,  there  is  a  corresponding  time-dependent 
variable  in  the  model  of  the  form  A;  x  t.  The 
extended  Cox  model  in  this  case  takes  the  form 
shown  here. 

Suppose,  however,  we  wish  to  focus  on  a  par¬ 
ticular  time-independent  variable,  say,  variable 
XL.  Then  g,-(/)  =  /  for  i  =  L,  but  equals  0  for  all 
other  i.  The  corresponding  extended  Cox  model 
would  then  contain  only  one  product  term 
XL  x  t,  as  shown  here. 


Another  choice  for  the  g,-(f)  is  the  log  of  t,  rather 
than  simply  t,  so  that  the  corresponding  time- 
dependent  variables  will  be  of  the  form  X,  x  In  t. 


And  yet  another  choice  would  be  to  let  g,(t)  be  a 
"heaviside  function”  of  the  form  g,-(/)  =  1  when 
t  is  at  or  above  some  specified  time,  say  to,  and 
gi(t)  =  0  when  t  is  below  t0.  We  will  discuss  this 
choice  in  more  detail  shortly. 

Given  a  particular  choice  of  the  g,(t),  the 
corresponding  extended  Cox  model,  shown 
here  again  in  general  form,  may  then  be  used 
to  check  the  PH  assumption  for  the  time-inde¬ 
pendent  variables  in  the  model.  Also,  we  can 
use  this  extended  Cox  model  to  obtain  a  hazard 
ratio  formula  that  considers  the  effects  of  vari¬ 
ables  not  satisfying  the  PH  assumption. 

To  check  the  PH  assumption  using  a  statistical 
test,  we  consider  the  null  hypothesis  that  all  the 
8  terms,  which  are  coefficients  of  the  A ,g,(f) 
product  terms  in  the  model,  are  zero. 


Extended  Cox  model: 

p 

h(t,X(t))  =  h0(t)  exp  22  $ixi 

_i=  1 

+  22  ‘^'£'(0 
;=i 

•  Check  PH  assumption. 

•  Obtain  hazard  ratio  when  PH 
assumption  not  satisfied. 

Ho-  8i  =  S2  =  . . .  =  8P  =  0 


EXAMPLE 


gi(t)  =  0  for  all  i  implies  no  time- 
dependent  variable  involving  A,-,  i.e., 


h(t,  X(/))  =  MO  exp  22  PA 


EXAMPLE  2 


gi(t)  =  t  =>  X ig(t)  =  Xi  x  t 


P  P 

/i(f,X(f))  =  A0(f)exp  +  x  t) 

.1=1  Ml 


EXAMPLE  3:  one  variable  at  a  time 


A  only  = 


8L{t)  =  t, 

gi(t)  =  0  for  other  i 


MX(f))  =  M0exp  21  PiX+5L(XLxf) 
i=l 
.ML 


EXAMPLE  4 


gi(t)  =  In  t  =>  Xig(t )  =  Xt  xln  t 

h(t,X{t))  =/zq(0  exp  MPA  +  ^SgA-xlnr) 
.Ml  Ml 


EXAMPLE  5:  Heaviside  function 


,  ,  f  0  if  t  >  to 
g/(0  - 1 1  ifr><?0 
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Under  H0,  the  model  reduces  to  PH  Under  this  null  hypothesis,  the  model  reduces 
model:  to  the  PH  model. 


h(t,  X)  =  ho(t)  exp 


Ep  ix‘ 


LR  —  2  In  /•  l  ■  1 1  model 

—  (  —  -  ln  hext  Cox  model) 

~  X2p  under  H0 


This  test  can  be  carried  out  using  a  likelihood 
ratio  {LR)  test  which  computes  the  difference 
between  the  log  likelihood  statistic,  —2  ln  L,  for 
the  PH  model  and  the  log  likelihood  statistic  for 
the  extended  Cox  model.  The  test  statistic  thus 
obtained  has  approximately  a  chi-square  distri¬ 
bution  with  p  degrees  of  freedom  under  the  null 
hypothesis,  where  p  denotes  the  number  of 
parameters  being  set  equal  to  zero  under  H0. 


EXAMPLE 


h(t,X{t))  =  /!0(f)exp[p£  +  S (E  x  t)] 

H0-.  S  =  0  (i.e. ,  PH  assumption  is 
satisfied) 

Reduced  model: 
h(t,X)  =  /i0(f)exp[p£] 

LR  =  -2  ln  Lr-  (-2  ln  LP) 

~  X2  with  1  df  under  Hq 

F  =  full  (extended),  R  =  reduced  (PH) 


As  an  example  of  this  test,  suppose  we  again 
consider  an  extended  Cox  model  that  contains 
the  product  term  Ext  in  addition  to  the  main 
effect  of£,  where  E  denotes  a  (0,1)  time-inde¬ 
pendent  exposure  variable. 

For  this  model,  a  test  for  whether  or  not  the  PH 
assumption  is  satisfied  is  equivalent  to  testing 
the  null  hypothesis  that  8  =  0.  Under  this 
hypothesis,  the  reduced  model  is  given  by  the 
PH  model  containing  the  main  effect  E  only. 
The  likelihood  ratio  statistic,  shown  here  as  the 
difference  between  log-likelihood  statistics  for 
the  full  (i.e.,  extended  model)  and  the  reduced 
(i.e.,  PH)  model,  will  have  an  approximate  chi- 
square  distribution  with  one  degree  of  freedom 
in  large  samples. 


SAS:  PHREG  fits  both  PH  and 
extended  Cox  models. 
Stata:  Stcox  fits  both  PH  and 
extended  Cox  models. 


If  PH  test  significant:  Extended 
Cox  model  is  preferred;  HR  is 
time-dependent. 


Note  that  to  cany  out  the  computations  for  this 
test,  two  different  types  of  models,  a  PH  model 
and  an  extended  Cox  model,  need  to  be  fit. 
Nevertheless,  some  computer  packages  such 
as  SAS  and  Stata  use  the  same  computer  pro¬ 
gram  to  fit  both  models. 

If  the  result  of  the  test  for  the  PH  assumption  is 
significant,  then  the  extended  Cox  model  is  pre¬ 
ferred  to  the  PH  model.  Thus,  the  hazard  ratio 
expression  obtained  for  the  effect  of  an  expo¬ 
sure  variable  of  interest  is  time-dependent.  That 
is,  the  effect  of  the  exposure  on  the  outcome 
cannot  be  summarized  by  a  single  HR  value, 
but  can  only  be  expressed  as  a  function  of  time. 


Presentation:  VI.  Assessing  Time-Independent  Variables  257 


We  again  consider  the  previous  example,  with 
the  extended  Cox  model  shown  here.  For  this 
model,  the  estimated  hazard  ratio  for  the  effect 
of  exposure  is  given  by  the  expression  e  to  the 
quantity  P  “hat”  plus  5  “hat”  times  t.  Thus, 
depending  on  whether  §  "hat”  is  positive  or 
negative,  the  estimated  hazard  ratio  will 
increase  or  decrease  exponentially  as 
t  increases.  The  graph  shown  here  gives  a 
sketch  of  how  the  hazard  ratio  varies  with 
time  if  8  “hat”  is  positive. 


We  now  provide  a  description  of  the  use  of  a 
"heaviside”  function.  When  such  a  function  is 
used,  the  hazard  ratio  formula  yields  constant 
hazard  ratios  for  different  time  intervals,  as 
illustrated  in  the  accompanying  graph. 

Recall  that  a  heaviside  function  is  of  the  form  g 
( t ),  which  takes  on  the  value  1  if  t  is  greater 
than  or  equal  to  some  specified  value  of  t, 
called  t0,  and  takes  on  the  value  0  if  t  is  less 
than  t0.  An  extended  Cox  model  which  contains 
a  single  heaviside  function  is  shown  here. 

t  >  t0:  g(t )  =  1  =>  E  x  g(t)  =  E  Note  that  if  t  >  t0,g{t)  =  1,  so  the  value  of£  x  g(t) 

h(t,  X)  =  h0(t)exp[(p  +  8)Zi]  =  E\  the  corresponding  hazard  function  is  of  the 

_  r„  form  hoi/)  x  e  to  the  quantity  (p  +  8)  times  E,  and 

HR  —  exp  p  +  8  the  estimated  hazard  ratio  for  the  effect  of  E  has 

the  form  e  to  the  sum  of  p  "hat”  plus  8  “hat.” 

t  <  to-  g(t )  =  0  =>■  E  x  g(t)  =  0  If  t  <  to,  g(t)  =  0,  the  corresponding  hazard  ratio 

h(t,X)  =  /z0(t)exp[p,E]  is  simplified  to  e  to  the  p  "hat.” 

HR  =  exp  (3 

A  single  heaviside  function  in  the  Thus,  we  have  shown  that  the  use  of  a  single 
model  heaviside  function  results  in  an  extended  Cox 

h(t,X)  model  which  gives  two  hazard  ratio  values, 

=  h0(t)  exp[pC  +  8(E  x  g(f))]  each  value  being  constant  over  a  fixed  time 

interval. 

yields  two  hazard  ratios: 
t  >  t0  :  HR  =  exp  p  +  8 
t  <  to  :  HR  =  exp  p 


EXAMPLE 


h(t,X.(t))  =  h0(t)exp[fiE  +  5 (E  x  t) \ 
HR  =  exp  +  5rJ 
HR  I 
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Alternative  model  with  two  heavi- 
side  functions: 

Ht,X)  =  h0(t)exp[3i(E  x  g^t)) 

+  82(E  x  g2(t))] 

if  t  >  to 
if  t  <  to 
if  t  <  to 
if  t  >  to 


*.«)  =  {' 
eW  =  {‘ 


There  is  actually  an  equivalent  way  to  write  this 
model  that  uses  two  heaviside  functions  in  the 
same  model.  This  alternative  model  is  shown 
here.  The  two  heaviside  functions  are  called 
gjft)  and  g2(t).  Each  of  these  functions  are  in 
the  model  as  part  of  a  product  term  with  the 
exposure  variable  E.  Note  that  this  model  does 
not  contain  a  main  effect  term  for  exposure. 


Note:  Main  effect  for  £  not  in  model. 


Two  HR’s  from  the  alternative 
model: 

t  >  to  ■  gi{t)  =  1,  gi{t)  =  0 
/i(f,X)  =  ho(t)  exp[8i(£  x  1) 

+82(£  x  0)] 

=  /*oO)exp[8i£] 

so  that//!?  =  expfSi') 


For  this  alternative  model,  as  for  the  earlier 
model  with  only  one  heaviside  function,  two 
different  hazard  ratios  are  obtained  for  differ¬ 
ent  time  intervals.  To  obtain  the  first  hazard 
ratio,  we  consider  the  form  that  the  model 
takes  when  t  >  t0.  In  this  case,  the  value  of 
gi(t)  is  1  and  the  value  of  g2(t)  is  0,  so  the  expo¬ 
nential  part  of  the  model  simplifies  to  §!  x  E; 
the  corresponding  formula  for  the  estimated 
hazard  ratio  is  then  e  to  the  8,  "hat.” 


t  >  to  ■  gi(t)  =  0,  g2(t)  =  1 
h(t,  X)  =  /*o(f)exp[8i  (E  x  0) 

+  82  (£x  1)] 
=  hQ(t)exp[82E] 

so  that  HR  =  exp  ^82^ 


When  t  <  t0,  the  value  of  gi(t)  is  0  and  the  value 
of  g2{t)  is  1 .  Then,  the  exponential  part  of  the 
model  becomes  82  x  E,  and  the  corresponding 
hazard  ratio  formula  is  e  to  the  82  “hat.” 


Alternative  model: 

h(t,X(t ))  =  h0(t)exp[8i(E  x  gr(t)) 

+  82(E  x  g20))] 

Original  model: 
h(t,X(t)) 

=  h0(t)exp[fiE  +  8 (E  x  g(t))] 
t  >  t0  :  HR  =  exp  ^81  ^  =  exp  +  8^ 
t  <  to  :  HR  =  exp  ^82^)  =  exp  ^ 


Thus,  using  the  alternative  model,  again  shown 
here,  we  obtain  two  distinct  hazard  ratio 
values.  Mathematically,  these  are  the  same 
values  as  obtained  from  the  original  model 
containing  only  one  heaviside  function.  In 
other  words,  81  “hat"  in  the  alternative  model 
equals  (3  "hat”  plus  8  “hat”  in  the  original  model 
(containing  one  heaviside  function),  and 
82  "hat”  in  the  alternative  model  equals  p 
“hat”  in  the  original  model. 
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Heaviside  functions: 

•  two  HR’s  constant  within  two 
time  intervals 

•  extension :  several  lIR’s  constant 
within  several  time  intervals 


We  have  thus  seen  that  heaviside  functions  can 
be  used  to  provide  estimated  hazard  ratios  that 
remain  constant  within  each  of  two  separate 
time  intervals  of  follow-up.  We  can  also  extend 
the  use  of  heaviside  functions  to  provide  sev¬ 
eral  distinct  hazard  ratios  that  remain  constant 
within  several  time  intervals. 


Four  time  intervals: 
HR  I 


0  .5  1.0  1.5  1  (years) 

Extended  Cox  model  contains  either 

•  E,  E  x  gjft),  E  x  g2(f),  E  x  g3(t) 
or 

•  Ex  E  x  g2(t),  E  x  g3(t), 

E  x  g4(t) 

I  H  2  I  3  I  4 
0  0.5  1.0  1.5  /(years) 


Suppose,  for  instance,  that  we  wish  to  separate 
the  data  into  four  separate  time  intervals,  and 
for  each  interval,  we  wish  to  obtain  a  different 
hazard  ratio  estimate  as  illustrated  in  the 
graph  shown  here. 


We  can  obtain  four  different  hazard  ratios  using 
an  extended  Cox  model  containing  a  main  effect 
of  exposure  and  three  heaviside  functions  in  the 
model  as  products  with  exposure.  Or,  we  can 
use  a  model  containing  no  main  effect  exposure 
term,  but  with  product  terms  involving  expo¬ 
sure  with  four  heaviside  functions. 


To  illustrate  the  latter  model,  suppose,  as 
shown  on  the  graph,  that  the  first  time  interval 
goes  from  time  0  to  0.5  of  a  year;  the  second 
time  interval  goes  from  0.5  to  1  year;  the  third 
time  interval  goes  from  1  year  to  a  year  and  a 
half;  and  the  fourth  time  interval  goes  from  a 
year  and  a  half  onward. 


Ht,X(t)) 

=  hofOexpfSiEgjft)  +  82£g2(f) 
+  83£g3(f)  +  84 Eg4(t)] 


where 


8i(t) 

g2(t) 

83(t) 

g4(f 


if  0  <  t  <  0.5  year 
if  otherwise 

if  0.5  year  <  t  <  1.0  year 
if  otherwise 

if  1.0  year  <  t  <  1.5  years 
if  otherwise 
if  t  >  1.5  years 
if  otherwise 


Then,  an  appropriate  extended  Cox  model 
containing  the  four  heaviside  functions  gift), 
g2(f),  g3(f),  and  g4(f)  is  shown  here.  This  model 
assumes  that  there  are  four  different  hazard 
ratios  identified  by  three  cutpoints  at  half  a 
year,  one  year,  and  one  and  a  half  years.  The 
formulae  for  the  four  hazard  ratios  are  given  by 
separately  exponentiating  each  of  the  four  esti¬ 
mated  coefficients,  as  shown  below: 


4  HR's  < 


0  <  /  <  0.5  :  HR  =  ex p(<5j) 
0.5  <  t  <  1.0  :  HR  =  exp  ^<52^) 
1.0  <  t  <  1.5  :  HR  =  exp  ^<53^ 
t  >  1.5  :  HR  =  exp^<54^) 
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VII.  An  Application  of  the 
Extended  Cox  Model 
to  An  Epidemiologic 
Study  on  the 
Treatment  of  Heroin 
Addiction 


EXAMPLE 


1991  Australian  study  (Caplehom 
et  al.)  of  heroin  addicts 

•  two  methadone  treatment  clinics 

•  T  =  days  remaining  in  treatment 
(=  days  until  drop  out  of  clinic) 

•  clinics  differ  in  treatment  policies 

Dataset  name:  ADDICTS 
Column  1:  Subject  ID 
Column  2:  Clinic  (1  or  2) 

Column  3:  Survival  status  (0  = 
censored,  1  =  departed  clinic) 
Column  4:  Survival  time  in  days 

Column  5:  Prison  Record 

(0  =  none,  1  =  any)  J> covanates 

Column  6:  Maximum  Methadone 
Dose  (mg/day) 


h(l,X)  =  h0(t)  expfPdclinic) 

+  (^(prison)  +  p3(dose)] 

Std.  Haz.  P 

Coef.  En\  p  >  |z|  Ratio  (PH) 

Clinic  -1.009  0.215  0.000  0.365  0.001 

Prison  0.327  0.167  0.051  1.386  0.332 

Dose  -0.035  0.006  0.000  0.965  0.347 


P{PH)  for  the  variables  prison  and 
dose  are  nonsignificant  =>  remain  in 
model. 


A  1991  Australian  study  by  Caplehorn  et  al., 
compared  retention  in  two  methadone  treat¬ 
ment  clinics  for  heroin  addicts.  A  patient’s  sur¬ 
vival  time  (T)  was  determined  as  the  time  in 
days  until  the  patient  dropped  out  of  the  clinic 
or  was  censored  at  the  end  of  the  study  clinic. 
The  two  clinics  differed  according  to  their  over¬ 
all  treatment  policies. 


A  listing  of  some  of  the  variables  in  the  dataset 
for  this  study  is  shown  here.  The  dataset  name 
is  called  "ADDICTS,”  and  survival  analysis  pro¬ 
grams  in  the  Stata  package  are  used  in  the 
analysis.  Note  that  the  survival  time  variable 
is  listed  in  column  4  and  the  survival  status 
variable,  which  indicates  whether  a  patient 
departed  from  the  clinic  or  was  censored,  is 
listed  in  column  3.  The  primary  exposure  vari¬ 
able  of  interest  is  the  clinic  variable,  which  is 
coded  as  1  or  2.  Two  other  variables  of  interest 
are  prison  record  status,  listed  in  column  5  and 
coded  as  0  if  none  and  1  if  any,  and  maximum 
methadone  dose,  in  milligrams  per  day,  which 
is  listed  in  column  6.  These  latter  two  variables 
are  considered  as  covariates. 


One  of  the  first  models  considered  in  the 
analysis  of  the  addicts  dataset  was  a  Cox  PH 
model  containing  the  three  variables,  clinic, 
prison  record,  and  dose.  An  edited  printout  of 
the  results  for  this  model  is  shown  here.  What 
stands  out  from  this  printout  is  that  the  P(PH ) 
value  for  the  clinic  variable  is  zero  to  three 
significant  places,  which  indicates  that  the 
clinic  variable  does  not  satisfy  the  proportional 
hazard  assumption. 

Since  the  P(PH )  values  for  the  other  two  vari¬ 
ables  in  the  model  are  highly  nonsignificant, 
this  suggests  that  these  two  variables,  namely, 
prison  and  dose,  can  remain  in  the  model. 
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EXAMPLE:  (continued) 


Adjusted  Survival  Curves 


Results: 

•  Curve  for  clinic  2  consistently  lies 
above  curve  for  clinic  1 . 

•  Curves  diverge,  with  clinic 

2  being  vastly  superior  after  one 
year. 

Stratifying  by  clinic:  cannot  obtain 
hazard  ratio  for  clinic 

Hazard  ratio  for  clinic  requires  clinic 
in  the  model. 


Extended  Cox  model: 
h(t,X(t))  =  /20(f)exp[Pi(clinic) 

+  p2(prison)  +  p3(dose) 
+  S(clinic)  g(f)] 

where 


2«  =  {J 


if  t  >  365  days 
if  t  <  365  days 


clinic  = 


1  if  clinic  1 
0  if  clinic  2 


None 

Previously 
clinic  =  2  for 
clinic  2 


t  >  365  days:  HR  =  exp^p,  +  sj 
t  <  365  days:  HR  =  exp^p,  j 


Further  evidence  of  the  PH  assumption  not 
being  satisfied  for  the  clinic  variable  can  be 
seen  from  a  graph  of  adjusted  survival  curves 
stratified  by  clinic,  where  the  prison  and  dose 
variables  have  been  kept  in  the  model.  Notice 
that  the  two  curves  are  much  closer  together  at 
earlier  times,  roughly  less  than  1  year  (i.e.,  365 
days),  but  the  two  curves  diverge  greatly  after 
1  year.  This  indicates  that  the  hazard  ratio  for 
the  clinic  variable  will  be  much  closer  to  one  at 
early  times  but  quite  different  from  one  later  on. 


The  above  graph,  nevertheless,  provides  impor¬ 
tant  results  regarding  the  comparison  of  the 
two  clinics.  The  curve  for  clinic  2  consistently 
lies  above  the  curve  for  clinic  1 ,  indicating  that 
clinic  2  does  better  than  clinic  1  in  retaining  its 
patients  in  methadone  treatment.  Further, 
because  the  two  curves  diverge  after  about  a 
year,  it  appears  that  clinic  2  is  vastly  superior  to 
clinic  1  after  one  year  but  only  slightly  better 
than  clinic  1  prior  to  one  year. 


Unfortunately,  because  the  clinic  variable  has 
been  stratified  in  the  analysis,  we  cannot  use 
this  analysis  to  obtain  a  hazard  ratio  expres¬ 
sion  for  the  effect  of  clinic,  adjusted  for  the 
effects  of  prison  and  dose.  We  can  only  obtain 
such  an  expression  for  the  hazard  ratio  if  the 
clinic  variable  is  in  the  model. 

Nevertheless,  we  can  obtain  a  hazard  ratio  using 
an  alternative  analysis  with  an  extended  Cox 
model  that  contains  a  heaviside  function,  g(t), 
together  with  the  clinic  variable,  as  shown  here. 
Based  on  the  graphical  results  shown  earlier,  a 
logical  choice  for  the  cutpoint  of  the  heaviside 
function  is  one  year  (i.e.,  365  days).  The  corres¬ 
ponding  model  then  provides  two  hazard  ratios: 
one  that  is  constant  above  365  days  and  the 
other  that  is  constant  below  365  days. 

Note  that  in  the  extended  Cox  model  here,  we 
have  coded  the  clinic  variable  as  1  if  clinic  1 
and  0  if  clinic  2,  whereas  previously  we  had 
coded  clinic  2  as  2.  The  reason  for  this  change 
in  coding,  as  illustrated  by  computer  output 
below,  is  to  obtain  hazard  ratio  estimates  that 
are  greater  than  unity. 
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EXAMPLE:  (continued) 


h(t,X.(t))  =  /20(f)  exp[p2(prison) 

+  p3(dose)  +  Pi(clinic)gi(f) 
+  52(clinic)g2(f)] 


where 
SiW  = 

and 

82{t)  = 


1  if  t  >  365  days 
0  if  t  >  365  days 

1  if  t  >  365  days 
0  if  t  >  365  days 


t  <  365  days:  HR  =  exp(Si) 
t  >  365  days:  HR  =  exp(S2) 


Std.  Haz.  [95%  Conf. 

Coef.  Err.  p  >  |z|  Ratio  Interval] 


Prison 
Dose 

Clinic  x  gi  0.460\0.255  (0! 
Clinic  xg2  1. 828  v  0^86  [o. 


0.378  0.168  0.025 

0.036  0.006  0.000 

0.460N  0.255  (0.072 

1.828.  0^86  lo.OQQ 


t  <  365  days:  HR  =  >*id60 
t  >  365  days:  [HR  =  e^828 


1.459  1.049 

0.965  0.953 

1.583]  0.960 
6.2231  2.921 


-  1.583 
=  6.223 


2.029 

0.977 

2.611 

13.259 


95%  confidence  intervals  for  clinic 
effect: 

t  <  365  days:  (0.960,  2.61 1) 


HR 


An  equivalent  way  to  write  the  model  is  to  use 
two  heaviside  functions,  gi(t)  and  g2(t),  as 
shown  here.  This  latter  model  contains  product 
terms  involving  clinic  with  each  heaviside 
function,  and  there  is  no  main  effect  of  clinic. 

Corresponding  to  the  above  model,  the  effect  of 
clinic  is  described  by  two  hazard  ratios,  one  for 
time  less  than  365  days  and  the  other  for 
greater  than  365  days.  These  hazard  ratios  are 
obtained  by  separately  exponentiating  the 
coefficients  of  each  product  term,  yielding  e  to 
the  o  i  "hat”  and  e  to  the  §2  “hat,”  respectively. 

A  printout  of  results  using  the  above  model 
with  two  heaviside  functions  is  provided  here. 
The  results  show  a  borderline  nonsignificant 
hazard  ratio  (P  =  0.072)  of  1.6  for  the  effect  of 
clinic  when  time  is  less  than  365  days  in  con¬ 
trast  to  a  highly  significant  (P  =  0.000  to  three 
decimal  places)  hazard  ratio  of  6.2  when  time 
exceeds  365  days. 

Note  that  the  estimated  hazard  ratio  of  1.583 
from  the  printout  is  computed  by  exponentiat¬ 
ing  the  estimated  coefficient  0.460  of  the  prod¬ 
uct  term  “clinic  x  gx”  and  that  the  estimated 
hazard  ratio  of  6.223  is  computed  by  exponen¬ 
tiating  the  estimated  coefficient  1.828  of  the 
product  term  “clinic  x  g2”. 

Note  also  that  the  95%  confidence  interval 
for  the  clinic  effect  prior  to  365  days  (that  is, 
for  the  product  term  "clinic  x  gx(t)")  is  given 
by  the  limits  0.960  and  2.611,  whereas  the 
corresponding  confidence  interval  after  365 
days  (that  is,  for  the  product  term  "clinic  x 
g2(t)”)  is  given  by  the  limits  2.921  and  13.259. 
The  latter  interval  is  quite  wide,  showing  a  lack 
of  precision  when  t  exceeds  365  days;  however, 
when  t  precedes  365  days,  the  interval  includes 
the  null  hazard  ratio  of  1,  suggesting  a  chance 
effect  for  this  time  period. 

The  results  we  have  just  shown  support  the 
observations  obtained  from  the  graph  of 
adjusted  survival  curves.  That  is,  these  results 
suggest  a  large  difference  in  clinic  survival 
times  after  1  year  in  contrast  to  a  small  difference 
in  clinic  survival  times  prior  to  1  year,  with  clinic 
2  always  doing  better  than  clinic  1  at  any  time. 
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EXAMPLE:  (continued) 


One  other  analysis: 

Use  an  extended  Cox  model  that 
provides  for  diverging  survival  curves 


h(t,X(t))  =  /20(f)exp[Pi(clinic) 

+  p2(prison)  +  p3(dose) 
+  S(clinic  x  t)] 


HR  =  exp^jh  +  Stj 


HR  changes  over  time. 
t  =  91  days 

h(t,X(t))  =  /20(t)exp[p  ^clinic) 

+  p2(prison)  +  p3  (dose) 
+  S(clinic)(91)] 


HR  =  exp^P!  +  918^ 
t  =  274: 

h(t,X(t))  =  /z0(f)exp[Pi(clinic) 

+  p2(prison)  +  p3(dose) 
+  S(clinic)(274)] 


HR  =  exp^Pj  +  2748) 
t  =  458.5: 


HR 


=  exp^Pj  +  458.58^ 

t  =  639: 

HR  =  exp  (Pj  +  6398) 
t  =  821.5: 

HR  =  exp(p,  +  821. 58 j 


There  is,  nevertheless,  at  least  one  other 
approach  to  the  analysis  using  time-dependent 
variables  that  we  now  describe.  This  approach 
considers  our  earlier  graphical  observation 
that  the  survival  curves  for  each  clinic  continue 
to  diverge  from  one  another  even  after  1  year. 
In  other  words,  it  is  reasonable  to  consider  an 
extended  Cox  model  that  allows  for  such  a 
divergence,  rather  than  a  model  that  assumes 
the  hazard  ratios  are  constant  before  and  after 
1  year. 

One  way  to  define  an  extended  Cox  model  that 
provides  for  diverging  survival  curves  is  shown 
here.  This  model  includes,  in  addition  to  the 
clinic  variable  by  itself,  a  time-dependent  vari¬ 
able  defined  as  the  product  of  the  clinic  vari¬ 
able  with  time  (i.e.,  clinic  x  /).  By  including  this 
product  term,  we  are  able  to  estimate  the  effect 
of  clinic  on  survival  time,  and  thus  the  hazard 
ratio,  for  any  specified  time  t. 

To  demonstrate  how  the  hazard  ratio  changes 
over  time  for  this  model,  we  consider  what  the 
model  and  corresponding  estimated  hazard 
ratio  expression  are  for  different  specified 
values  of  t. 

For  example,  if  we  are  interested  in  the  effect  of 
clinic  on  survival  on  day  91,  so  that  t  =  91,  the 
exponential  part  of  the  model  simplifies  to 
terms  for  the  prison  and  dose  variables  plus 
Pi  times  the  clinic  variable  plus  5  times  the 
clinic  variable  times  9 1 :  the  corresponding  esti¬ 
mated  hazard  ratio  for  the  clinic  effect  is  then 
e  to  the  power  P!  “hat”  plus  8  "hat”  times  t  =  91. 

At  274  days,  the  exponential  part  of  the  model 
contains  the  prison,  dose,  and  clinic  main 
effect  terms  as  before,  plus  8  times  the  clinic 
variable  times  274:  the  corresponding  hazard 
ratio  for  the  clinic  effect  is  then  e  to  fn  “hat” 
plus  274  8  “hat”. 

The  formulae  for  the  estimated  hazard  ratio  for 
other  specified  days  are  shown  here.  Notice 
that  the  estimated  hazard  ratio  appears  to  be 
increase  over  the  length  of  the  follow-up  period. 
Thus,  if  8  "hat”  is  a  positive  number,  then  the 
estimated  hazard  ratios  will  increase  over  time. 
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EXAMPLE:  (continued) 


Computer  results  for  extended  Cox 
model  involving  T(t ): 


Std.  Haz.  [95%  Conf. 

Coef.  Err.  P>|z|  Ratio  Interval] 

0.390  0.169  0.021  1.476  1.060  2.056 


dose  -0.035  0.006  0.000 

clinic  ^-0,0183)  0.347  0.958 

clinic  x  t  (  0,003)  0.001  0.001 

0.965 

0.982 

( 1.003  ) 

0.953  0.978 

0.497  1.939 

1.001  1.005 

cov^Pl55^  =  —.000259  Log  likelihood  = 

-667.642 

Pj  =  -0.0183  8  =  0.003 

HR  depends  on  P[  and  8  . 

t  =  91.5:  HR  =  exp^fi 

i  +  § 

=  1.292 

t  =  274:  HR  =  exp(fi 

i  + 

=  2.233 

t  =  458.5:  HR  =  exp^P 

i  + 

=  3.862 

f  =  639:  HR  =  exp^P 

l  + 

=  6.677 

t  =  821.5:  HR  =  exp((3 

i  + 

=  11.544 

exp 


pj  +  5t  ±  1.96W var  ^  +  btj 


Var  f (3j  +  6t^  =  Spt  +  ds?  +  2 1  cov^Pj ,  8 

t‘  A 


(0.347)2  (0.001)2  (-.000259) 


Time  (days) 

HR 

95%  Cl 

91.5 

1.292 

(0.741,  2.250) 

274 

2.233 

(1.470,  3.391) 

458.5 

3.862 

(2.298,  6.491) 

639 

6.677 

(3.102,  14.372) 

821.5 

11.544 

(3.976,  33.513) 

We  now  show  edited  results  obtained  from 
fitting  the  extended  Cox  model  we  have  just 
been  describing,  which  contains  the  product 
of  clinic  with  time.  The  covariance  estimate 
shown  at  the  bottom  of  the  table  will  be  used 
below  to  compute  confidence  intervals. 


From  these  results,  the  estimated  coefficient  of 
the  clinic  variable  is  Pi  “hat”  equals  -0.0183, 
and  the  estimated  coefficient  8  “hat”  obtained 
for  the  product  term  equals  0.003.  For  the 
model  being  fit,  the  hazard  ratio  depends  on 
the  values  of  both  Pi  "hat”  and  8  “hat.” 

On  the  left,  the  effect  of  the  variable  clinic  is 
described  by  five  increasing  hazard  ratio  esti¬ 
mates  corresponding  to  each  of  five  different 
values  of  t.  These  values,  which  range  between 
1.292  at  91.5  days  to  1 1.544  at  821.5  days,  indi¬ 
cate  how  the  effect  of  clinic  diverges  over  time 
for  the  fitted  model. 


We  can  also  obtain  95%  confidence  intervals 
for  each  of  these  hazard  ratios  using  the  large 
sample  formula  shown  here.  The  variance 
expression  in  the  formula  is  computed  using 
the  variances  and  covariances  which  can  be 
obtained  from  the  computer  results  given 
above.  In  particular,  the  variances  are  (0.347)2 
and  (0.001)2  for  Pi  “hat"  and  8  “hat,"  respec¬ 
tively;  the  covariance  value  is  —0.000259. 


A  table  showing  the  estimated  hazard  ratios 
and  their  corresponding  95%  confidence  inter¬ 
vals  for  the  clinic  effect  is  given  here.  Note  that 
all  confidence  intervals  are  quite  wide. 
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VIII.  An  Application  of  the 
Extended  Cox  Model 
to  the  Analysis  of  the 
Stanford  Heart 
Transplant  Data 

We  now  consider  another  application  of  the 
extended  Cox  model  which  involves  the  use  of 
an  internally  defined  time-dependent  variable. 
In  a  1977  report  (Crowley  and  Hu,  J.  Amer.  Stat¬ 
ist.  Assoc.)  on  the  Stanford  Heart  Transplant 
Study,  patients  identified  as  being  eligible  for  a 
heart  transplant  were  followed  until  death  or 
censorship.  Sixty-five  of  these  patients  received 
transplants  at  some  point  during  follow-up, 
whereas  thirty-eight  patients  did  not  receive  a 
transplant.  There  were,  thus,  a  total  of  n  =  103 
patients.  The  goal  of  the  study  was  to  assess 
whether  patients  receiving  transplants  survived 
longer  than  patients  not  receiving  transplants. 

One  approach  to  the  analysis  of  these  data  was 
to  separate  the  dataset  into  two  separate  groups, 
namely,  the  65  heart  transplant  patients  and  the 
38  patients  not  receiving  transplants,  and  then 
to  compare  survival  times  for  these  groups. 

A  problem  with  this  approach,  however,  is  that 
those  patients  who  received  transplants  had  to 
wait  from  the  time  they  were  identified  as  eligi¬ 
ble  for  a  transplant  until  a  suitable  transplant 
donor  was  found.  During  this  “wait-time” 
period,  they  were  at  risk  for  dying,  yet  they  did 
not  have  the  transplant.  Thus,  the  wait-time 
accrued  by  transplant  patients  contributes 
information  about  the  survival  of  nontrans¬ 
plant  patients.  Yet,  this  wait-time  information 
would  be  ignored  if  the  total  survival  time  for 
each  patient  were  used  in  the  analysis. 

Another  problem  with  this  approach  is  that  two 
covariates  of  interest,  namely,  tissue  mismatch 
score  and  age  at  transplant,  were  considered  as 
prognostic  indicators  of  survival  only  for 
patients  who  received  transplants.  Note  that 
age  at  eligibility  was  not  considered  an  important 
prognostic  factor  for  the  nontransplant  group. 


EXAMPLE 


Patients  identified  as  eligible  for  heart 
transplant: 

T  =  time  until  death  or  censorship 
65  patients  receive  transplants 
38  patients  do  not  receive  transplants 
n  =  103  patients 

Goal:  Do  patients  receiving 
transplants  survive  longer  than 
patients  not  receiving  transplants? 


One  approach: 

Compare  two  separate  groups:  65 
transplants  vs.  38  nontransplants 


Problem: 


Wait-time 


Eligibility 


Censored 
or  death 
- ¥ - 


Received 
transplant 
Total  survival  time  — > 


Time— > 


Note:  Wait-time  contributes  to 
survival  time  for  nontransplants. 


Covariates: 

Tissue  mismatch  score 
Age  at  transplant 


prognostic  only 
for  transplants 


Age  at  eligibility:  not  considered 
prognostic  for  nontransplants 
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EXAMPLE:  (continued) 


Problems: 

•  wait-time  of  transplant  recipients 

•  prognostic  factors  for  transplants 
only 

Alternative  approach: 

Uses  an  extended  Cox  model 


Exposure  variable: 

Heart  transplant  status  at  time  t, 
defined  as 


HT(t) 


0  if  did  not  receive  transplant 
by  time  I ,  i.e.,  if  t  <  wait-time 
1  if  received  transplant  prior 
to  time  f,  i.e.,  if  t  >  wait-time 


No  transplant  fIT(t) 


Time  of  transplant 


Wait-time  for  transplants  contributes 
to  survival  for  nontransplants. 


In  addition  to  HT(t),  two  time- 
dependent  covariates  included  in 
model. 


Because  of  the  problems  just  described,  which 
concern  the  wait-time  of  transplants  and  the 
effects  of  prognostic  factors  attributable  to 
transplants  only,  an  alternative  approach  to 
the  analysis  is  recommended.  This  alternative 
involves  the  use  of  time-dependent  variables  in 
an  extended  Cox  model. 


The  exposure  variable  of  interest  in  this 
extended  Cox  model  is  heart  transplant  status 
at  time  t,  denoted  by  HT(t).  This  variable  is 
defined  to  take  on  the  value  0  at  time  t  if  the 
patient  has  not  received  a  transplant  at  this 
time,  that  is,  if  t  is  less  than  the  wait-time  for 
receiving  a  transplant.  The  value  of  this  vari¬ 
able  is  1  at  time  t  if  the  patient  has  received  a 
transplant  prior  to  or  at  time  t,  that  is,  if  t  is 
equal  to  or  greater  than  the  wait-time. 


Thus,  for  a  patient  who  did  not  receive  a  trans¬ 
plant  during  the  study,  the  value  of  HT(t)  is  0  at 
all  times.  For  a  patient  receiving  a  transplant, 
the  value  of  HT{t)  is  0  at  the  start  of  eligibility 
and  continues  to  be  0  until  the  time  at  which 
the  patient  receives  the  transplant;  then,  the 
value  of  HT(t)  changes  to  1  and  remains  1 
throughout  the  remainder  of  follow-up. 


Note  that  the  variable  HT(t)  has  the  property 
that  the  wait-time  for  transplant  patients  con¬ 
tributes  to  the  survival  experience  of  nontrans¬ 
plant  patients.  In  other  words,  this  variable 
treats  a  transplant  patient  as  a  nontransplant 
patient  prior  to  receiving  the  transplant. 

In  addition  to  the  exposure  variable  HT(t),  two 
other  time-dependent  variables  are  included  in 
our  extended  Cox  model  for  the  transplant 
data.  These  variables  are  covariates  to  be 
adjusted  for  in  the  assessment  of  the  effect  of 
the  HT(t)  variable. 


Presentation:  Vlll.  An  Application  of  the  Extended  Cox  Model  267 


EXAMPLE:  (continued) 


Covariates: 


TMS(t) 

AGE(t) 


0  if  t  <  wait-time 
TMS  if  t  >  wait-time 
0  if  t  <  wait-time 
AGE  if  t  >  wait-time 


h(t,X(t))  =  /z0(f)exp[Si_f/7’(f) 

+  82TMS(t)  +  5  iAGE(t)] 


Focus: 

Asscesing  the  effect  of  HT(t)  adjusted 
for  TMS(t)  and  AGE(t) 

Note:  HT(t)  does  not  satisfy  PH 
assumption. 


Std.  Haz. 

Variable  Coef.  Err.  P>|z|  Ratio 

HT(t)  -3.1718  1.1861  0.008  0.0417 

TMS(t)  0.4442  0.2802  0.112  1.5593 

AGE(t)  0.0552  0.0226  0.014  1.0567 


HR  =  e-3'1718  =  0.0417 

/;( transplants) 

HR  = - 

h  ( nontransplants) 

Not  appropriate! 


1 

23.98 

J_? 

24  ' 


These  covariates  are  denoted  as  TMS(t)  and 
AGE(t )  and  they  are  defined  as  follows:  TMS(t) 
equals  0  if  t  is  less  than  the  wait-time  for  a 
transplant  but  changes  to  the  “tissue  mismatch 
score”  (TMS)  at  the  time  of  the  transplant  if  t  is 
equal  to  or  greater  than  the  wait-time.  Simi¬ 
larly,  AGE(t)  equals  0  if  t  is  less  than  the  wait¬ 
time  but  changes  to  AGE  at  time  of  transplant 
if  t  is  equal  to  or  greater  than  the  wait-time. 


The  extended  Cox  model  for  the  transplant 
data  is  shown  here.  The  model  contains  the 
three  time-dependent  variables  HT(t),  TMS(t) 
and  AGE(t)  as  described  above. 

For  this  model,  since  HT(t)  is  the  exposure 
variable  of  interest,  the  focus  of  the  analysis 
concerns  assessing  the  effect  of  this  variable 
adjusted  for  the  two  covariates.  Note,  how¬ 
ever,  that  because  the  HT(t)  variable  is  time- 
dependent  by  definition,  this  variable  does  not 
satisfy  the  PH  assumption,  so  that  any  hazard 
ratio  estimate  obtained  for  this  variable  is 
technically  time-dependent. 


A  summary  of  computer  results  for  the  fit  of 
the  above  extended  Cox  model  is  shown  here. 
These  results  indicate  that  the  exposure  vari¬ 
able  HT(t)  is  significant  below  the  one  percent 
significance  level  (i.e.,  the  two-sided  p-value  is 
0.008).  Thus,  transplant  status  appears  to  be 
significantly  associated  with  survival. 


To  evaluate  the  strength  of  the  association,  note 
that  e  to  the  coefficient  of  HT(t  ')  equals  0.0417. 
Since  1  over  0.0417  is  23.98,  it  appears  that 
there  is  a  24-fold  increase  in  the  hazard  of  non¬ 
transplant  patients  to  transplant  patients.  The 
preceding  interpretation  of  the  value  0.0417  as 
a  hazard  ratio  estimate  is  not  appropriate,  how¬ 
ever,  as  we  shall  now  discuss  further. 
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EXAMPLE  (continued) 


23.98  is  inappropriate  as  a  HR: 

•  does  not  compare  two  separate 
groups 

•  exposure  variable  is  not  time- 
independent 

•  wait-time  on  transplants 
contributes  to  survival  on 
nontransplants 


Alternative  interpretation: 

At  time  t, 

h(“ not  yet  received  transplant”) 
ss  24  h‘  ("already  received  transplant”) 


More  appropriate: 

Hazard  ratio  formula  should  account 
for  TMS  and  AGE. 

Transplant?  HT(t)  TMS(t)  AGE(t ) 

Yes  1  TMS  AGE 

No  0  0  0 


i  denotes  ith  transplant  patient 


X*(f)  =  ( HT(t )  =  1 ,  TMS(t)  =  TMSi,  AGE (t)  =  AGEJ 
X(t)  =  (HT(t)  =  0,  TMS(t )  =  0,  AGE(t)  =  0) 


HR(t)  =  exp[5i(l  -  0)  +  5 2(TM5,  -  0) 
+6 3(AG£,-  -  0)] 
=  exp  |^5i  +  S2  TMSi  +  83  AG£/j 
=  exp[— 3.1718  +  0.4442  TMS, 
+0.0552  AGEi) 


First,  note  that  the  value  of  23.98  inappropri¬ 
ately  suggests  that  the  hazard  ratio  is  compar¬ 
ing  two  separate  groups  of  patients.  However, 
the  exposure  variable  in  this  analysis  is  not  a 
time-independent  variable  that  distinguishes 
between  two  separate  groups.  In  contrast,  the 
exposure  variable  is  time-dependent,  and  uses 
the  wait-time  information  on  transplants  as 
contributing  to  the  survival  experience  of  non¬ 
transplants. 

Since  the  exposure  variable  is  time-dependent, 
an  alternative  interpretation  of  the  hazard  ratio 
estimate  is  that,  at  any  given  time  t,  the  hazard 
for  a  person  who  has  not  yet  received  a  trans¬ 
plant  (but  may  receive  one  later)  is  approxi¬ 
mately  24  times  the  hazard  for  a  person  who 
already  has  received  a  transplant  by  that  time. 

Actually,  we  suggest  that  a  more  appropriate 
hazard  ratio  expression  is  required  to  account 
for  a  transplant’s  TMS  and  AGE  score.  Such  an 
expression  would  compare,  at  time  t,  the  values 
of  each  of  the  three  time-dependent  variables 
in  the  model.  For  a  person  who  received  a 
transplant,  these  values  are  1  for  HT{t)  and 
TMS  and  AGE  for  the  two  covariates.  For  a 
person  who  has  not  received  a  transplant,  the 
values  of  all  three  variables  are  0. 


Using  this  approach  to  compute  the  hazard 
ratio,  the  X*(f)  vector,  which  specifies  the  pre¬ 
dictors  for  a  patient  i  who  received  a  transplant 
at  time  t,  has  the  values  1,  TMSi  and  AGEL  for 
patient  i;  the  X(t)  vector,  which  specifies  the 
predictors  at  time  t  for  a  patient  who  has  not 
received  a  transplant  at  time  t,  has  values  of 
0  for  all  three  predictors. 

The  hazard  ratio  formula  then  reduces  to  e  to 
the  sum  of  83  “hat”  plus  S2  “hat”  times  TMSj 
plus  83  “hat”  times  AGEi:  where  the  8  “hat’s”  are 
the  estimated  coefficients  of  the  three  time- 
dependent  variables.  Substituting  the  numeri¬ 
cal  values  for  these  coefficients  in  the  formula 
gives  the  exponential  expression  circled  here. 
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EXAMPLE:  (continued) 


HR(t)  is  time-dependent,  i.e.,  its  value 
at  time  t  depends  on  TMSj  and  AGEt  at 
time  t 

TMS  range:  (0-3.05) 

AGE  range:  (12-64) 


The  resulting  formula  for  the  hazard  ratio  is 
time-dependent  in  that  its  value  depends  on  the 
TMS  and  AGE  values  of  the  zth  patient  at  the 
time  of  transplant.  That  is,  different  patients 
can  have  different  values  for  TMS  and  AGE  at 
time  of  transplant.  Note  that  in  the  dataset, 
TMS  ranged  between  0  and  3.05  and  AGE 
ranged  between  12  and  64. 


We  end  our  discussion  of  the  Stanford  Heart 
Transplant  Study  at  this  point.  For  further 
insight  into  the  analysis  of  this  dataset,  we 
refer  the  reader  to  the  1977  paper  by  Crowley 
and  Hu  (/.  Amer.  Statist.  Assoc.). 


IX.  The  Extended  Cox 
Likelihood 

•  Cox  PH  likelilhood  (L)  described 
in  Chapter  3,  Section  VIII 

•  L  now  extended  for  extended 
Cox  model 


At  the  end  of  the  presentation  from  Chapter  3 
(Section  VIII),  we  illustrated  the  Cox  likelihood 
using  the  dataset  shown  on  the  left.  In  this  sec¬ 
tion,  we  extend  that  discussion  to  illustrate  the 
Cox  likelihood  with  a  time-dependent  variable. 


ID 

TIME 

STATUS 

SMOKE 

Barry 

2 

1 

1 

Gary 

3 

1 

0 

Harry 

5 

0 

0 

Larry 

8 

1 

1 

TIME  —  Survival  time  (in  years) 

STATUS  =  1  for  event,  0  for  censorship 
SMOKE  =  1  for  a  smoker,  0  for  a  nonsmoker 


Cox  PH  model:  h(t)=h0(t)elltSMOKE 


To  review:  The  data  indicate  that  Barry  got  the 
event  at  TIME  =  2  years.  Gary  got  the  event  at 
3  years,  Harry  was  censored  at  5  years  and 
Larry  got  the  event  at  8  years.  Furthermore, 
Barry  and  Larrv  were  smokers  while  Gary  and 
Harry  were  non-smokers. 

In  Chapter  3,  we  constructed  the  Cox  likelihood 
with  one  predictor,  SMOKE,  in  the  model.  The 
model  and  the  likelihood  are  shown  on  the  left. 


Cox  PH  Likelihood: 


_ hp(t)e^ _ 

hp(t)ePi  +  h0(t)e°  +  h0(t)e°  +  h0(t)e h 


o 

o 

-si 

i _ 

ho(t)eP' 

hp(t)e°  +  hp(t)e°  +  hp{t)e^ 

h0{t)ePi_ 

The  likelihood  is  a  product  of  three  terms,  one 
term  for  each  event  time  tf  (TIME  =  2,  3,  and  8). 
The  denominator  of  each  term  is  the  sum  of  the 
hazards  from  the  subjects  still  in  the  risk  set  at 
time  tf,  including  the  censored  subject,  Harry. 
The  numerator  of  each  term  is  the  hazard  of 
the  subject  who  got  the  event  at  tf.  The  reader 
may  wish  to  reread  Section  VIII  of  Chapter  3. 


Cox  extended  model: 

h{t)  =  h^)e^SMOKE+yMOKExT,ME 


Time-dependent  covariate 
(its  value  changes  over  time) 


Now  consider  an  extended  Cox  model,  which 
contains  the  predictor  SMOKE  and  a  time- 
dependent  variable  SMOKE  x  TIME.  For  this 
model,  it  is  not  only  the  baseline  hazard  that 
may  change  over  time  but  also  the  value  of  the 
predictor  variables.  This  can  be  illustrated  by 
examining  Larry's  hazard  at  each  event  time. 
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Larry  got  the  event  at  TIME  =  8 

Larry’s  hazard  at  each  event  time 
TIME  Larry’s  Hazard 

2  /;00jePl+2P2 

3  /*o(0ePl+3P2 

8  h0(t)e^+^2 


Cox  extended  model: 

h(t)  =h0(t)e^SMOKE+^SMOKExTIME 

h0(t)e^+2^ 


h0(t)e^+2^2  +  h0(t)e°  +  h0(t)e°  +  h0(t)e^+2^2 


[ 

h0(t)e°  1 

ho(t)e°  +  ho 

t)e°  +  ho(t)eh+3fo\ 

■*o(0ep,+8fel 

/io(f)eP1+8h 

V\  \ 

\  \ 

\  \ 

\  \ 

'  \ 

\  \ 

Likelihood  is  product  of  three\tenjns: 

\  \  < 

L  -  Li  x  L2  x  Lt,  \  '  i 

/  t  ^ 

Barry  Gary  Larry 
(t=  2)  (t  =  3)  (t  =  8) 


SMOKE  x  TIME  =  0  for  nonsmokers 

SMOKE  x  TIME  changes  over  time  for 
smokers 

Larry’s  hazard  changes  over  Lit  L2,  L3. 


h0(t)  cancels  in  L 

£>Pl+2P2 

/  =  _  _ 

ePi+2P2  +  e°  +  e°  +  <?Pi+2P2 


[e°  +  e°  +  ePi+3^ 
>Pi+8P2- 

^Pi+8P2 


Lairy,  a  smoker,  got  the  event  at  TIME  =  8. 
However  at  TIME  =  2,  3,  and  8,  the  covariate 
SMOKE  x  TIME  changes  values,  thus  impact¬ 
ing  Larry’s  hazard  at  each  event  time  (see  left). 
Understanding  how  the  expression  for  an  indi¬ 
vidual’s  hazard  changes  over  time  is  the  key 
addition  toward  understanding  how  the  Cox 
extended  likelihood  differs  from  the  Cox  PH 
likelihood. 

The  extended  Cox  model  for  these  data  is 
shown  again  on  the  left. 


The  extend  cox  likehood  (L)  for  these  data  is 
shown  on  the  left.  This  likehood  is  constructed 
in  a  similar  mannar  as  the  likehood  for  the  Cox 
PH  model.  Yhe  difference  is  that  the  expression 
for  the  subject’s  hazard  is  allowed  to  vary  over 
time. 


Just  as  with  the  Cox  PH  likelihood  shown  on 
the  previous  page,  the  extended  Cox  likelihood 
is  also  a  product  of  three  terms,  corresponding 
to  the  three  event  times  (L  =  Lx  xL2xL3).  Barry 
got  the  event  first  at  t  =  2,  then  Gary  at  t  =  3,  and 
finally  Larry  at  t  =  8.  Harry,  who  was  censored 
at  t  =  5,  was  still  at  risk  when  Barry  and  Gariy 
got  the  event.  Therefore,  Hany’s  hazard  is  still 
in  the  denominator  of  Lj  and  L2. 


The  inclusion  of  the  time-varying  covariate 
SMOKE  x  TIME  does  not  change  the  expres¬ 
sion  for  the  hazard  for  the  nonsmokers  (Gary 
and  Harry)  since  SMOKE  is  coded  0  for  non- 
smokers.  However,  for  smokers  (Bany  and 
Larry),  the  expression  for  the  hazard  changes 
with  time.  Notice  how  Larry’s  hazard  changes 
in  the  denominator  of  Lt,  L2,  and  L3. 

The  baseline  hazard  cancels  in  the  extended 
Cox  likelihood  as  it  does  with  the  Cox  PH  like¬ 
lihood.  Thus,  the  form  of  the  baseline  hazard 
need  not  be  specified,  as  it  plays  no  role  in  the 
estimation  of  the  regression  parameters. 


x 
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Caution:  Incorrect  Coding  of  SMOKE 
x  TIME 


Coded  as  time-independent, 
not  time-dependent 


A  word  of  caution  for  those  planning  to  run  a 
model  with  a  time-varying  covariate:  It  is  incor¬ 
rect  to  create  a  product  term  with  TIME  in  the 
data  step  by  multiplying  each  individual’s 
value  for  SMOKE  with  their  survival  time. 
In  other  words,  SMOKE x TIME  should  not 
be  coded  like  the  typical  interaction  term. 
In  fact,  if  SMOKE  x  TIME  was  coded  as  it  is 
on  the  left,  then  SMOKE  x  TIME  would  be  a 
time-independent  variable.  Larry’s  value  for 
SMOKE  x  TIME  is  incorrectly  coded  at  a  con¬ 
stant  value  of  8  even  though  Larry’s  value  for 
SMOKE  x  TIME  changes  in  the  likelihood  over 
L1(  L2,  and  L3. 


Incorrectly  coded  SMOKE x  TIME 

•  Time  independent 

•  Probably  highly  significant 

•  Survival  time  should  predict 
survival  time 

•  But  not  meaningful 


If  the  incorrectly  coded  time  independent 
SMOKE  x  TIME  was  included  in  a  Cox  model, 
it  would  not  be  surprising  if  the  coefficient 
estimate  was  highly  significant  even  if  the  PH 
assumption  was  not  violated.  It  would  be 
expected  that  a  product  term  with  each  indivi¬ 
dual’s  survival  time  would  predict  the  outcome 
(their  survival  time),  but  it  would  not  be  mean¬ 
ingful.  Nevertheless,  this  is  a  common  mistake. 


Correctly  coding  SMOKE  x  TIME 

•  Time  dependent 

•  Computer  packages  typically 
allow  definition  in  the  analytic 
procedure 


To  obtain  a  correctly  defined  SMOKE  x  TIME 
time-dependent  variable,  computer  packages 
typically  allow  the  variable  to  be  defined  within 
the  analytic  procedure. 


Alternatively,  the  (start,  stop)  or  counting  pro- 

Alternatively  can  code  using  CP  cess  (CP)  data  layout  can  be  used,  to  explicitly 
format  define  a  time-dependent  variable  in  the  data. 


•  CP  format  introduced  in 
Chapter  1 

•  Multiple  observations  per 
subject 

•  Time  intervals  at  risk 
subdivided 

•  Covariate  value  can  change 
from  interval  to  interval  for  the 
same  subject 


The  CP  data  format  was  introduced  in  Chapter 
1 .  This  data  layout  provides  a  straightforward 
approach  for  expressing  a  time-dependent 
covariate  by  allowing  multiple  observations  to 
correspond  to  the  same  individual.  With  this 
format,  an  individual's  total  at-risk  follow-up 
time  is  sub-divided  into  smaller  time  intervals 
providing  a  way  for  values  of  variables  to 
change  from  time  i  n  terval  -  to- i  n  terval  for  the 
same  individual. 
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CP  format  with  time-dependent 
variable  SMOKE  x  TIME: 


SMOKE 


X 


ID 

START 

STOP  STATUS 

SMOKE 

TIME 

Barry 

0 

2  1 

1 

2 

Gary 

0 

2  0 

0 

0 

Gary 

2 

3  1 

0 

0 

Harry 

0 

2  0 

0 

0 

Harry 

2 

3  0 

0 

0 

Harry 

3 

5  0 

0 

0 

Larry 

0 

2  0 

1 

2 

Larry 

2 

3  0 

>  X 

3 

Larry 

3 

8  1 

i 

8 

Coded  as  time  dependent 

START  =  Beginning  of  interval 

(in  months) 

STOP 

=  End  of  interval 

(in  months) 

STATUS  =  1  for  event,  0  for 

censorship 

Alternative  CP  format: 

Gary 

and  Harry 

do  not 

need 

multiple  observations 

since 

SMOKE  x  TIME  does  not 

vary 

for  them  (same  info  as  above) 

SMOKE 

ID 

START 

STOP  STATUS 

SMOKE 

TIME 

Barry 

0 

2  1 

1 

2 

Gary 

0 

3  1 

0 

0 

Harry 

0 

5  0 

0 

0 

Larry 

0 

2  0 

1 

2 

Larry 

2 

3  0 

1 

3 

Larry 

3 

8  1 

1 

8 

2  reasons  to  include  time-varying 
covariate: 

1)  To  account  for  PH  violation 

2)  The  values  actually  change  over 
time  regardless  of  the  PH 
assumption 


The  data  layout  on  the  left  illustrates  the  CP 
approach  with  the  (start,  stop)  time  intervals 
defined  by  each  event  time  (t=2,  t=3,  and  t=8) 
from  Barry,  Gary,  and  Larry. 

If  we  look  at  the  final  three  observations,  we 
can  see  that  Larry's  total  time  at  risk  is  subdi¬ 
vided  into  three  time  intervals.  Larry  got  the 
event  att=8  (STOP=8  and  STATUS=1).  The  two 
previous  observations  indicate  that  Larry  did 
not  get  an  event  (STATUS=0)  over  the  time 
intervals  (0,  2)  or  (2,  3).  Over  the  last  three 
observations,  the  time-dependent  variable 
SMOKE  x  TIME  changes  values  for  Larry 
from  2  to  3  to  8. 


An  alternative  CP  data  layout  is  shown  below 
on  the  left.  Since  Gary  and  Harry  are  non- 
smokers  and  the  coding  for  nonsmokers  is 
SMOKE=0,  their  values  for  the  SMOKE  x  TIME 
variable  stays  at  zero  throughout  their  time 
at  risk.  Therefore,  it  is  not  necessary  to  have 
multiple  observations  for  Gary  and  Harry 
(although  it  is  not  incorrect  to  do  so). 


There  are  two  main  reasons  why  a  time-varying 
covariate  might  be  included  in  a  Cox  model:  (1) 
To  account  for  a  violation  of  the  proportional 
hazards  assumption  (usually  formulated  as  a 
product  term  with  some  function  of  time)  and 
(2)  The  covariate  may  actually  change  its  values 
over  time  regardless  of  the  PH  assumption. 
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SMOKE  x  TIME 

defined  to  account  for  PH 
violation 


DOSE  changes  at  3  time  points  for 
Jane 


I  M  S 

D  O  T  D  T  D  T 

N  A  O  I  O  I 

T  T  S  M  S  M 

H  U  E  E  E  E 

S  S  1  1  2  2 

Jane  49  1  60  0  120  12 


D  T 
O  I 
S  M 
E  E 
_3 _ 3_ 

150  30 


MONTHS  =  Survival  time 
(in  months) 

STATUS  =  1  for  event,  0 
for  censorship 

Same  info  as  above  using  CP  format 
(3  observations  instead  of  1) 


ID 

START 

STOP 

STATUS 

DOSE 

Jane 

0 

12 

0 

60 

Jane 

12 

30 

0 

120 

Jane 

30 

49 

1 

150 

START  =  Beginning  of  interval 
(in  months) 

STOP  =  End  of  interval 
(in  months) 

STATUS  =  1  for  event,  0 
for  censorship 
DOSE  =  Dose  in  milligrams 

Multiple  observations  per  subject: 
revisited  in  Chapter  8  (recurrent 
events) 


The  use  of  the  SMOKE  x  TIME  variable  in  the 
last  example  was  of  the  first  type  (to  account 
for  a  PH  violation).  An  example  of  the  second 
type  could  be  the  changing  of  an  individual’s 
dosage  level  of  medication  over  time  as  illu¬ 
strated  in  the  next  example. 

The  data  on  the  left  contains  one  observation 
for  Jane  who  had  an  event  at  49  months 
(MONTHS=49  and  STATUS=1).  Her  dose  of 
medication  at  the  beginning  of  follow-up  was 
60  milligrams  (DOSE1=60  and  TIME1=0).  At 
the  12th  month  of  follow-up,  her  dose  was 
changed  to  120  milligrams  (DOSE2=120  and 
TIME2=12).  At  the  30th  month  of  follow-up, 
her  dose  was  changed  to  150  mg  (DOSE3=120 
and  TIME3=30). 


The  same  information  can  be  expressed  using 
the  counting  process  data  layout.  On  the  left, 
the  data  is  transposed  to  contain  three  observa¬ 
tions  for  Jane  allowing  DOSE  to  be  represented 
as  a  time-dependent  variable.  For  the  first 
time  interval  (START=0,  STOP=12),  Jane’s 
dose  was  60  mg.  For  the  second  time  interval 
(12-30  months),  Jane’s  dose  was  120  milli¬ 
grams.  For  the  third  time  interval  (30-49 
months),  Jane’s  dose  was  150  milligrams.  The 
data  indicates  that  Jane  had  an  event  at  49 
months  (STOP=49  and  STATUS=1). 


The  counting  process  data  layout  is  further 
discussed  in  Chapter  8  on  recurrent  events. 
With  recurrent  event  data,  subjects  may 
remain  at  risk  for  subsequent  events  after  get¬ 
ting  an  event. 
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Coding  SMOKE  x  TIME  as  time- 
dependent 


Multiple  Observations  per  Subject 


ID 

TIME 

STATUS 

SMOKE 

SMOKE 

x  TIME 

Barry 

2 

1 

1 

2 

Gary 

2 

0 

0 

0 

Gary 

3 

1 

0 

0 

Harry 

2 

0 

0 

0 

Harry 

3 

0 

0 

0 

Harry 

5 

0 

0 

0 

Larry 

2 

0 

1 

2 

Larry 

3 

0 

1 

3 

Larry 

5 

0 

1 

5 

Larry 

8 

1 

1 

8 

t 

Coded  as  time-dependent 


Multiple  observations  per  subject: 
revisited  in  Chapter  8  (recurrent 
events) 


When  a  time-dependent  variable  is  defined 
within  the  Cox  analytic  procedure,  the  variable 
is  defined  internally  such  that  the  user  may  not 
see  the  time-dependent  variable  in  the  dataset. 
However,  the  dataset  on  the  left  will  provide  a 
clearer  idea  of  the  correct  definition  of  SMOKE 
x  TIME.  The  dataset  contains  multiple  obser¬ 
vations  per  subject.  Barry  was  at  risk  at  t  = 
2  and  got  the  event  at  that  time.  Gary  was  at 
risk  at  t  =  2  and  t  =  3.  Gary  didn’t  get  the  event 
at  t  =  2  but  did  get  the  event  at  t  =  3.  Harry  was 
at  risk  at  t  =  2,  t  =  3,  t  =  5  and  didn’t  get  the 
event.  Larry  was  at  risk  at  t  =  2,  t  =  3,  t  =  5,  t  = 
8  and  got  the  event  at  t  =  8.  Notice  how  the 
SMOKE  x  TIME  variable  changes  values  for 
Larry  over  time. 


Survival  analysis  datasets  containing  multiple 
observations  per  subject  are  further  discussed 
in  Chapter  8  on  recurrent  events.  With  recur¬ 
rent  event  data,  subjects  may  remain  at  risk  for 
subsequent  events  after  getting  an  event. 


X.  Summary  a  summary  of  this  presentation  on  time- 

dependent  variables  is  now  provided.  We  began 
Review  Cox  PH  model.  by  reviewing  the  main  features  of  the  Cox  PH 

model.  We  then  defined  a  time-dependent  vari- 
Define  time-dependent  variable:  able  and  illustrated  three  types  of  these 
defined,  internal,  ancillary.  variables-defined,  internal,  and  ancillary. 


Extended  Cox  model: 


IIR(r)  =  exp 


Efc[*;-x] 


1=1 


P2  r 

Function  of  time 


Next,  we  gave  the  form  of  the  “extended  Cox 
model,”  shown  here  again,  which  allows  for 
time-dependent  as  well  as  time-independent 
variables. 


We  then  described  various  characteristics  of 
this  extended  Cox  model,  including  the  for¬ 
mula  for  the  hazard  ratio.  The  latter  formula 
is  time-dependent  so  that  the  PH  assumption  is 
not  satisfied. 
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Model  for  assessing  PH 
assumption: 


h(t,X(t))  =  h0(t)  exp 


Eft-* 


i=  1 


Examples  of  gi(t): 

t,  log  t,  heaviside  function 


Heaviside  functions: 

— 

HR 


We  also  showed  how  to  use  time-dependent 
variables  to  assess  the  PH  assumption  for 
time-independent  variables.  A  general  formula 
for  an  extended  Cox  model  that  simultaneously 
considers  all  time-independent  variables  of 
interest  is  shown  here. 


The  functions  g,(t)  denote  functions  of  time 
for  the  zth  variable  that  are  to  be  determined 
by  the  investigator.  Examples  of  such  functions 
are  g,(t)  =  t,  log  t,  or  a  heaviside  function. 

The  use  of  heaviside  functions  were  described 
and  illustrated.  Such  functions  allow  for  the 
hazard  ratio  to  be  constant  within  different 
time  intervals. 


For  two  time  intervals,  the  model  can  take  either 
one  of  two  equivalent  forms  as  shown  here.  The 
first  model  contains  a  main  effect  of  exposure  and 
only  one  heaviside  function.  The  second  model 
contains  two  heaviside  functions  without  a  main 
effect  of  exposure.  Both  models  yield  two  distinct 
and  equivalent  values  for  the  hazard  ratio. 


where 
8i(t) 

82(f) 


h(t,X(t))  =  h0(t )  exp[p£  +  8 Eg(t)] 
where 

1  if  t  >  to 


s(t )  = 


0  if  t  <  to 


)) 

=  /zoWexpfPiEgdf)  +  p  2Eg2(t)] 


EXAMPLE  1 


1991  Australian  study  of  heroin 
addicts 

•  two  methadone  maintenance 
clinics 

•  addicts  dataset  file 

•  clinic  variable  did  not  satisfy  PH 
assumption 


We  illustrated  the  use  of  time-dependent 
variables  through  two  examples.  The  first  exam¬ 
ple  considered  the  comparison  of  two  methadone 
maintenance  clinics  for  heroin  addicts.  The  data¬ 
set  file  was  called  addicts.  In  this  example,  the 
clinic  variable,  which  was  a  dichotomous  expo¬ 
sure  variable,  did  not  satisfy  the  PH  assumption. 
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Adjusted  survival  curves  stratified  by  clinic 
showed  clinic  2  to  have  consistently  higher 
survival  probabilities  than  clinic  1,  with  a 
more  pronounced  difference  in  clinics  after 
one  year  of  follow-up.  However,  this  stratifica¬ 
tion  did  not  allow  us  to  obtain  a  hazard  ratio 
estimate  for  clinic.  Such  an  estimate  was  pos¬ 
sible  using  an  extended  Cox  model  containing 
interaction  terms  involving  clinic  with  time. 

Two  extended  Cox  models  were  considered. 
The  first  used  heaviside  functions  to  obtain 
two  distinct  hazard  ratios,  one  for  the  first 
year  of  follow-up  and  the  other  for  greater 
than  one  year  of  follow-up.  The  model  is 
shown  here. 

The  second  extended  Cox  model  used  a  time- 
dependent  variable  that  allowed  for  the  two 
survival  curves  to  diverge  over  time.  This 
model  is  shown  here. 

Both  models  yielded  hazard  ratio  estimates 
that  agreed  reasonably  well  with  the  graph  of 
adjusted  survival  curves  stratified  by  clinic. 

The  second  example  considered  results 
obtained  in  the  Stanford  Heart  Transplant 
Study.  The  goal  of  the  study  was  to  assess 
whether  patients  receiving  transplants  survived 
longer  than  patients  not  receiving  transplants. 

The  analysis  of  these  data  involved  an  extended 
Cox  model  containing  three  time-dependent 
variables.  One  of  these,  the  exposure  variable, 
and  called  HT(t),  was  an  indicator  of  transplant 
status  at  time  t.  The  other  two  variables, 
TMS{t)  and  AGE{t),  gave  tissue  mismatch 
scores  and  age  for  transplant  patients  when 
time  t  occurred  after  receiving  a  transplant. 
The  value  of  each  of  these  variables  was  0  at 
times  prior  to  receiving  a  transplant. 


EXAMPLE  2:  Stanford  Heart 
Transplant  Study 


Goals:  Do  patients  receiving  transplants 
survive  longer  than  patients  not 
receiving  transplants? 
h(t,X(t))  =  h0(t)  exp[§i//r(f)  +  82TMS(t ) 
+  83 AG£(t)]  \ 


Exposure  variable 


EXAMPLE:  (continued) 


Adjusted  Survival  Curves 
Stratified  by  Clinic 


0  200  400  600  800  1000  1200 
Days 

Two  distinct  HR’s: 
h(t,X(f))  =  h0(t)  exp[p2(prison) 

+  p3(dose)  +  Sdclinicjgih) 
+  S2(clinic)g2(f)] 

// 

( Heaviside  functions  ) 


Diverging  HR’s: 

h(t,X(f))  =  h0(t)  exp[p2(prison) 

+  p3(dose)  +  pi(clinic) 
+  S(clinic  x  t)] 
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EXAMPLE:  (continued) 


Results:  HT(t)  highly  significant,  i.e., 
transplants  have  better  prognosis  than 
nontransplants. 

Hazard  ratio  estimate  problematic: 


More  appropriate  formula: 

HR  =  exp[— 3.1718  +  0.4442  TMSj 
+0.0552  AGE,] 


The  results  from  fitting  the  above  extended 
Cox  model  yielded  a  highly  significant  effect 
of  the  exposure  variable,  thus  indicating  that 
survival  prognosis  was  better  for  transplants 
than  for  nontransplants. 


From  these  data,  we  first  presented  an  inappro¬ 
priate  formula  for  the  estimated  hazard  ratio. 
This  formula  used  the  exponential  of  the  coef¬ 
ficient  of  the  exposure  variable,  which  gave  an 
estimate  of  1  over  23.98.  A  more  appropriate 
formula  considered  the  values  of  the  covariates 
TMSit)  and  AGE{t )  at  time  t.  Using  the  latter, 
the  hazard  ratio  estimate  varied  with  the  tissue 
mismatch  scores  and  age  of  each  transplant 
patient. 


Chapters 


1 .  Introduction  to  Survival 
Analysis 

2.  Kaplan-Meier  Curves  and 
the  Log-Rank  Test 

3.  The  Cox  Proportional 
Hazards  Model 

4.  Evaluating  the  Proportional 
Hazards  Assumption 

5.  The  Stratified  Cox  Procedure 


✓ 


6.  Extension  of  the  Cox 

Proportional  Hazards  Model 
for  Time-Dependent 
Variables 


This  presentation  is  now  complete.  We  suggest 
that  the  reader  review  the  detailed  outline  that 
follows  and  then  answer  the  practice  exercises 
and  test  that  follow  the  outline. 

A  key  property  of  Cox  models  is  that  the  distri¬ 
bution  of  the  outcome,  survival  time,  is  unspec¬ 
ified.  In  the  next  chapter,  parametric  models 
are  presented  in  which  the  underlying  distribu¬ 
tion  of  the  outcome  is  specified.  The  exponen¬ 
tial,  Weibull,  and  log-logistic  models  are 
examples  of  parametric  models. 


Next: 


7.  Parametric  models 


278  6.  Extension  of  the  Cox  Proportional  Hazards  Model 


Detailed 

Outline 


I.  Preview  (page  244) 

II.  Review  of  the  Cox  PH  Model  (pages  244-246) 
A.  The  formula  for  the  Cox  PH  model: 


B. 


/;(?,  X)  =  ho{t)  exp 


I _ Ltl _ JJ 

Formula  for  hazard  ratio  comparing  two  indi¬ 
viduals: 


X*  =  (xl,x*2,  •  •  •  ,x;)  and  X  =  •  •  •  ’Xr)  : 


C.  The  meaning  of  the  PH  assumption: 

•  Hazard  ratio  formula  shows  that  the  hazard 
ratio  is  independent  of  time: 

/»(f,x*) 
h(t,  X) 

•  Hazard  ratio  for  two  X’s  are  proportional: 

h(t,x*)  =  e/i(t,x) 

D.  Three  methods  for  checking  the  PH  assump¬ 
tion: 

i.  Graphical:  Compare  ln-ln  survival  curves 
or  observed  versus  predicted  curves 

ii.  Time-dependent  covariates:  Use  product 
(i.e.,  interaction)  terms  of  the  form  A  x  g(t). 

iii.  Goodness-of-fit  test:  Use  a  large  sample  Z 
statistic. 

E.  Options  when  the  PH  assumption  is  not  met: 

i.  Use  a  stratified  Cox  procedure. 

ii.  Use  an  extended  Cox  model  containing  a 
time-dependent  variable  of  the  form  X  x  g(t). 

III.  Definition  and  Examples  of  Time-Dependent 
Variables  (pages  246-249) 

A.  Definition:  any  variable  whose  values  differ 
over  time 

B.  Examples  of  defined,  internal,  and  ancillary 
time-dependent  variables 
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IV.  The  Extended  Cox  Model  for  Time-Dependent 

Variables  (pages  249-251) 

r  pi  pi 

A.  h(t,X(t))  =  /;o(r)exp 

_ L‘=1 _ y'=i _ 

where  X(t)  =  (X,,  X2,..„  XPi,  Xx{t),  X2(t)„ . ., 
XPi(t))  denotes  the  entire  collection  of  predic¬ 
tors  at  time  t,  X}  denotes  the  ith  time- 
independent  variable,  and  X;(t)  denotes  the  ;th 
time-dependent  variable. 

B.  ML  procedure  used  to  estimate  regression  coef¬ 
ficients. 

C.  List  of  computer  programs  for  the  extended  Cox 
model. 

D.  Model  assumes  that  the  hazard  at  time 
t  depends  on  the  value  of  X;(f)  at  the  same  time. 

E.  Can  modify  model  for  lag-time  effect. 

V.  The  Hazard  Ratio  Formula  for  the  Extended  Cox 

Model  (pages  251-253) 


B.  Because  HR(t )  is  a  function  of  time,  the  PH 
assumption  is  not  satisfied. 

C.  The  estimated  coefficient  of  X;(t)  is  time-inde¬ 
pendent,  and  represents  an  “overall”  effect  of 
X,(t). 

VI.  Assessing  Time-Independent  Variables  That  Do 
Not  Satisfy  the  PH  Assumption  (pages  254-259) 

A.  General  formula  for  assessing  PH  assumption: 

h(t,  X(r))  =  h0(t)  exp 

|_<=i  i=i  J 

B.  gi(t)  is  a  function  of  time  corresponding  to  X, 

C.  Test  Hq.  =  §2  =  . . .  =  8p  =  0 

D.  Heaviside  function: 

g(t)  = 


1  if  t  >  to 
0  if  t  <  to 


280 


6. 
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E.  The  model  with  a  single  heaviside  function: 
h(t,X(t))  =  h0(t)  exp[pE  +  8Eg  (t)] 


F.  The  model  with  two  heaviside  functions: 


h(t,X(t))  =  h0(t)  exp  [Si  Egi(t)  +  8  2Eg2(t)\ 


where 


8\{t) 


1  if  t  >  t0  I  1  if  t  <  t0 

and  g2(t)  =  < 

0  if  r  <  t0  I  0  if  r  >  t0 


G.  The  hazard  ratios: 


t  >  t0  :  HR  =  exp 

p  +  s' 

=  exp 

X 

t  <  t0  :  HR  =  exp 

> 

=  ' 

exp 

X 

H.  Several  heaviside  functions:  examples  given 
with  four  time-intervals: 

•  Extended  Cox  model  contains  either 
[E,  E  x  gi(t),  E  x  g2(t),  E  x  g3(f)}  or 

l E  x  g^t),  E  x  g2(t),  E  x  g3(f),  E  x  g4(f)] 

•  The  model  using  four  product  terms  and  no 
main  effect  of  E: 

h{t,X(t))  =  h0(t )  exp[<5!  Eg^t)  +  S2Eg2{t) 
+S3Egi(t)  +  S4Eg4(t)] 

where 


8i(t) 


1  if  t  is  within  interval  i 
0  if  otherwise 


VII.  An  Application  of  the  Extended  Cox  Model  to 

An  Epidemiologic  Study  on  the  Treatment  of 

Heroin  Addiction  (pages  260-264) 

A.  1991  Australian  study  of  heroin  addicts 

•  two  methadone  maintenance  clinics 

•  addicts  dataset  file 

•  clinic  variable  did  not  satisfy  PH  assump¬ 
tion 

B.  Clinic  2  has  consistently  higher  retention  pro¬ 
babilities  than  clinic  1 ,  with  a  more  pronounced 

difference  in  clinics  after  1  year  of  treatment. 

C.  Two  extended  Cox  models  were  considered: 

•  Use  heaviside  functions  to  obtain  two  dis¬ 
tinct  hazard  ratios,  one  for  less  than  1  year 
and  the  other  for  greater  than  1  year. 

•  Use  a  time-dependent  variable  that  allows  for 
the  two  survival  curves  to  diverge  over  time. 
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VIII.  An  Application  of  the  Extended  Cox  Model  to 

the  Analysis  of  the  Stanford  Heart  Transplant 

Data  (pages  265-269) 

A.  The  goal  of  the  study  was  to  assess  whether 
patients  receiving  transplants  survived  longer 
than  patients  not  receiving  transplants. 

B.  We  described  an  extended  Cox  model  contain¬ 
ing  three  time-dependent  variables: 

h(t,X(t ))  =  h0(t)exp[6,HT(t)  +  82TMS(t )  +  83A GE(t )] 

C.  The  exposure  variable,  called  HT(t),  was  an  indi¬ 
cator  of  transplant  status  at  time  t.  The  other  two 
variables,  TMS(t)  and  AGE(f),  gave  tissue  mis¬ 
match  scores  and  age  for  transplant  patients 
when  time  t  occurred  after  receiving  a  transplant. 

D.  The  results  yielded  a  highly  significant  effect  of 
the  exposure  variable. 

E.  The  use  of  a  hazard  ratio  estimate  for  this  data 
was  problematical. 

•  An  inappropriate  formula  is  the  exponential  of 
the  coefficient  oiHT(t),  which  yields  1/23.98. 

•  An  alternative  formula  considers  the  values 
of  the  covariates  TMS(t )  and  AGE(t)  at  time  t. 

IX.  The  Extended  Cox  Likelihood  (pages  269-274) 

A.  Review  of  PH  likelihood  (Chapter  3). 

B.  Barry,  Gary,  Larry,  example  of  Cox  likelihood. 

X.  Summary  (pages  274-277) 

The  following  dataset  called  “anderson.dat”  consists  of 
remission  survival  times  on  42  leukemia  patients,  half  of 
whom  receive  a  new  therapy  and  the  other  half  of  whom 
get  a  standard  therapy  (Freireich  et  al.,  Blood,  1963).  The 
exposure  variable  of  interest  is  treatment  status  (Rx  =  0  if 
new  treatment,  Rx  =  1  if  standard  treatment).  Two  other 
variables  for  control  are  log  white  blood  cell  count  (i.e.,  log 
WBC)  and  sex.  Failure  status  is  defined  by  the  relapse 
variable  (0  if  censored,  1  if  failure).  The  dataset  is  listed 
as  follows: 


Subj 

Surv 

Relapse 

Sex 

log  WBC 

Rx 

1 

35 

0 

1 

1.45 

0 

2 

34 

0 

1 

1.47 

0 

3 

32 

0 

1 

2.2 

0 

4 

32 

0 

1 

2.53 

0 

5 

25 

0 

1 

1.78 

0 

6 

23 

1 

1 

2.57 

0 

(Continued  on  next  page) 
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Subj 

Surv 

Relapse 

Sex 

log  WBC 

Rx 

7 

22 

1 

1 

2.32 

0 

8 

20 

0 

1 

2.01 

0 

9 

19 

0 

0 

2.05 

0 

10 

17 

0 

0 

2.16 

0 

11 

16 

1 

1 

3.6 

0 

12 

13 

1 

0 

2.88 

0 

13 

11 

0 

0 

2.6 

0 

14 

10 

0 

0 

2.7 

0 

15 

10 

1 

0 

2.96 

0 

16 

9 

0 

0 

2.8 

0 

17 

7 

1 

0 

4.43 

0 

18 

6 

0 

0 

3.2 

0 

19 

6 

1 

0 

2.31 

0 

20 

6 

1 

1 

4.06 

0 

21 

6 

1 

0 

3.28 

0 

22 

23 

1 

1 

1.97 

1 

23 

22 

1 

0 

2.73 

1 

24 

17 

1 

0 

2.95 

1 

25 

15 

1 

0 

2.3 

1 

26 

12 

1 

0 

1.5 

1 

27 

12 

1 

0 

3.06 

1 

28 

11 

1 

0 

3.49 

1 

29 

11 

1 

0 

2.12 

1 

30 

8 

1 

0 

3.52 

1 

31 

8 

1 

0 

3.05 

1 

32 

8 

1 

0 

2.32 

1 

33 

8 

1 

1 

3.26 

1 

34 

5 

1 

1 

3.49 

1 

35 

5 

1 

0 

3.97 

1 

36 

4 

1 

1 

4.36 

1 

37 

4 

1 

1 

2.42 

1 

38 

3 

1 

1 

4.01 

1 

39 

2 

1 

1 

4.91 

1 

40 

2 

1 

1 

4.48 

1 

41 

1 

1 

1 

2.8 

1 

42 

1 

1 

1 

5 

1 

The  following  edited  printout  gives  computer  results  for 
fitting  a  Cox  PH  model  containing  the  three  predictives  Rx, 
log  WBC,  and  Sex. 


Cox  regression 
Analysis  time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

P(PH) 

Sex 

0.263 

0.449 

0.558 

1.301 

0.539 

3.139 

0.042 

log  WBC 

1.594 

0.330 

0.000 

4.922 

2.578 

9.397 

0.714 

Rx 

1.391 

0.457 

0.002 

4.018 

1.642 

9.834 

0.500 

Log  likelihood  =  —72.109 


No.  of  subjects  =  42 
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1.  Which  of  the  variables  in  the  model  fitted  above  are 
time-independent  and  which  are  time-dependent? 

2.  Based  on  this  printout,  is  the  PH  assumption  satisfied 
for  the  model  being  fit?  Explain  briefly. 

3.  Suppose  you  want  to  use  an  extended  Cox  model  to 
assess  the  PH  assumption  for  all  three  variables  in  the 
above  model.  State  the  general  form  of  an  extended  Cox 
model  that  will  allow  for  this  assessment. 

4.  Suppose  you  wish  to  assess  the  PH  assumption  for  the 
Sex  variable  using  a  heaviside  function  approach 
designed  to  yield  a  constant  hazard  ratio  for  less  than 
1 5  weeks  of  follow-up  and  a  constant  hazard  ratio  for 
15  weeks  or  more  of  follow-up.  State  two  equivalent 
alternative  extended  Cox  models  that  will  carry  out 
this  approach,  one  model  containing  one  heaviside 
function  and  the  other  model  containing  two  heaviside 
functions. 

5.  The  following  is  an  edited  printout  of  the  results 
obtained  by  fitting  an  extended  Cox  model  containing 
two  heaviside  functions: 

Time-Dependent  Cox  Regression  Analysis 


Analysis 
time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

log  WBC 

1.567 

0.333 

0.000 

4.794 

2.498 

9.202 

Rx 

1.341 

0.466 

0.004 

3.822 

1.533 

9.526 

0-15  wks 

0.358 

0.483 

0.459 

1.430 

0.555 

3.682 

15+  wks 

-0.182 

0.992 

0.855 

0.834 

0.119 

5.831 

No.  of  subjects 

=  42 

Log  likelihood 

=  -71.980 

Using  the  above  computer  results,  cany  out  a  test  of 
hypothesis,  estimate  the  hazard  ratio,  and  obtain  95% 
confidence  interval  for  the  treatment  effect  adjusted  for 
log  WBC  and  the  time-dependent  Sex  variables.  What 
conclusions  do  you  draw  about  the  treatment  effect? 

6.  We  now  consider  an  alternative  approach  to 
controlling  for  Sex  using  an  extended  Cox  model.  We 
define  an  interaction  term  between  sex  and  time  that 
allows  for  diverging  survival  curves  over  time. 

For  the  situation  just  described,  write  down  the 
extended  Cox  model,  which  contains  Rx,  log  WBC,  and 
Sex  as  main  effects  plus  the  product  term  Sex  x  time. 
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7.  Using  the  model  described  in  question  6,  express  the 
hazard  ratio  for  the  effect  of  Sex  adjusted  for  Rx  and  log 
WBC  at  8  and  16  weeks. 

8.  The  following  is  an  edited  printout  of  computer  results 
obtained  by  fitting  the  model  described  in  question  6. 

Time-Dependent  Cox  Regression  Analysis 


Analysis 
time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

Sex 

1.820 

1.012 

0.072 

6.174 

0.849  44.896 

log  WBC 

1.464 

0.336 

0.000 

4.322 

2.236  8.351 

Rx 

1.093 

0.479 

0.022 

2.984 

1.167  7.626 

Sex  x  Time 

-0.345 

0.199 

0.083 

0.708 

0.479  1.046 

No.  of  subjects  =  42  Log  likelihood  =  —70.416 

Based  on  the  above  results,  describe  the  hazard  ratio 
estimate  for  the  treatment  effect  adjusted  for  the  other 
variables  in  the  model,  and  summarize  the  results  of 
the  significance  test  and  interval  estimate  for  this  haz¬ 
ard  ratio.  How  do  these  results  compare  with  the 
results  previously  obtained  when  a  heaviside  function 
approach  was  used?  What  does  this  comparison  sug¬ 
gest  about  the  drawbacks  of  using  an  extended  Cox 
model  to  adjust  for  variables  not  satisfying  the  PH 
assumption? 

9.  The  following  gives  an  edited  printout  of  computer 
results  using  a  stratified  Cox  procedure  that  stratifies 
on  the  Sex  variable  but  keeps  Rx  and  log  WBC  in  the 
model. 


Stratified  Cox  regression 


Analysis 
time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

log  WBC 

Rx 

1.390 

0.931 

0.338 

0.472 

0.000 

0.048 

4.016 

2.537 

2.072  7.783 
1.006  6.396 

No.  of  subjects  =  42  Log  likelihood  =  -57.560  Stratified  by  sex 

Compare  the  results  of  the  above  printout  with  previ¬ 
ously  provided  results  regarding  the  hazard  ratio  for 
the  effect  of  Rx.  Is  there  any  way  to  determine  which  set 
of  results  is  more  appropriate?  Explain. 
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The  following  questions  consider  the  analysis  of 
data  from  a  clinical  trial  concerning  gastric  carcinoma, 
in  which  90  patients  were  randomized  to  either  chemo¬ 
therapy  (coded  as  2)  alone  or  to  a  combination  of 
chemotherapy  and  radiation  (coded  as  1).  See  Stablein 
et  al.,  “Analysis  of  Survival  Data  with  Nonproportional 
Hazard  Functions,”  Controlled  Clinical  Trials,  vol.  2, 
pp.  149-159(1981). 

1.  A  plot  of  the  log-log  Kaplan-Meier  curves  for  each 
treatment  group  is  shown  below.  Based  on  this  plot, 
what  would  you  conclude  about  the  PH  assumption 
regarding  the  treatment  group  variable?  Explain. 


4.0  ■ 
2.0  ■ 
0.0. 
-2.0  ■ 


T 


_i _ i _ i _ i _ 


Log-log  Survival  Curves  for 
Each  Treatment  Group 


t - 1 - 1 - 1 - 1 - 1 - r 


0  200 
Number  at  risk 

400 

600 

800 

1000 

1200 

1400 

45  26 

20 

11 

10 

7 

5 

2 

45  40 

25 

17 

10 

7 

6 

2 

2.  The  following  is  an  edited  printout  of  computer  results 
obtained  when  fitting  the  PH  model  containing  only 
the  treatment  group  variable.  Based  on  these  results, 
what  would  you  conclude  about  the  PH  assumption 
regarding  the  treatment  group  variable?  Explain. 


Cox 

regression 
Analysis 
time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

[95%  Conf. 

Haz.  Ratio  Interval] 

P(PH) 

Tx 

-0.267 

0.233 

0.253 

0.766  0.485  1.21 

0 

No.  of  subjects 

90 

Log  likelihood  =  —282.744 

3.  The  following  printout  shows  the  results  from  using  a 
heaviside  function  approach  with  an  extended  Cox 
model  to  fit  these  data.  The  model  used  product  terms 
of  the  treatment  variable  ( Tx )  with  each  of  three  heavi- 
side  functions.  The  first  product  term  (called  Timel) 
involves  a  heaviside  function  for  the  period  from  0  to 
250  days,  the  second  product  term  (i.e.,  Time2)  involves 
the  period  from  250  to  500  days,  and  the  third  product 
term  (i.e.,  Time3)  involves  the  open-ended  period  from 
500  days  and  beyond. 
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Time-Dependent  Cox  Regression  Analysis 


Analysis 
time_t:  survt 

Coef. 

Std.  Err. 

p  >  Izl 

Haz.  Ratio 

[95%  Conf. 
Interval] 

-1.511 

0.089  0.545 

-Iwl 

mm 

1.629 

0.675  3.934 

WSRM 

1.441 

0.604  3.440 

No.  of  subjects  =  90  Log  likelihood  =  —275.745 

Write  down  the  hazard  function  formula  for  the  extended 

Cox  model  being  used,  making  sure  to  explicitly  define  the 

heaviside  functions  involved. 

4.  Based  on  the  printout,  describe  the  hazard  ratios  in 
each  of  the  three  time  intervals,  evaluate  each  hazard 
ratio  for  significance,  and  draw  conclusions  about  the 
extent  of  the  treatment  effect  in  each  of  the  three  time 
intervals  considered. 

5.  Inspection  of  the  printout  provided  in  question  3  indi¬ 
cates  that  the  treatment  effect  in  the  second  and  third 
intervals  appears  quite  similar.  Consequently,  another 
analysis  was  considered  that  uses  only  two  intervals, 
from  0  to  250  days  versus  250  days  and  beyond.  Write 
down  the  hazard  function  formula  for  the  extended  Cox 
model  that  considers  this  situation  (i.e.,  containing  two 
heaviside  functions).  Also,  write  down  an  equivalent 
alternative  hazard  function  formula  which  contains 
the  main  effect  of  treatment  group  plus  one  heaviside 
function  variable. 

6.  For  the  situation  described  in  question  5,  the  computer 
results  are  provided  below.  Based  on  these  results, 
describe  the  hazard  ratios  for  the  treatment  effect 
below  and  above  250  days,  summarize  the  inference 
results  for  each  hazard  ratio,  and  draw  conclusions 
about  the  treatment  effect  within  each  time  interval. 

Time-Dependent  Cox  Regression  Analysis 


Analysis  time_t:  survt 
Column  name 

Coeff 

StErr 

p-value 

HR 

0.95 

Cl 

Timel 

-1.511 

0.461 

0.001 

0.221 

0.089 

0.545 

Time2 

0.427 

0.315 

0.176 

1.532 

0.826 

2.842 

Log  likelihood  =  —275.764 


No.  of  subjects  =  90 
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1.  All  three  variables  in  the  model  are  time-independent 
variables. 

2.  The  computer  results  indicate  that  the  Sex  variables  do 
not  satisfy  the  PH  assumption  because  the  P(PH)  value 
is  0.042,  which  is  significant  at  the  0.05  level. 

3.  h(t,X(t))  =  h0(t)  exp [p! (sex)  +  P2(log  WBC)  +  P3(/fy) 

+  §i(sex)gi(r)  +  §2(log  WBC)g2(r) 

+  S3(/to)g3(f)] 

where  the  g,(f)  are  functions  of  time. 

4.  Model  1  (one  heaviside  function) 

h(t,X(t))  =  ho(t)  exp [Pj  (sex)  +  p2(log  WBC)  +  P3(/?x) 

+  8i(sex)gi(r)] 

where 


1  if  0  <t<  15  weeks 
0  if  t  >  15  weeks 


Model  2  (two  heaviside  functions): 
h(t,X(t))  =  /;o(f)exp[P2(log  WBC)  +  p3(/?x)  +  §i(sex)gi(t) 
+  §2(sexfy2(f)] 
where 


8i(t) 


1  if  0  <t<  15  weeks 
0  if  t  >  15  weeks 


and 


8i{t) 


0  if  t  >  15  weeks 
1  if  0  <  t  <  15  weeks 


5.  The  estimated  hazard  ratio  for  the  effect  of  Rx  is  3.822; 
this  estimate  is  adjusted  for  log  WBC  and  for  the  Sex 
variable  considered  as  two  time-dependent  variables 
involving  heaviside  functions.  The  Wald  test  for  signifi¬ 
cance  of  Rx  has  a  p-value  of  0.004,  which  is  highly 
significant.  The  95%  confidence  interval  for  the  treat¬ 
ment  effect  ranges  between  1.533  and  9.526,  which  is 
quite  wide,  indicating  considerable  unreliability  of  the 
3.822  point  estimate.  Nevertheless,  the  results  estimate 
a  statistically  significant  treatment  effect  of  around  3.8. 

h(t,X(t ))  =  /■z0(f)exp[p1(sex)  +  p2(log  WBC)  +  p3(i?x)  + 
8i(sex  x  t)] 


6. 
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7.  The  hazard  ratio  for  the  effect  of  Sex  in  each  time 
interval,  controlling  for  Rx  and  log  WBC  is  given  as 
follows: 

t  =  8  weeks  HR  =  exp  +  8§i 
t  =  16  weeks  HR  =  exp  (3,  +  168i 

8.  Using  the  model  containing  Sex,  log  WBC,  Rx,  and  Sex 
x  Time,  the  estimated  hazard  ratio  for  the  treatment 
effect  is  given  by  2.984,  with  a  p-value  of  0.022  and  a 
95%  confidence  interval  ranging  between  1.167  and 
7.626.  The  point  estimate  of  2.984  is  quite  different 
from  the  point  estimate  of  3.822  for  the  heaviside  func¬ 
tion  model,  although  the  confidence  intervals  for  both 
models  are  wide  enough  to  include  both  estimates.  The 
discrepancy  between  point  estimates  demonstrates 
that  when  a  time-dependent  variable  approach  is  to 
be  used  to  account  for  a  variable  not  satisfying  the  PH 
assumption,  different  results  may  be  obtained  from 
different  choices  of  time-dependent  variables. 

9.  The  stratified  Cox  analysis  yields  a  hazard  ratio  of  2.537 
with  a  p-value  of  0.048  and  a  95%  Cl  ranging  between 
1.006  and  6.396.  The  point  estimate  is  much  closer  to 
the  2.984  for  the  model  containing  the  Sex  x  Time 
product  term  than  to  the  3.822  for  the  model  contain¬ 
ing  two  heaviside  functions.  One  way  to  choose 
between  models  would  be  to  compare  goodness-of-fit 
test  statistics  for  each  model;  another  way  is  to  com¬ 
pare  graphs  of  the  adjusted  survival  curves  for  each 
model  and  determine  by  eye  which  set  of  survival 
curves  fits  the  data  better. 
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Introduction 


Abbreviated 

Outline 


The  Cox  model  is  the  most  widely  used  survival  model 
in  the  health  sciences,  but  it  is  not  the  only  model 
available.  In  this  chapter  we  present  a  class  of  survival 
models,  called  parametric  models,  in  which  the  distri¬ 
bution  of  the  outcome  (i.e.,  the  time  to  event)  is  specified 
in  terms  of  unknown  parameters.  Many  parametric  models 
are  acceleration  failure  time  models  in  which  survival 
time  is  modeled  as  a  function  of  predictor  variables. 
We  examine  the  assumptions  that  underlie  accelerated 
failure  time  models  and  compare  the  acceleration  factor 
as  an  alternative  measure  of  association  to  the  hazard 
ratio.  We  present  examples  of  the  exponential,  Weibull, 
and  log-logistic  models  and  give  a  brief  description  of 
other  parametric  approaches.  The  parametric  likelihood 
is  constructed  and  described  in  relation  to  left,  right,  and 
interval-censored  data.  Binary  regression  is  presented  as 
an  alternative  approach  for  modeling  interval-censored 
outcomes.  The  chapter  concludes  with  a  discussion  of 
frailty  models. 

The  outline  below  gives  the  user  a  preview  of  the  material 
covered  by  the  presentation.  A  detailed  outline  for  review 
purposes  follows  the  presentation. 

I.  Overview  (pages  292-294) 

II.  Probability  Density  Function  in  Relation  to 

the  Hazard  and  Survival  Function 

(pages  294-295) 

III.  Exponential  Example  (pages  295-297) 

IV.  Accelerated  Failure  Time  Assumption 

(pages  298-300) 

V.  Exponential  Example  Revisited  (pages  300-304) 

VI.  Weibull  Example  (pages  304-309) 

VII.  Log-Logistic  Example  (pages  309-314) 

VIII.  A  More  General  Form  of  the  AFT  Model 

(pages  314-316) 

IX.  Other  Parametric  Models  ( pages  316-318) 

X.  The  Parametric  Likelihood  ( pages  318-321) 

XI.  Interval-Censored  Data  (pages  321-326) 

XII.  Frailty  Models  (pages  326-340) 

XIII.  Summary  (pages  341-344) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  the  form  of  a  parametric  survival 
model  and  contrast  it  with  a  Cox  model. 

2.  State  common  distributions  used  for  parametric 
survival  models. 

3.  Contrast  an  AFT  model  with  a  PH  model. 

4.  Interpret  output  from  an  exponential  survival  model. 

5.  Interpret  output  from  a  Weibull  survival  model. 

6.  Interpret  output  from  a  log-logistic  survival  model. 

7.  State  or  recognize  the  formulation  of  a  parametric 
likelihood. 

8.  State  or  recognize  right-censored,  left-censored,  and 
interval-censored  data. 

9.  State  or  recognize  the  form  of  a  frailty  model  and  the 
purpose  of  including  a  frailty  component. 

10.  Interpret  the  output  obtained  from  a  frailty  model. 
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Presentation 

I.  Overview 


parametric  models 
exponential  example 
AFT  vs.  PH 
Weibull  example 
log-logistic  example 
other  approaches 
parametric  likelihood 
interval-censoring 
frailty  models 


Parametric  Modeling 

•  Outcome  assumed  to  follow 
some  family  of  distributions 

•  Exact  distribution  is  unknown 
if  parameters  are  unknown 

•  Data  used  to  estimate 
parameters 

•  Examples  of  parametric  models: 
o  Linear  regression 

o  Logistic  regression 
o  Poisson  regression 


Distributions  commonly  used  for 
parametric  survival  models: 

•  Weibull 

•  Exponential 

•  Log-logistic 

•  Lognormal 

•  Generalized  gamma 


In  this  chapter  we  present  parametric  survival 
models  and  the  assumptions  that  underlie 
these  models.  Specifically  we  examine  the 
accelerated  failure  time  (AFT)  assumption 
and  contrast  it  to  the  proportional  hazards 
(PH)  assumption.  We  present  examples  of 
several  parametric  models,  including  the 
exponential  model,  the  Weibull  model,  and  the 
log-logistic  model.  The  parametric  likelihood 
is  discussed  and  how  it  accommodates  left-, 
right-,  and  interval-censored  data.  We  also  con¬ 
sider  models  that  include  a  frailty  component 
to  account  for  unobserved  heterogeneity. 

Linear  regression,  logistic  regression,  and 
Poisson  regression  are  examples  of  parametric 
models  that  are  commonly  used  in  the  health 
sciences.  With  these  models,  the  outcome  is 
assumed  to  follow  some  distribution  such 
as  the  normal,  binomial,  or  Poisson  distri¬ 
bution.  Typically,  what  is  actually  meant  is 
that  the  outcome  follows  some  family  of 
distributions  of  similar  form  with  unknown 
parameters.  It  is  only  when  the  value  of  the 
parameter(s)  is  known  that  the  exact  distri¬ 
bution  is  fully  specified.  For  example,  if  one 
distribution  is  normal  with  a  mean  of  3  and 
another  distribution  is  normal  with  a  mean 
of  7,  the  distributions  are  of  the  same  family 
(i.e.,  normal)  but  they  are  not  exactly  the  same 
distribution.  For  parametric  regression  mod¬ 
els,  the  data  are  typically  used  to  estimate  the 
values  of  the  parameters  that  fully  specify  that 
distribution. 

A  parametric  survival  model  is  one  in  which 
survival  time  (the  outcome)  is  assumed  to  fol¬ 
low  a  known  distribution.  Examples  of  distribu¬ 
tions  that  are  commonly  used  for  survival  time 
are:  the  Weibull,  the  exponential  (a  special 
case  of  the  Weibull),  the  log-logistic,  the  log¬ 
normal,  and  the  generalized  gamma,  which 
are  supported  by  SAS,  Stata,  and  R  software. 
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Parametric  survival  models 

\ 

Distribution  specified  for  time 
Cox  model  is  semiparametric: 

\ 

Baseline  survival  not  specified 


The  Cox  proportional  hazards  model,  by 
contrast,  is  not  a  fully  parametric  model. 
Rather  it  is  a  semi-parametric  model  because 
even  if  the  regression  parameters  (the  betas)  are 
known,  the  distribution  of  the  outcome  remains 
unknown.  The  baseline  survival  (or  hazard) 
function  is  not  specified  in  a  Cox  model. 


Cox  model  widely  popular: 

•  No  reliance  on  assumed 
distribution 

•  Computer  packages  can  output 
Cox-adjusted  survival  estimates 
using  algorithm  that 
generalizes  KM 

•  Baseline  not  necessary  for 
estimation  of  hazard  ratio 


A  key  reason  why  the  Cox  model  is  widely 
popular  is  that  it  does  not  rely  on  distributional 
assumptions  for  the  outcome.  Although  the 
baseline  survival  function  is  not  estimated  with 
a  Cox  model,  computer  packages  such  as  SAS, 
Stata,  SPSS,  and  R  can  output  Cox-adjusted 
survival  estimates  (see  Computer  Appendix)  by 
using  a  complicated  algorithm  that  generalizes 
the  Kaplan-Meier  (KM)  approach  while 
making  use  of  estimated  regression  coefficients 
obtained  from  a  Cox  model  (Kalbfleisch  and 
Prentice,  1980).  Also,  an  estimation  of  the  base¬ 
line  hazard  is  not  necessary  for  the  estimation 
of  a  hazard  ratio  because  the  baseline  hazard 
cancels  in  the  calculation. 


S(t) 
1.0  I- 


In  theory,  as  time  ranges  from  0  to  infinity,  the 
survival  function  can  be  graphed  as  a  smooth 
curve  from  S(0)  =  1  to  S(oo)  =  0  (see  Chapter  1). 
Kaplan-Meier  and  Cox-adjusted  survival  esti¬ 
mates  use  empirical  nondistributional  methods 
that  typically  graph  as  step  functions,  particu¬ 
larly  if  the  sample  size  is  small.  If  in  the  data, 
for  example,  an  event  occurred  at  3  weeks  and 
the  next  event  occurred  at  7  weeks,  then  the 
estimated  survival  curve  would  be  flat  between 
3  and  7  weeks  using  these  nondistributional 
approaches.  Moreover,  if  the  study  ends  with 
subjects  still  remaining  at  risk,  then  the  esti¬ 
mated  survival  function  would  not  go  all  the 
way  down  to  zero. 


i - 1 - 1 - 

0  3  7 

Step  function  (nondistributional 
estimates) 
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Appeal  of  Parametric  Survival 
Models 

•  More  consistent  with  theoretical 
S(t)  than  nondistributional 
approaches 

•  Simplicity 

•  Completeness  —  h(t)  and  S(t) 
specified 


Survival  estimates  obtained  from  parametric 
survival  models  typically  yield  plots  more  con¬ 
sistent  with  a  theoretical  survival  curve.  If  the 
investigator  is  comfortable  with  the  underlying 
distributional  assumption,  then  parameters 
can  be  estimated  that  completely  specify  the 
survival  and  hazard  functions.  This  simplicity 
and  completeness  are  the  main  appeals  of 
using  a  parametric  approach. 


II.  Probability  Density 
Function  in  Relation 
to  the  Hazard  and 
Survival  Function 


Probability  density  function  known 

a- 


Survival  and  hazard  functions 


f(t)  =  dF(t)/dt  where 
F(t)  =  Pr(T  <t) 


S(t) 

h(t) 


P(T  >  t )  =  J  f(u)du 

t 

- d[S(t)\/dt 
S(t) 


For  parametric  survival  models,  time  is  assumed 
to  follow  some  distribution  whose  probability 
density  function  f(t)  can  be  expressed  in  terms 
of  unknown  parameters.  Once  a  probability  den¬ 
sity  function  is  specified  for  survival  time,  the 
corresponding  survival  and  hazard  functions 
can  be  determined.  The  survival  function  S(t)  = 
P(T  >  t)  can  be  ascertained  from  the  probability 
density  function  by  integrating  over  the  proba¬ 
bility  density  function  from  time  t  to  infinity. 
The  hazard  can  then  be  found  by  dividing  the 
negative  derivative  of  the  survival  function  by 
the  survival  function  (see  left). 


Survival  in  terms  of  hazard 


S(t)  =  exp 


The  survival  function  can  also  be  expressed  in 
terms  of  the  hazard  function  (see  Chapter  1)  by 
exponentiating  the  negative  of  the  cumulative 
hazard  function.  The  cumulative  hazard  func¬ 
tion  is  the  integral  of  the  hazard  function 
between  integration  limits  of  0  and  t. 


t 

Cumulative  hazard:  f  h(u)du 
o 


fin  =  hum) 


Finally,  the  probability  density  function  can  be 
expressed  as  the  product  of  the  hazard  and  the 
survival  functions,  f(t )  =  h(t)S(t). 


Key  Point  The  key  point  is  that  specifying  any  one  of 

the  probability  density  function,  survival 
Specifying  one  of  f(t),  S(t),  or  h(t)  function,  or  hazard  function  allows  the 
specifies  all  three  functions  other  two  functions  to  be  ascertained  by 

using  the  formulas  shown  on  the  left. 
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Survival  and  Hazard  Functions  for 
Selected  Distributions 


Distribution 

Sit) 

hit) 

Exponential 

Weibull 

Log-logistic 

exp  i—Xt) 
exp(— Xtp) 

1 

X 

Xpf~l 

Xpf 

1  +XtP 

1  +XtP 

fit)  =  h  (t)S  it) 

For  example,  Weibull: 
fit)  =  Xptp~l  expi—Xtp) 
because  h(t)  =  Xptp~l  and 
S(t)  =  exp(-ktp) 


On  the  left  is  a  table  containing  the  survival 
and  hazard  functions  for  three  of  the  more 
commonly  used  distributions  for  survival 
models:  the  exponential,  Weibull,  and  log- 
logistic  distributions. 

The  exponential  is  a  one-parameter  distribu¬ 
tion  with  a  constant  hazard  X.  The  Weibull 
and  log-logistic  distributions  have  two  para¬ 
meters  X  and  p.  Notice  that  the  Weibull  distri¬ 
bution  reduces  to  the  exponential  if  p  =  1 .  The 
probability  density  function  for  these  distribu¬ 
tions  can  be  found  by  multiplying  h(t)  and  S(t). 
As  an  example,  the  Weibull  probability  density 
function  is  shown  on  the  left. 


Typically  in  parametric  models: 

•  X  reparameterized  for 
regression 

•  p  held  fixed 


Typically  for  parametric  survival  models,  the 
parameter  X  is  reparameterized  in  terms  of 
predictor  variables  and  regression  parameters 
and  the  parameter  p  (sometimes  called  the 
shape  parameter)  is  held  fixed.  This  is  illu¬ 
strated  in  the  examples  to  come. 


III.  Exponential  Example  The  first  example  we  consider  is  the  exponen¬ 
tial  model,  which  is  the  simplest  parametric 
survival  model  in  that  the  hazard  is  constant 
over  time  (i.e.,  h(t)  =  X).  The  model  is  applied  to 
the  remission  data  (Freireich  et  al.,  1963),  in 
which  42  leukemia  patients  were  followed  until 
remission  or  censorship.  Twenty-one  patients 
received  an  experimental  treatment  (coded 
TRT  =1)  and  the  other  21  received  a  placebo 
(coded  TRT  =  0).  The  data  are  listed  in  Chapter  1 . 
The  variable  TRT  is  just  a  reverse  coding  of 
the  variable  RX  presented  in  Chapter  3. 


Simplest  parametric  survival  model: 
Hazard  function:  hit)  =  X 
(where  X  is  a  constant) 


EXAMPLE 


Remission  data  (n  =  42) 

21  patients  given  treatment  (TRT  =  1) 
21  patients  given  placebo  (TRT  =  0) 
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h(t)  =  X  =  exp(p0  +  TRT) 


TRT=l:h(t)  =  exp(Po+pi) 
TRT  =  0:  h(t)  =  exp(p0) 


HR(TRT  =  1  vs.  TRT  =  0) 


exp(/?o  +  Pj) 
exp(P0) 


exp()?i) 


For  simplicity,  we  demonstrate  an  exponential 
model  that  has  TRT  as  the  only  predictor.  We 
state  the  model  in  terms  of  the  hazard  by  repar¬ 
ameterizing  X  as  exp(p0  +  PiTRT).  With  this 
model,  the  hazard  for  subjects  in  the  treated 
group  is  exp(p0  +  Pi)  and  the  hazard  for  the 
placebo  group  is  exp(p0).  The  hazard  ratio  com¬ 
paring  the  treatment  and  placebo  (see  left  side) 
is  the  ratio  of  the  hazards  exp(Pi).  The  expo¬ 
nential  model  is  a  proportional  hazards  model. 


Constant  Hazards 

=>■  Proportional  Hazards 

Proportional  Hazards 

Constant  Hazards 

Exponential  Model  —  Hazards  are 
constant 

Cox  PH  Model  —  Hazards  are  pro¬ 
portional  not  necessarily  constant 


The  assumption  that  the  hazard  is  constant  for 
each  pattern  of  covariates  is  a  much  stronger 
assumption  than  the  PH  assumption.  If  the 
hazards  are  constant,  then  of  course  the  ratio 
of  the  hazards  is  constant.  However,  the  haz¬ 
ard  ratio  being  constant  does  not  necessar¬ 
ily  mean  that  each  hazard  is  constant.  In  a 
Cox  PH  model  the  baseline  hazard  is  not 
assumed  constant.  In  fact,  the  form  of  the  base¬ 
line  hazard  is  not  even  specified. 


Remission  Data 

Exponential  regression 
log  hazard  form 


_t 

Coef. 

Std. 

Err. 

z 

P>|z| 

trt 

-1.527 

.398 

-3.83 

0.00 

_cons 

-2.159 

.218 

-9.90 

0.00 

Output  from  running  the  exponential  model  is 
shown  on  the  left.  The  model  was  run  using 
Stata  software.  The  parameter  estimates  are 
listed  under  the  column  called  Coef.  The 
parameter  estimate  for  the  coefficient  of  TRT 
(pi)  is  —1.527.  The  estimate  of  the  intercept 
(called  cons  in  the  output)  is  —2.159.  The  stan¬ 
dard  errors  (Std.  Err.),  Wald  test  statistics  (z), 
and  p-values  for  the  Wald  test  are  also 
provided.  The  output  indicates  that  the  z  test 
statistic  for  TRT  is  statistically  significant  with 
a  p-value  <0.005  (rounds  to  0.00  in  the  output). 


Coefficient  estimates  obtained  by  MLE 

\ 


asymptotically  normal 


The  regression  coefficients  are  estimated  using 
maximum  likelihood  estimation  (MLE),  and 
are  asymptotically  normally  distributed. 
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TRT  =  1:  h(t)  =  exp(— 2.159 
+  (-1.527))  =  0.025 
TRT  =  0:  h(t)  =  exp(-2.159) 

=  0.115 

HR  (TRT  =  1  vs  .0)  =  exp(- 1.527) 
=  0.22 

95%CI  =  exp [—1.527  ±  1.96(0.398)] 
=  (0.10,0.47) 


The  estimated  hazards  for  TRT  =  1  and  TRT 
0  are  shown  on  the  left.  The  estimated  hazard 
ratio  of  0.22  is  obtained  by  exponentiating 
the  estimated  coefficient  (—1.527)  of  the  TRT 
variable.  A  95%  confidence  interval  can  be 
calculated  exp[- 1.527  ±  1.96(0.398)]  yielding 
a  Cl  of  (0.10,  0.47).  These  results  suggest  that 
the  experimental  treatment  delays  remission. 


Results:  suggest  treatment  lowers 
hazard 


Parametric  models 

•  Need  not  be  PH  models 

•  Many  are  AFT  models 

Exponential  and  Weibull 

•  Accommodate  PH  and  AFT 
assumptions 


Up  to  this  point  in  the  book,  the  key  assump¬ 
tion  for  survival  models  has  been  the  pro¬ 
portional  hazard  assumption.  However, 
parametric  survival  models  need  not  be  PH 
models.  Many  parametric  models  are  accel¬ 
eration  failure  time  (AFT)  models  rather 
than  PH  models.  The  exponential  and  Weibull 
distributions  can  accommodate  both  the  PH 
and  AFT  assumptions. 


Remission  Data 

Exponential  regression  accelerated 
failure-time  form 


_t 

Coef. 

Std.  Err. 

z 

p  >  M 

trt 

1.527 

.398 

3.83 

0.00 

_cons 

2.159 

.218 

9.90 

0.00 

On  the  left  is  Stata  output  from  the  AFT  form  of 
the  exponential  model  with  TRT  as  the  only 
predictor.  Stata  can  output  both  the  PH  or 
AFT  form  of  an  exponential  or  Weibull  model 
(see  Computer  Appendix).  SAS  only  runs  the 
AFT  form  of  parametric  models  and  SPSS  does 
not  yet  provide  commands  to  run  parametric 
models. 


AFT  vs.  PH 

•  Different  interpretation  of 
parameters 

•  AFT  applies  to  comparison  of 
survival  times 

•  PH  applies  to  comparison  of 
hazards 


The  interpretation  of  parameters  differs  for 
AFT  and  PH  models.  The  AFT  assumption  is 
applicable  for  a  comparison  of  survival  times 
whereas  the  PH  assumption  is  applicable  for  a 
comparison  of  hazards.  In  the  following  sec¬ 
tions,  we  discuss  the  AFT  assumption  and 
then  revisit  this  example  and  discuss  the  AFT 
form  of  this  model. 
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IV.  Accelerated  Failure 
Time  Assumption 


AFT  —  Multiplicative  effect  with 
survival  time 

PH  —  Multiplicative  effect  with 
hazard 


sD(t) 


/ 


SH(7t) 

\ 


Survival  Function  Survival  Function 

For  Dogs  For  Humans 


AFT  models: 

Describe  “stretching  out”  or 
contraction  of  survival  time 


The  underlying  assumption  for  AFT  models  is 
that  the  effect  of  covariates  is  multiplicative 
(proportional)  with  respect  to  survival  time, 
whereas  for  PH  models  the  underlying  assump¬ 
tion  is  that  the  effect  of  covariates  is  multipli¬ 
cative  with  respect  to  the  hazard. 


To  illustrate  the  idea  underlying  the  AFT 
assumption,  consider  the  lifespan  of  dogs.  It 
is  often  said  that  dogs  grow  older  seven  times 
faster  than  humans.  So  a  10-year-old  dog  is  in 
some  way  equivalent  to  a  70-year-old  human. 
In  AFT  terminology  we  might  say  the  probabil¬ 
ity  of  a  dog  surviving  past  10  years  equals  the 
probability  of  a  human  surviving  past  70  years. 
Similarly,  we  might  say  the  probability  of  a  dog 
surviving  past  6  years  equals  the  probability 
of  a  human  surviving  past  42  years  because 
42  equals  6  times  7.  More  generally  we  can 
say  SD(t)  =  SH(7t),  where  SD(t)  and  SH(t)  are 
the  survival  functions  for  dogs  and  humans, 
respectively.  In  this  framework  dogs  can  be 
viewed,  on  average,  as  accelerating  through 
life  7  times  faster  than  humans.  Or  from  the 
other  perspective,  the  lifespan  of  humans,  on 
average,  is  stretched  out  7  times  longer  than 
the  lifespan  of  dogs.  AFT  models  describe  this 
“stretching  out”  or  contraction  of  survival 
time  as  a  function  of  predictor  variables. 


Second  Illustration 

51  (t)  —  Survival  function  for 

smokers 

52  (t)  —  Survival  function  for 

nonsmokers 

AFT  assumption: 

S2(t)  =  Si(yt)  for  t  >  0 
Y  is  the  acceleration  factor 

If  y  =  exp(P) 

S2(t)  =  Si([exp(oc)]t) 
or 

S2([exp(— oc)]t)  =  S,(t) 


For  a  second  illustration  of  the  accelerated  fail¬ 
ure  time  assumption  consider  a  comparison  of 
survival  functions  among  smokers  Si(t)  and 
non-smokers  S2(t).  The  AFT  assumption  can 
be  expressed  as  S2(t)  =  Si(yt)  for  t  >  0,  where 
y  is  a  constant  called  the  acceleration  factor 
comparing  smokers  to  nonsmokers.  In  a 
regression  framework  the  acceleration  factor 
Y  could  be  parameterized  as  exp(a)  where  a  is 
a  parameter  to  be  estimated  from  the  data. 
With  this  parameterization,  the  AFT  assump¬ 
tion  can  be  expressed  as  S2(t)  =  Si(exp(a)t)  or 
equivalently:  S2(exp(— a)t)  =  S ,  ( t)  for  t  >  0. 


Presentation:  IV.  Accelerated  Failure  Time  Assumption  299 


Suppose  exp(a)  =  0.75 
then 

S2(80)  =  Si(60) 
S2(40)  =  Si(30) 

More  generally 
S2(t)  =  S!(0.75t) 


Suppose  exp(a)  =  0.75;  then  the  probability  of  a 
nonsmoker  surviving  80  years  equals  the  prob¬ 
ability  of  a  smoker  surviving  80(0.75)  or  60 
years.  Similarly,  the  probability  of  a  non- 
smoker  surviving  40  years  equals  the  probabil¬ 
ity  of  a  smoker  surviving  30  years.  More 
generally,  the  probability  of  a  nonsmoker  sur¬ 
viving  t  years  equals  the  probability  of  a 
smoker  surviving  0.75  times  t  years  (i.e.,  S2(t) 
=  Sj  (0.75t)). 


T  |  —  Survival  time  for  smokers 
T2  —  Survival  time  for  nonsmokers 

AFT  assumption  in  terms  of  ran¬ 
dom  variables: 

Ti  =  yT2 


The  AFT  assumption  can  also  be  expressed  in 
terms  of  random  variables  for  survival  time 
rather  than  the  survival  function.  If  T2  is  a 
random  variable  (following  some  distribution) 
representing  the  survival  time  for  nonsmokers 
and  Tj  is  a  random  variable  representing  the 
survival  time  for  smokers,  then  the  AFT 
assumption  can  be  expressed  as  Ti  =  y  T2. 


Acceleration  factor 

Measure  of  association 
on  survival  time 

Hazard  ratio 

Measure  of  association  on  the 
hazard 


The  acceleration  factor  is  the  key  measure  of 
association  obtained  in  an  AFT  model.  It  allows 
the  investigator  to  evaluate  the  effect  of  predic¬ 
tor  variables  on  survival  time  just  as  the  hazard 
ratio  allows  the  evaluation  of  predictor  vari¬ 
ables  on  the  hazard. 


Acceleration  factor  (y) 

•  Describes  stretching  or 
contraction  of  S(t) 

•  Ratio  of  times  to  any  fixed 
value  of  S(t) 

Suppose  y  =  2.0 

(Group  2  vs.  Group  1) 

•  Time  to  S(t)  =  0.50  (median)  is 
double  for  Group  2 

•  Time  to  S(t)  =  0.20  is  double  for 
Group  2 

•  Time  to  S(t)  =  0.83  is  double  for 
Group  2 

•  Time  to  S(t)  =  0.98  is  double  for 
Group  2 

•  Time  to  S(t)  =  q  is  double  for 
Group  2  (generalization) 


The  acceleration  factor  describes  the  “stretch¬ 
ing  out”  or  contraction  of  survival  functions 
when  comparing  one  group  to  another.  More 
precisely,  the  acceleration  factor  is  a  ratio 
of  survival  times  corresponding  to  any 
fixed  value  of  S(t).  For  example,  if  the  acceler¬ 
ation  factor  comparing  subjects  in  Group  2  vs. 
Group  1  is  y  =  2.0,  then  the  median  survival 
time  (value  of  t  when  S(t)  =  0.5)  for  Group  2  is 
double  the  median  survival  time  for  Group  1. 
Moreover,  the  time  it  takes  for  S(t)  to  equal  0.2 
or  0.83  or  0.98  is  double  for  Group  2  compared 
to  Group  1  for  the  same  value  of  S(t).  In  gen¬ 
eral,  the  acceleration  factor  is  a  ratio  of  survival 
times  corresponding  to  any  quantile  of  survival 
time  (S(t)  =  q). 
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Survival  curves  for  Group  1  (G  =  1) 
and  Group  2  (G  =  2) 


This  idea  is  graphically  illustrated  by 
examining  the  survival  curves  for  Group  1 
(G  =  1)  and  Group  2  (G  =  2)  shown  on  the 
left.  For  any  fixed  value  of  S(t),  the  distance  of 
the  horizontal  line  from  the  S(t)  axis  to  the 
survival  curve  for  G  =  2  is  double  the  distance 
to  the  survival  curve  for  G  =  1.  Notice  the 
median  survival  time  (as  well  as  the  25th  and 
75th  percentiles)  is  double  for  G  =  2.  For  AFT 
models,  this  ratio  of  survival  times  is 
assumed  constant  for  all  fixed  values  of  S(t). 


Horizontal  lines  are  twice  as  long 
to  G  =  2  compared  to  G  =  1 
because  y  =  2 


V.  Exponential  Example 
Revisited 


Remission  data  (n  =  42) 

2 1  patients  given  treatment  (TRT  =  1 ) 
2 1  patients  given  placebo  (TRT  =  0) 


Previously  discussed  PH  form  of 
model 

Now  discuss  AFT  form  of  model 


We  return  to  the  exponential  example  applied 
to  the  remission  data  with  treatment  status 
(TRT)  as  the  only  predictor.  In  Section  III, 
results  from  the  PH  form  of  the  exponential 
model  were  discussed.  In  this  section  we 
discuss  the  AFT  form  of  the  model. 


Exponential  survival  and  hazard 
functions: 

S(t)  =  exp(-^t) 
h(t)  =  X 

Recall  for  PH  model: 


The  exponential  survival  and  hazard  functions 
are  shown  on  the  left.  Recall  that  the  exponen¬ 
tial  hazard  is  constant  and  can  be  reparameter¬ 
ized  as  a  PH  model,  h(t)  =  X  =  exp(p0  +  Pi 
TRT).  In  this  section  we  show  how  S(t)  can  be 
reparameterized  as  an  AFT  model. 


h(t)  =  X  =  exp(p0  +  Pj  TRT) 


Presentation:  V.  Exponential  Example  Revisited  301 


AFT  assumption 
(comparing  2  levels  of  TRT) 

•  Ratio  of  times  is  constant  for  all 
fixed  S(t) 

Strategy  for  developing  the  model: 


The  underlying  AFT  assumption,  for  compar¬ 
ing  the  two  levels  of  the  TRT  covariate,  is  that 
the  ratio  of  times  to  any  fixed  value  of  S(t)  =  q 
is  constant  for  any  probability  q.  We  develop 
the  AFT  model  with  the  survival  function  by 
solving  for  t  in  terms  of  S(t).  We  then  scale 
t  in  terms  of  the  predictors. 


•  Solve  for  t  in  terms  of  S(t) 

•  Scale  t  in  terms  of  the  predictors 


S(t)  =  exp(-At) 
t  =  [— ln(S(t)]  x  i 

let  ^  =  exp(ao  +  otiTRT) 

A 

t  =  [-ln(S(t)]  x  exp(ao  +  aiTRT) 


/ 

Scaling  oft  ' 

Median  survival  time,  S(t)  =  0.5: 


tm  =  [— ln(0.5)]  x  exp(ao  +  oqTRT) 


The  exponential  survival  function  is  S(t)  =  exp 
(— A/).  By  solving  for  t,  we  can  obtain  a  formula 
for  t  in  terms  of  S(t).  Taking  the  natural  log, 
multiplying  by  negative  1,  and  then  multiply¬ 
ing  by  the  reciprocal  of  A,  yields  the  expression 
for  t  shown  on  the  left.  By  reparameterizing 
1/A  =  exp(a0  +  oqTRT),  or  equivalently  A  = 
exp[-(a0  +  a^RT)],  it  can  be  seen  how  the 
predictor  variable  TRT  is  used  to  scale  the 
time  to  any  fixed  value  of  S(t)  (see  left).  For 
example,  to  find  an  expression  for  the  median 
survival  time  tm,  substitute  S(t)  =  0.5  (see  left). 


Let  S(t)  =  q 

t  =  [-ln(q)]  x  exp(a0  +  aiTRT) 

Acceleration  Factor: 
y(TRT  =  1  vs.  TRT  =  0) 

^  _  [-  ln(</)]  exp(a0  +  «i) 

[-  ln(qr)]  exp(«o) 

=  exp(oq) 

STRT=o(t)  =  STRT=i(yt) 
where  y  =  exp(ai) 

AFT  :  \  =  exp(a0  +  aiTRT) 

A 

PH  :  X  =  exp(/?0  +  //jTRT) 


The  expression  for  t  is  restated  on  the  left  in 
terms  of  any  fixed  probability  S(t)  =  q.  The 
acceleration  factor  y  is  found  by  taking  the 
ratio  of  the  times  to  S(t)  =  q  for  TRT  =  1  and 
TRT  =  0.  After  canceling,  y  reduces  to  exp(ai). 

Thus,  the  exponential  hazard  function  for  the 
remission  data  involving  a  single  (0,1)  TRT 
predictor  satisfies  the  AFT  model  assumption 
shown  at  the  left. 

The  formula  for  A  as  the  AFT  model  form 
is  compared  at  the  left  with  the  (different) 
formula  for  A  in  the  PH  model  form. 
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Remission  Data 

Exponential  regression  accelerated 
failure-time  form 

_t  Coef.  Std.  Err.  z  P>|z| 

trt  1.527  .398  3.83  0.00 

_cons  2.159  .218  9.90  0.00 

y  =  exp(1.527)  =  4.60 
95%  Cl:  exp[1.527  ±  1.96(0.398)]  = 
(2.11,  10.05) 

t  =  [-ln(q)]  x  exp(oc0  +  oqTRT) 
t  =  [— ln(q)] 

x  exp(2.159  +  1.527(TRT)) 


Estimated  Survival  Times  by  S(t) 
Quartiles  for  TRT  =  1  and 
TRT  =  0  (Exponential  Model) 


s(t)  =  q 

r'i'> 

H 

II 

O 

C  rt  i 

0.25 

12.0 

55.3 

0.50 

6.0 

27.6 

0.75 

2.5 

11.5 

y  =  4.60  (for  TRT  =  1  vs.  TRT  =  0) 
Ratio  of  survival  times: 


12.0  6.0  2.5 

Effect  of  treatment: 

•  Stretches  survival  by  a  factor  of 
4.6 

•  Interpretation  of  y  has  intuitive 
appeal 


On  the  left  is  Stata  output  from  the  AFT  form  of 
the  exponential  model  with  TRT  as  the  only 
predictor.  The  estimate  of  the  coefficient  for 
TRT  is  1.527  with  a  standard  error  of  0.398. 
An  estimate  of  the  acceleration  factor  for  treat¬ 
ment  is  y  =  exp(  1.527)  =  4.60.  A  95%  confi¬ 
dence  interval  for  y  is  calculated  as  exp[1.527 
±  1.96(0.398)]  yielding  a  Cl  of  (2.11,  10.05). 


The  parameter  estimates  can  be  used  to  esti¬ 
mate  the  time  t  to  any  value  of  S(t)  =  q.  The 
table  on  the  left  lists  the  estimated  time  (in 
weeks)  for  the  first,  second  (median),  and 
third  quartiles  of  S(t)  using  the  expression  for 
t  shown  above  for  both  the  treated  and  placebo 
groups.  In  this  example,  survival  time  is  the 
time  to  remission  for  leukemia  patients. 


The  ratio  of  survival  times  for  each  row  in  the 
table  comparing  TRT  =  1  vs.  TRT  =  0  is  4.60, 
which  not  coincidently  is  the  estimate  of  the 
acceleration  factor  (see  left).  The  estimated 
acceleration  factor  suggests  that  the  experi¬ 
mental  treatment  is  effective  for  delaying 
remission  by  stretching  survival  time  by  a  fac¬ 
tor  of  4.60.  Although  the  hazard  ratio  is  a 
more  familiar  measure  of  association  for 
health  scientists,  the  acceleration  factor  has 
an  intuitive  appeal,  particularly  for  describing 
the  efficacy  of  a  treatment  on  survival. 
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HR  and  y  are  reciprocals  in  expo¬ 
nential  models: 

HR{ TRT  =  1  vs.  0)  =  exp(- 1.527) 

=  0.22 

y(TRT  =  1  vs.  0)  =  exp(  1.527) 

=  4.60 

In  general 

y  >  1  =>■  exposure  benefits 
survival 

HR  >  1  =>■  exposure  harmful  to 
survival 

Y  >  1  =>■  exposure  harmful  to 
survival 

HR  <  1  =>■  exposure  benefits 
survival 


Recall  from  Section  III  that  the  hazard  ratio 
for  the  effect  of  treatment  was  estimated 
at  exp(— 1.527)  =  0.22  using  the  PH  form  of 
the  exponential  model.  This  result  illustrates 
a  key  property  of  the  exponential  model:  the 
corresponding  acceleration  factor  and  hazards 
ratio  (e.g.,  TRT  =  1  vs.  TRT  =  0)  are  reciprocals 
of  each  other.  This  property  is  unique  to  the 
exponential  model.  What  can  be  generalized, 
however,  is  that  an  acceleration  factor 
greater  than  one  for  the  effect  of  an  expo¬ 
sure  implies  that  being  exposed  (i.e.,  TRT  = 
1 )  is  beneficial  to  survival  whereas  a  hazard 
ratio  greater  than  one  implies  being  exposed 
is  harmful  to  survival  (and  vice  versa). 


Y  =  HR  =  1  =>•  no  effect  from 
exposure 


Exponential  PH  and  AFT  models: 

•  Same  model 

•  Different  parameterization 

•  Same  estimates  for 
o  Survival  function 
o  Hazard  function 
o  Median  survival 


Although  the  exponential  PH  and  AFT  models 
focus  on  different  underlying  assumptions, 
they  are  in  fact  the  same  model.  The  only 
difference  is  in  their  parameterization.  The 
resulting  estimates  for  the  survival  function, 
hazard  function,  and  median  survival  do 
not  differ  between  these  models  (see  Practice 
Exercises  6  and  7). 


304  7.  Parametric  Survival  Models 


For  those  experienced  with  Poisson 
regression: 

Exponential  and  Poisson  models 

•  Assume  a  constant  rate 

•  Different  data  structure 

o  Poisson  —  aggregate  counts 
o  Exponential  —  individual 
level 

•  Use  different  outcomes 

o  Poisson  —  number  of  cases 
o  Exponential  —  time  to  event 

•  Yield  equivalent  parameter 
estimates 

o  With  same  data  and  same 
covariates  in  the  model 


For  those  who  have  experience  with  Poisson 
regression,  there  is  a  close  connection  between 
the  exponential  and  Poisson  models.  Both  dis¬ 
tributions  assume  an  underlying  constant 
rate.  In  fact,  if  the  data  are  structured  such 
that  all  the  cases  and  the  total  time  at  risk  are 
aggregated  for  each  pattern  of  covariates  (e.g., 
TRT  =  1  and  TRT  =  0)  and  the  log  of  the 
corresponding  person-time  at  risk  is  used  as 
an  offset,  then  a  Poisson  model  will  yield  equiv¬ 
alent  parameter  estimates  as  the  exponential 
PH  model.  The  difference  is  that  the  random 
outcome  for  the  Poisson  model  is  the  count  of 
events  over  a  fixed  amount  of  time  at  risk  and 
the  random  outcome  for  the  exponential  model 
is  the  time  (at  risk)  to  event. 


Exponential  model  is  special  case  We  continue  with  the  remission  data  example 
of  Weibull  model  and  present  the  more  general  Weibull  model, 

which  includes  the  exponential  model  as  a  spe¬ 
cial  case.  In  the  next  section  we  also  show  a 
graphical  approach  for  evaluating  the  appro¬ 
priateness  of  the  Weibull  (and  thus  also  the 
exponential)  model. 


VI.  Weibull  Example 

Weibull  Model : 

Hazard  function:  h(t)  =  Xptp~l 

(where  p  >  0  and  X  >  0) 

p  is  a  shape  parameter 

•  p  >  1  hazard  increases  over 
time 

•  p  =  1  constant  hazard 
(exponential  model) 

•  p  <  1  hazard  decreases  over 
time 

Additional  shape  parameter  offers 

greater  flexibility 


The  Weibull  model  is  the  most  widely  used 
parametric  survival  model.  Its  hazard  function 
is  h(t)  =  Xpf~l,  where p  and  X  >  0.  As  with  the 
exponential  model,  X  will  be  reparameterized 
with  regression  coefficients.  The  additional 
parameter  p  is  called  a  shape  parameter  and 
determines  the  shape  of  the  hazard  function. 
It  p  >  1  then  the  hazard  increases  as  time 
increases.  If  p  =  1  then  the  hazard  is  constant 
and  the  Weibull  model  reduces  to  the  exponen¬ 
tial  model  (h(t)  /.).  II  /;  <  1  then  the  hazard 

decreases  over  time.  The  addition  of  this  shape 
parameter  gives  the  Weibull  model  greater 
flexibility  than  the  exponential  model,  yet  the 
hazard  function  remains  relatively  simple  (basi¬ 
cally  a  scaling  of  t  raised  to  some  fixed  power). 
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Unique  property  for  Weibull  model 
AFT  =>  PH  and  PH  =>  AFT 
Holds  if  p  is  fixed 

HR  vs.  AFT 

Hazard  ratio  =>■  Comparison  of  rates 

Acceleration  factor  =>  Effect  on 
survival 


Useful  Weibull  property: 

•  ln[— In  S(t)]  is  linear  with  ln(t) 

•  Enables  graphical  evaluation 
using  KM  survival  estimates 


Linearity  ofln(t) 

S(t)  =  expf-Ai7’) 

=>  ln[-ln  S(t)]  =  ln(A)  +  p  ln(t) 

Z'  t 

Intercept  =  ln(A),  Slope  =  p 

Remission  data:  evaluate  Weibull 
assumption  for  TRT  =  1  and  TRT  =  0 


ln[-ln  S  (t)]  plotted  against  ln(t) 


Log  of  time 


The  Weibull  model  has  the  property  that  if  the 
AFT  assumption  holds  then  the  PH  assump¬ 
tion  also  holds  (and  vice  versa).  This  property 
is  unique  to  the  Weibull  model  (Cox  and 
Oakes,  1984)  and  holds  if  p  does  not  vary  over 
different  levels  of  covariates.  The  PH  assump¬ 
tion  allows  for  the  estimation  of  a  hazard  ratio 
enabling  a  comparison  of  rates  among  differ¬ 
ent  populations.  The  AFT  assumption  allows 
for  the  estimation  of  an  acceleration  factor, 
which  can  describe  the  direct  effect  of  an  expo¬ 
sure  on  survival  time. 

The  Weibull  model  also  has  another  key  prop¬ 
erty:  the  log(-log)  of  S(t)  is  linear  with  the  log 
of  time.  This  allows  a  graphical  evaluation  of 
the  appropriateness  of  a  Weibull  model  by  plot¬ 
ting  the  log  negative  log  of  the  Kaplan-Meier 
survival  estimates  against  the  log  of  time. 

To  see  this  linear  relationship:  start  with  the 
Weibull  survival  function  S(t)  =  exp(— }./7'),  take 
the  log  of  S(t),  multiply  by  negative  one,  and 
take  the  log  again  (see  left).  For  the  Weibull 
distribution,  the  ln[— ln(S(t))]  is  a  linear  func¬ 
tion  of  ln(t)  with  slope  p  and  intercept  p  ln(A). 
If  the  slope  equals  one  then  t  follows  an  expo¬ 
nential  distribution. 

We  again  return  to  the  remission  data  and 
evaluate  the  appropriateness  of  the  Weibull 
assumption  for  the  treated  (TRT  =1)  and  pla¬ 
cebo  (TRT  =  0)  groups.  On  the  left  is  the  plot 
of  the  log  negative  log  Kaplan-Meier  survival 
estimates  against  the  log  of  time  for  TRT  =  1 
and  TRT  =  0.  Both  plots  look  reasonably 
straight  suggesting  that  the  Weibull  assump¬ 
tion  is  reasonable.  Furthermore,  the  lines 
appear  to  have  the  same  slope  (i.e.,  are  parallel, 
same  p)  suggesting  that  the  PH  (and  thus  the 
AFT)  assumptions  hold.  If  this  common  slope 
equals  one  (i.e.,  p  =  1),  then  survival  time  fol¬ 
lows  an  exponential  distribution.  The  Weibull 
model  output  containing  the  parameter  esti¬ 
mates  includes  a  statistical  test  for  the  hypo¬ 
thesis  p  =  1  or  equivalently  for  ln(p)  =  0  (for 
testing  the  exponential  assumption).  This  is 
examined  later  in  this  section. 
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Summary  of  possible  results 

for  plot  of  ln[— In  .S' (7 )]  against  ln(t) 

1 .  Parallel  straight  lines  =>■  Weibull, 
PH,  and  AFT  assumptions  hold 

2.  Parallel  straight  lines  with  slope 
of  1  =>  Exponential.  PH  and  AFT 

3.  Parallel  but  not  straight  lines  => 
PH  but  not  Weibull,  not  AFT 
(can  use  Cox  model) 

4.  Not  parallel  and  not  straight  =>■ 
Not  Weibull,  PH  violated 

5 .  Not  parallel  but  straight  lines  =>■ 
Weibull  holds,  but  PH  and  AFT 
violated,  different  p 


Previous  plot  suggests  Weibull  and 
PH  assumption  reasonable  for  TRT 


Weibull  PH  model: 
h(t)  =  Ip?-1 

where  X  =  exp((30  +  Pi  TRT). 

Hazard  ratio  (TRT  =  1  vs.  TRT  =  0) 
HR  _exp(/?0  +  P1)ptp~] 
exP(Po)PtP~l 
=  exP(Pi) 


On  the  left  is  a  summary  of  five  possible 
results  from  an  examination  of  the  log  negative 
log  Kaplan-Meier  survival  estimates  plotted 
against  the  log  of  time  for  two  or  more  levels 
of  covariates.  The  key  points  are  that  straight 
lines  support  the  Weibull  assumption  and 
parallel  curves  support  the  PH  assumption. 
If  the  plots  are  parallel  but  not  straight  then  the 
PH  assumption  holds  but  not  the  Weibull. 
Assessing  whether  the  curves  are  parallel  is  a 
familiar  approach  for  evaluating  the  PH 
assumption  in  a  Cox  model  (see  Chapter  4 
and  Computer  Appendix).  An  interesting  sce¬ 
nario  occurs  if  the  lines  are  straight  but  not 
parallel.  In  this  situation  the  Weibull  assump¬ 
tion  is  supported  but  the  PH  and  AFT  assump¬ 
tions  are  violated.  If  the  lines  are  not  parallel, 
then  p  is  not  constant  across  levels  of  covari¬ 
ates.  In  Section  IX  of  this  chapter,  we  present  a 
method  for  modeling  the  shape  parameter  p  as 
a  function  of  predictor  variables,  but  typically 
p  is  assumed  fixed. 

An  examination  of  the  plot  on  the  previous 
page  suggests  that  the  Weibull  and  PH  assump¬ 
tions  are  reasonable  for  treatment  (TRT).  First 
the  PH  form  of  the  model  is  presented  and  then 
the  AFT  form. 

The  Weibull  hazard  function  is  h(t)  =  Xp?~x . 
A  Weibull  PH  model  is  defined  by  reparameter¬ 
izing  lambda  X  as  exp(p0  +  Pi  TRT).  The  hazard 
ratio  is  obtained  by  substituting  TRT  =  1  and 
TRT  =  0  into  the  hazard  functions  (see  left). 
After  canceling  we  obtain  the  familiar  result 
exp(Pi).  Note  that  this  result  depends  on  p  hav¬ 
ing  the  same  value  for  TRT  =  1  and  TRT  =  0, 
otherwise  time  (t)  would  not  cancel  in  the 
expression  for  the  HR  (i.e.,  PH  assumption 
not  satisfied). 


Presentation:  VI.  Weibull  Example  307 


Remission  Data 

Weibull  regression  log  relative- 
hazard  form 


_t 

Coef. 

Std. 

Err. 

z  p  >  |z| 

trt 

-1.731 

.413 

-4.19  0.000 

_cons 

-3.071 

.558 

-5.50  0.000 

/ln_p 

.312 

.147 

2.12  0.034 

P 

1.366 

.201 

1/p 

.732 

.109 

Weibull  PH 

HR(TRT  =  1  vs.  0)  = 

exp(-1.731) 

=  0.18 

95%  Cl  =  exp[—  1 .73 1  ±  1.96(0.413)] 
=  (0.08,  0.40) 


Weibull:  HR  =JU8 
Exponential:  HR  =  0.22 
Suggests  preventive  effect  of  TRT 


On  the  left  is  stata  output  from  running  the  PH 
form  of  the  Weibull  model.  There  are  para¬ 
meter  estimates  for  the  coefficient  of  TRT,  the 
intercept  (called  _cons),  and  for  three  forms  of 
the  shape  parameter:  p,  1  Ip,  and  log(p).  The 
estimate  forp  is  1.366  suggesting  an  increase 
in  the  hazard  as  survival  time  increases  (because 
p  >  1).  A  statistical  test  for  H0:  log(p)  =  0  yields 
a  p-value  of  0.034.  At  a  significance  level  of  0.05 
we  would  reject  the  null  and  decide  p  is  not 
equal  to  1,  suggesting  that  the  exponential 
model  is  not  appropriate. 


An  estimated  hazard  ratio  of  0.18  is  obtained 
by  exponentiating  the  estimated  coefficient 
(—1.731)  of  the  TRT  variable.  The  95%  confi¬ 
dence  interval  for  this  HR  is  calculated  to  be 
(0.08,  0.40)  indicating  a  significant  preventive 
effect  of  treatment.  These  results  are  similar  to 
those  obtained  from  the  exponential  model  in 
which  the  estimated  hazard  ratio  was  0.22. 


Comparing  Cox  and  Weibull  PH 
models 

Cox:  estimate  |3, 
h(t)  =  h0(t)exp((3,TRT) 

/ 

baseline  hazard  unspecified 


It  can  be  instructive  to  compare  the  Cox  and 
Weibull  PH  models.  The  Cox  PH  model  with 
treatment  as  the  only  predictor  is  stated  as  h0(t) 
exp(p!TRT).  There  is  one  parameter  to  esti¬ 
mate  (pi)  and  the  distribution  of  the  baseline 
hazard  (h0(t))  remains  unspecified. 


Weibull:  estimate  (30,  {3 1 , p 
h(t)  =  A ptv~x  where 
A  =  exp(|30  +  (3,TRT). 
h(t)  =  [exp(|30)pfp”1]exp(|31TRT). 

/ 

baseline  hazard  specified  paramet¬ 
rically 


With  some  manipulation,  the  Weibull  PH 
model  can  also  be  expressed  as  a  product  of  a 
baseline  hazard  and  exp(fi|  TRT)  (see  left). 
There  are  three  parameters  to  estimate  p0,  Pi, 
and  p  that  fully  specify  the  hazard. 
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S(t)  =  exp(— A  tp) 


solve  for  t 
t  =  [-In  S(t)]1/f 


A1/p 


let 


AI/p 


=  exp(o£fl  +  oqRX) 


t  =  [—In  S(t)]1/P  x  exp(ao  +  <XiTRT) 

/ 


Scaling  of  t 


An  AFT  model  can  also  be  formulated  with  the 
Weibull  distribution.  We  develop  the  AFT 
parameterization  similarly  to  that  done  with 
the  exponential  model,  by  solving  for  t  in 
terms  of  a  fixed  S(t).  The  Weibull  survival  func¬ 
tion  is  S(t)  =  exp (-A/p).  Taking  the  natural  log, 
multiplying  by  negative  1 ,  raising  to  the  power 
1/p,  and  then  multiplying  by  the  reciprocal  of 
A1/p,  yields  the  expression  for  t  shown  on  the 
left.  By  reparameterizing  1  l'kllp  =  exp(a0  + 
oqTRT),  it  can  be  shown  that  oq,  the  coefficient 
of  the  predictor  variable  TRT,  is  used  to  scale 
the  time  to  any  fixed  value  of  S(t)  (see  left). 


Let  S(t)  =  q 

t  =  [-ln(q)]1/p  x  exp(a0  +  oqTRT) 
Median  survival  time  (q  =  0.5) 


The  expression  for  t  is  restated  on  the  left  in 
terms  of  any  fixed  probability  S(t)  =  q.  For 
example,  to  find  an  expression  for  the  median 
survival  time  tm,  substitute  q  =  0.5  (see  left). 


tm  =  [— ln(0.5)]1/p 

x  exp(a0  +  oqTRT) 


Acceleration  factor,  y  (TRT 
TRT  =  0) 

„  _  [-  ln(q)]1/p  exp(o:o  +  «i) 

[—  ln(<?)]1/p  exp(a0) 

=  exp(oq) 


vs.  The  acceleration  factor  y  is  obtained  as  the 
ratio  of  the  times  to  S(t)  =  q  for  TRT  =  1  and 
for  TRT  =  0.  After  canceling,  y  reduces  to  exp 
(oq),  where  oq  is  the  coefficient  of  the  TRT 
variable.  As  with  the  PH  form  of  the  model, 
this  result  depends  on  p  not  varying  by  treat¬ 
ment  status;  otherwise  y  would  depend  on  q. 


Remission  Data 


Weibull  regression  accelerated 
failure-time  form 


_t 

Coef. 

Std.  Err. 

z 

P>N 

trt 

1.267 

.311 

4.08 

0.000 

_cons 

2.248 

.166 

13.55 

0.000 

/ln_p 

.312 

.147 

2.12 

0.034 

P 

1.366 

.201 

1/p 

.732 

.109 

Output  from  running  a  Weibull  AFT  model  is 
shown  on  the  left.  The  estimates  for  each  form 
of  the  shape  parameter  ( p ,  1/p,  and  ln(p))  are 
the  same  as  obtained  from  the  previously 
shown  PH  form  of  the  model. 

The  estimated  acceleration  factor  of  3.55  is 
obtained  by  exponentiating  the  estimated 
coefficient  (1.267)  of  the  TRT  variable.  The 
95%  confidence  interval  for  y  is  calculated  to 
be  (1.93,  6.53).  There  results  are  shown  at  the 
top  of  the  following  page. 
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Weibull  AFT: 

y(TRT  =  1  vs.  0)  =  exp(  1.267) 

=  3.55 

95%  Cl  =  exp[  1.267  ±  1.96(0.311)] 
=  (1.93,  6.53) 


Weibull:  y  =  3.55 
Exponential:  y  =  4.60  (assumes 
h(t)  =  X) 


These  results  suggest  that  the  median  (or  any 
other  quantile  of)  sui'vival  time  is  increased  by 
a  factor  of  3.55  for  those  receiving  the  treat¬ 
ment  compared  to  the  placebo.  Recall  that  the 
acceleration  factor  was  estimated  at  4.60  using 
the  exponential  model.  However,  the  exponen¬ 
tial  model  uses  a  much  stronger  assumption: 
that  the  hazards  are  constant. 


Relating  Weibull  AFT  and 
PH  coefficients 

AFT:  X1,p  =  exp[— (a0  +  oqTRT)] 
(l/p)ln  X  =  —  (a0  +  oqTRT) 

In  X  =  — p  (a0  +  oqTRT) 

PH:  A.  =  exp(p0  +  PiTRT) 

In  X  =  p0  +  pjTRT 


Corresponding  coefficients  obtained  from  the 
PH  and  AFT  forms  of  the  Weibull  models  are 
related  as  follows:  Pj  =  -otj  p  for  the  jth  covari¬ 
ate.  This  can  most  easily  be  seen  by  formulat¬ 
ing  the  parameterization  equivalently  in  terms 
of  ln(L)  for  both  the  PH  and  AFT  forms  of  the 
model  as  shown  on  the  left. 


Relationship  of  coefficients: 

pj  =  —  otj  p  so  that 
P  =  —  a  for  exponential  (p  =  1) 

Relating  estimates  for  TRT 
(PH  vs.  AFT) 


This  relationship  is  illustrated  utilizing  the 
coefficient  estimates  we  obtained  for  TRT: 
-1.731  =  (-1.267X1.366).  Note  for  the  expo¬ 
nential  model  in  which  p  =  1 ,  the  PH  and  AFT 
coefficients  are  related,  P  =  —a. 


-1.731  =  (-1.267X1.366) 


Next:  log-logistic  model 
•  Hazard  may  be  nonmonotonic 
Weibull  model 


In  the  next  example  the  log-logistic  model  is 
presented.  In  contrast  to  the  Weibull,  the  haz¬ 
ard  function  for  the  log-logistic  distribution 
allows  for  some  nonmonotonic  behavior  in 
the  hazard  function. 


•  Hazard  does  not  change 
direction 


VII.  Log-Logistic  Example 


Log-logistic  hazard:  h(t)  = - — 

1  +  XtP 

(where  p  >  0  and  X  >  0) 


The  log-logistic  distribution  accommodates  an 
AFT  model  but  not  a  PH  model.  Its  hazard 
function  is  shown  on  the  left.  The  shape 
parameter  is  p(>0). 
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Shape  of  hazard  function: 

p  <  1  hazard  decreases  over  time 
p  >  1  hazard  first  increases  and 
then  decreases  over  time 
(unimodal) 


If  p  <  1  then  the  hazard  decreases  over  time. 
If  p  >  1,  however,  the  hazard  increases  to  a 
maximum  point  and  then  decreases  over  time. 
In  this  case  (p  >  1),  the  hazard  function  is  said 

to  be  unimodal. 


Log-logistic  modeling  assumptions: 


AFT  PO 

PO:  Survival  Odds  ratio  constant  over  time 


Unlike  the  Weibull  model,  a  log-logistic  AFT 
model  is  not  a  PH  model.  However,  the  log- 
logistic  AFT  model  is  a  proportional  odds 
(PO)  model.  A  proportional  odds  survival 
model  is  a  model  in  which  the  survival 
odds  ratio  is  assumed  to  remain  constant 
over  time.  This  is  analogous  to  a  proportional 
hazard  model  where  the  hazard  ratio  is 
assumed  constant  over  time. 


Survival  odds 

S(t)  _  P(T  >  t) 
(1  -S(t))~P(T<t) 


The  survival  odds  is  the  odds  of  surviving  beyond 
time  t  (i.e.,  S(t)/(1  -  S(t)).  This  is  the  probability 
of  not  getting  the  event  by  time  t  divided  by  the 
probability  of  getting  the  event  by  time  t. 


Failure  odds  by  time  t 

(1  ~  S(t))  _  P{T  <  t) 
S(t)  P(T  >  t) 


The  failure  odds  is  the  odds  of  getting  the 
event  by  time  t  (i.e.,  (1  —  S(t))/S(t)),  which  is 
the  reciprocal  of  the  survival  odds  (see  left). 


Log-logistic  survival  and  failure 
functions 


S(t) 


1 

1  +  XtP 


1  -  S(t) 


fop 

1  +  foP 


The  log-logistic  survival  function  (S(t))  and 
failure  function  ( 1  —  S(t))  are  shown  on  the  left. 


Failure  odds 


The  failure  odds  simplifies  in  a  log-logistic 
model  to  Xtp  (see  left). 


Log-logistic  PO  model: 


Reparameterize  X  in  terms  of 
Xs  and  ps 


SOR  = 


S1(t)/(1  -  S^t)) 

S2(t)/(1  —  s2(t)) 


•  SOR  satisfies  PO  if  SOR 
constant  over  time 

•  SOR  constant 

t 

FOR  =  1/SOR  constant 


A  log-logistic  proportional  odds  model  can  be 
formulated  by  reparameterizing  X  in  terms  of 
predictor  variables  and  regression  parameters. 
We  come  back  to  this  point  later  in  this  section. 

A  survival  odds  ratio  (SOR)  is  defined  as  the 
ratio  of  survival  odds  for  two  groups  of  sub¬ 
jects,  as  shown  on  the  left. 

An  SOR  satisfies  a  PO  assumption  provided  the 
SOR  does  not  depend  on  time. 

Also,  if  the  SOR  does  not  depend  on  time,  then 
the  failure  odds  ratio  (FOR  =  1/SOR)  also  does 
not  depend  on  time. 
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The  log  of  the  failure  odds  is  ln(AC),  which  can 
be  rewritten  as  ln(A)  +  /;[ln(/ )].  In  other  words, 
the  log  odds  of  failure  is  a  linear  function  of 
the  log  of  time  with  slope  p  and  intercept  ln(A). 
This  is  a  useful  result  enabling  a  graphical 
evaluation  for  the  appropriateness  of  the  log- 
logistic  distribution. 


Evaluate  log-logistic  assumption 
graphically 

•  Plot  In  n  against  ln(t) 

•  If  log-logistic,  then  plot  is  linear 
with  slope  =  p 


The  log-logistic  assumption  can  be  graphically 
evaluated  by  plotting  ln(l  -  S(t))/(S(l))  against 
ln(t)  where  S(t)  are  the  Kaplan-Meier  sur¬ 
vival  estimates.  If  survival  time  follows  a  log- 
logistic  distribution,  then  the  resulting  plots 
should  be  a  straight  line  of  slope  p. 


Log  Odds  Is  Linear  with  ln(t) 

log(failure  odds)  =  ln(Atp) 

=  ln(A)  +  p[ln(i)] 

/  t 

Intercept  =  ln(A)  slope  =  p 


Alternatively 

•  Plot  In  against  ln(t) 

•  If  log-logistic,  then  plot  is  linear 
with  slope  =  —p 


We  could  alternatively  plot  the  log  of  the 
survival  odds,  ln(5(i))/(l  -  S(t)),  against  ln(t). 
If  the  log-logistic  assumption  is  correct 
the  resulting  plots  should  be  a  straight  line  of 
slope  —p. 


Remission  Data  We  next  consider  a  different  variable  from  the 

remission  data:  a  dichotomous  variable  for 
WBCCAT:  white  blood  cell  count  white  blood  cell  count  (WBCCAT)  coded 
variable  medium  =  1  vs.  high  =  2  medium  =  1  and  high  =  2. 

plotted  against  ln(t). 


0  12  3  4 


Log  of  time 


On  the  left  is  the  plot  of  the  log  odds  of 
survival  (obtained  from  the  Kaplan-Meier  sur¬ 
vival  estimates)  against  the  log  of  time  com¬ 
paring  medium  (WBCCAT  =  1)  and  high 
(WBCCAT  =  2)  blood  cell  counts.  The  points 
for  WBCCAT  =  1  lie  above  the  points  for 
WBCCAT  =  2  indicating  that  the  survival  odds 
are  higher  for  those  with  a  medium  white  blood 
cell  count  compared  to  high.  The  lines  look  rea¬ 
sonably  straight  and  parallel,  at  least  until  the 
estimated  odds  of  survival  approaches  zero. 

If  we  accept  the  proposition  that  the  lines  look 
straight,  then  the  log-logistic  assumption  is 
reasonable.  Because  the  lines  look  parallel, 
the  proportional  odds  (PO)  assumption  is  also 
reasonable.  If  the  PO  assumption  holds  in  a 
log-logistic  model  then  the  AFT  assumption 
also  holds. 


312  7.  Parametric  Survival  Models 


Straight  lines  =C>  Log-logistic  The  key  points  from  above  are: 


Parallel  plots  t=>  PO 
Log-logistic  and  PO  ■=>  AFT 


Log-logistic  and  Weibull  graphical 
approach  analogous 

•  Check  PH  for  Weibull 

•  Check  PO  for  log-logistic 


AFT  log-logistic  model 
1  1 

S(t)  = 


1  +k  P 


1 


solve  for  t  to  obtain 


t  = 


1 

IW) 


- 1 


Up 


)}!p 


let  — ^  =  exp(«o  +  aiWBCCAT) 


t  = 


[5(r) 


-  1 


Up 


x  exp(ao  +  aiWBCCAT) 

/ 

Scaling  of  t 


a.  straight  lines  support  the  log-logistic 
assumption, 

b.  parallel  curves  support  the  PO  assump¬ 
tion,  and 

c.  If  the  log-logistic  and  PO  assumptions 
hold,  then  the  AFT  assumption  also 
holds. 

The  graphical  evaluation  for  the  log-logistic 
assumption  is  analogous  to  the  graphical  anal¬ 
ysis  of  the  Weibull  assumption  presented  in 
the  last  section,  except  here  the  PO  assumption 
rather  than  the  PH  assumption  is  evaluated  by 
checking  for  parallel  lines. 

Next  we  consider  an  AFT  log-logistic  model 
with  white  blood  cell  count  as  the  only  predic¬ 
tor  comparing  WBCCAT  =  2  (high  count)  and 
WBCCAT  =  1  (medium  count). 

We  develop  the  AFT  parameterization  by  solv¬ 
ing  for  t  in  terms  of  a  fixed  S(t).  Starting  with 
the  expression  for  S(t),  taking  reciprocals,  sub¬ 
tracting  1,  raising  to  the  power  \lp,  and  then 
multiplying  by  the  reciprocal  of  A,  ,  yields  the 
expression  for  t  shown  on  the  left.  By  repara¬ 
meterizing  l/A1/p  =  exp(a0  +  aj  WBCCAT),  we 
allow  oq,  the  coefficient  of  the  predictor  vari¬ 
able  WBCCAT,  to  be  used  for  the  multiplicative 
scaling  of  time  to  any  fixed  value  of  S(t) 
(see  left). 


Let  S(t)  -  q 
t  =  [q-1  -  I]17? 

x  exp(a0  +  oq  WBCCAT) 
Median  survival  time  (q  =  0.5): 


tm=[2—  1]'/P 

x  exp(a0  +  oq  WBCCAT) 


The  expression  for  t  is  restated  on  the  left  in 
terms  of  any  fixed  probability  S(t)  =  q.  For 
example,  to  find  an  expression  for  the  median 
survival  time  tm,  substitute  q  =  0.5  (see  left). 
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Acceleration  factor, 
y  (WBCCAT  =  2  vs.  WBCCAT  =  1) 

_  [q_1  -  l]1/,p  exp(a0  +  2oq) 

[q-1  -  l]1/pexp(a0  +  loq) 

=  exp(cq) 


Log-logistic  regression  accelerated 
failure-time  form 


_t 

Coef. 

Std. 

Err. 

z 

P>|z| 

wbccat 

-.871 

.296 

-2.94 

0.003 

_cons 

3.495 

.498 

7.09 

0.000 

ln_gam 

-.779 

.164 

-4.73 

0.000 

gamma 

.459 

0.756 

p  =  1/(0.459)  =  2.18 


WBCCAT  =  2  vs.  WBCCAT  =  1 
(log-logistic): 


y  =  exp(— 0.871)  =  0.42 
95%  Cl  for y  =  exp[-0.871 

±  1.96(0.296)] 
=  (0.23,  0.75) 

Comparing  estimated  survival 


S,(0 

/ 

Survival 
function  for 
WBCCAT  = 


S2(0.42t) 

\ 

Survival 
function  for 
WBCCAT  =  2 


Failure  odds 


where  X  =  exp(p0  +  Pi  WBCCAT) 


OR  (WBCCAT  =  2  vs.  WBCCAT  =  1) 

tp  exp(/?0  +  2/?i) 
tP  exp(/l0  +  l/lj) 


The  acceleration  factor  y  is  found  by  taking  the 
ratio  of  the  times  to  S(t)  =  q  for  WBCCAT  =  2 
and  for  WBCCAT  =  1 .  After  canceling,  y 
reduces  to  exp(oq),  where  oq  is  the  coefficient 
of  the  WBCCAT  variable  in  the  AFT  model. 


The  output  from  running  the  AFT  log-logistic 
model  is  shown  on  the  left.  The  coefficient  esti¬ 
mate  for  WBCCAT  is  —0.871,  which  is  statisti¬ 
cally  significant  with  a  p-value  of  0.003  (far 
right  column  of  output). 

Stata  provides  estimates  of  the  reciprocal  of 
p  (gamma  =  1  Ip)  rather  than  for  p.  The  esti¬ 
mate  for  gamma  is  0.459.  Therefore,  the  esti¬ 
mate  forp  is  1/(0.459)  =  2.18. 


An  estimate  of  the  acceleration  factor  y  com¬ 
paring  WBCCAT  =  2  to  WBCCAT  =  1  is  found 
by  exponentiating  the  estimate  -0.871  of  oq  to 
obtain  0.42.  The  95%  confidence  interval  for  y 
is  calculated  to  be  (0.23,  0.75). 


These  results  suggest  that  the  time  for  going 
out  of  remission  is  “accelerated”  for  patients 
with  a  high  white  blood  cell  count  compared 
to  those  with  a  medium  count  by  an  estimated 
factor  of  0.42.  In  terms  of  the  survival  functions 
estimated  from  this  model,  Sfr/)  =  52(0.42Z) 
where  Si(t)  and  S2(l)  are  the  respective  survival 
functions  for  patients  with  medium  and  high 
blood  cell  counts. 

The  proportional  odds  form  of  the  log-logistic 
model  can  also  be  formulated  by  reparameter¬ 
izing  X.  Recall  that  the  log-logistic  failure  odds 

is  Xf . 

By  setting  X  =  exp(p0  +  pi  WBCCAT),  a  (failure) 
odds  ratio  comparing  WBCCAT  =  2  to  WBCCAT 
=  1  can  be  calculated  (see  left).  After  canceling, 
the  odds  ratio  reduces  to  exp(Pi). 


exp(/?i) 
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Comparing  AFT  and  PO 
(log-logistic) 

Relationship  of  coefficients: 

Pj  =  -OCj  p 


The  corresponding  coefficients  for  log-logistic 
PO  and  AFT  models  are  related  by  ft,  =  —  ocj 
p  for  the  jth  covariate.  This  result  is  obtained 
using  a  similar  argument  to  that  presented  for 
the  Weibull  example  in  the  previous  section. 


Since  a  =  —0.871  and  p  =  2.18 
Then, 

/i,  =  — (— 0.871)(2.18)  =  1.90 
and 

OR  =  exp  (1.90)  =  6.69 


The  estimate  for  oti  in  the  AFT  model  is  —0.871 
and  the  estimate  for  p  is  2.18.  Therefore,  an 
estimate  for  (h  can  be  found  by  multiplying 
-(—0.871)  times  2.18  yielding  1.90.  An  estimate 
of  the  odds  ratio  is  found  by  exponentiating 
this  estimate,  exp(1.90)  =  6.69.  (Unfortunately, 
neither  Stata  nor  SAS  estimates  the  propor¬ 
tional  odds  form  of  the  model.) 


VIII.  A  More  General  Form 
of  the  AFT  Model 

Exponential:  S(t)  =  exp(— /./) 

•  AFT  Form:  [ 

=  exp(oc0  +  TRT) 

•  PH  Form:  X 

=  exp(p0  +  p!  TRT) 


On  the  left  is  a  summary  of  the  models 
discussed  in  the  previous  sections.  These 
models  were  formulated  by  reparameterizing 
the  survival  (and  hazard)  functions  in  terms  of 
regression  parameters  and  predictor  variables. 

An  advantage  for  stating  the  models  in  this 
form  is  that  the  interpretation  and  relation¬ 
ships  between  parameters  are  specific  to  each 
distribution. 


Weibull:  S(t)  =  exp(— Xtp) 

•  AFT  Form:  — ^ 

Xl/p 

=  exp(a0  +  a!  TRT) 

•  PH  Form:  X 

=  exp(p0  +  Pi  TRT) 

Log-logistic:  S(t)  =  {  1 

•  AFT  Form:  — ^ 

Al/p 

=  exp(a0  +  oti  WBCCAT) 


However,  there  are  more  general  ways  these 
models  could  be  stated.  The  Cox  PH  model  is 
a  more  general  way  of  stating  the  proportional 
hazards  model.  In  this  section  we  discuss  a 
more  general  formulation  of  the  AFT  model. 


•  PO  Form:  X 

=  exp(p0  +  Pi  WBCCAT) 
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General  Form  of  AFT  Model 
( One  Predictor) 

ln(T)  =  oto  +  aiTRT  +  t 

/ 

random  error 


Consider  an  AFT  model  with  one  predictor 
(TRT)  in  which  T  represents  a  random  variable 
for  survival  time.  The  model  can  be  expressed 
on  the  log  scale  as  shown  on  the  left,  where  e  is 
random  error  following  some  distribution. 


With  additional  parameter 
ln(T)  =  oco  +  at  TRT  +  at 

/ 

a  scales  the  error 


Some  distributions  have  an  additional  para¬ 
meter  (cr)  scaling  t.  The  model  including  this 
additional  parameter  can  be  expressed  as 
shown  on  the  left,  where  the  random  error  e  is 
multiplied  by  a  scale  parameter  cr. 


If  e  ~  N(0,  1),  then 

ln(T)  ~  N(/t  =  a0  +  aiTRT,  sd  =  a) 

Similar  to  linear  regression  (except 
for  inclusion  of  censorships) 


If  e  follows  a  standard  normal  distribution  and 
ln(T)  =  a0  +  a^RT  +  at,  then  ln(T)  would 
follow  a  normal  distribution  with  mean  ft  = 
a0  +  aiTRT  and  standard  deviation  cr.  For  this 
situation,  the  model  would  look  like  a  standard 
linear  regression.  The  key  difference  between 
fitting  this  survival  model  and  a  standard  linear 
regression  is  the  inclusion  of  censored  obser¬ 
vations  in  the  data. 


In  general, 


/<in(T)  ±  (“o  +  aiTRT),  sd  ^  a 


Interpretation  of  parameters  de¬ 
pends  on  distribution 

Let  a  =  - ,  then 

,  _  P  1 
ln(T)  =  ao  +  aiTRT  H —  £ 


In  general,  for  other  then  the  normal  distribu¬ 
tion,  the  mean  of  ln(T)  is  not  necessarily  a0  + 
a  | TRT  and  its  standard  deviation  is  not  cr.  In 
other  words,  it  should  not  be  assumed  that  the 
mean  of  t  is  0  and  the  standard  deviation  is  1 . 
The  interpretation  of  the  parameters  depends 
on  the  underlying  distribution. 

Sometimes  the  model  is  parameterized  using 
a  =  1  Ip.  The  model  can  then  be  restated  by 
replacing  at  with  (1  Ip)  t. 


Additive  model  in  terms  of  ln(T) 
but 

multiplicative  model  in  terms  of  T 


T  =  exp  ^a0  +  aiTRT  +  -e^ 
=  exp[(a0  +  a[TRT)]  x  exp 


The  AFT  model  is  additive  on  the  log  scale 
but  a  multiplicative  model  with  respect  to  T. 

In  particular,  the  model  can  be  expressed  in 
terms  of  T  by  exponentiating  ln(T),  as  shown 
on  the  left. 
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Collapse  a0  into  baseline  term 


Tq  =  exp(ao)  exp 


The  model  may  also  be  expressed  by  collapsing 
the  intercept  into  a  baseline  random  term  T0 
(see  left).  In  this  setting,  T0  is  a  random  vari¬ 
able  representing  the  survival  time  of  the  pla¬ 
cebo  group  (TRT  =  0). 


so  that  T  =  T0  expfoc]  TRT)  where  Thus,  the  AFT  model  has  a  form  analogous  to 
T0  is  a  random  variable  for  TRT  =  0  the  a  Cox  PH  model,  although  the  baseline  term 

T0  is  a  random  variable  rather  than  a  constant. 


AFT  model  may  be  expressed  in 
terms  of  T  or  ln(T) 


Comparing  Distributions:  T  and 
ln(T) 


T 

ln(T) 

Exponential 

Extreme  minimum 
value 

Weibull 

Extreme  minimum 
value 

Log-logistic 

Logistic 

Lognormal 

Normal 

In  summary,  an  AFT  model  may  be  expressed 
by  reparameterizing  a  specific  distribution,  or 
may  be  more  generally  expressed  either  in  terms 
of  a  random  variable  T  (for  survival  time),  or  In 
(T).  If  T  follows  a  Weibull  distribution  then  In 
(T)  follows  a  distribution  called  the  extreme 
minimum  value  distribution  (see  table  on  left). 
Similarly,  if  T  follows  a  log-logistic  or  lognormal 
distribution  then  ln(T)  follows  a  logistic  or 
normal  distribution,  respectively.  The  logistic 
and  normal  distributions  are  similarly  shaped, 
and  are  both  symmetric  about  their  mean. 


IX.  Other  Parametric 
Models 


In  the  previous  sections  we  presented  examples 
of  the  exponential,  Weibull,  and  log-logistic 
models.  In  this  section  we  briefly  discuss 
some  other  parametric  survival  models. 


Generalized  Gamma  Model 

•  Supported  by  SAS  and  Stata 

•  S(t),  h(t)  expressed  in  terms  of 
integrals 

•  Contains  three  parameters 

•  Weibull,  lognormal  are  special 
cases 


The  generalized  gamma  model  is  a  parametric 
survival  model  that  is  supported  by  both  SAS 
and  Stata  software.  The  hazard  and  survival 
function  for  this  model  is  complicated  and  can 
only  be  expressed  in  terms  of  integrals.  The 
generalized  gamma  distribution  has  three  para¬ 
meters  allowing  for  great  flexibility  in  its  shape. 
The  Weibull  and  lognormal  distributions  are 
special  cases  of  the  generalized  gamma  distri¬ 
bution  (see  Practice  Exercises  12  to  14). 


Lognormal  Model 

Similar  to  log-logistic 
Difference: 

Log-logistic:  AFT  and  PO 
Lognormal:  AFT  but  not  PO 


The  lognormal  model  also  has  a  relatively 
complicated  hazard  and  survival  function  that 
can  only  be  expressed  in  terms  of  integrals.  The 
shape  of  the  lognormal  distribution  is  very  sim¬ 
ilar  to  the  log-logistic  distribution  and  yields 
similar  model  results.  A  difference  is  that 
although  the  lognormal  model  accommodates 
an  accelerated  failure  time  model,  it  is  not  a 
proportional  odds  model. 
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Gompertz  Model 

•  PH  model  but  not  AFT 

•  One  predictor  (TRT)  in  model: 


h(t)  =  [exp(yt)]  x  exp(p0  +  p]  TRT) 


h0(t) 


parametrically  specified 
exp  (yt) 


y  >  0  hazard  exponentially 
increases  with  t 
y  <  0  hazard  exponentially 
decreases  with  t 
y  =  0  constant  hazard 

(exponential  model) 


Parametric  models  need  not  be  AFT  models. 
The  Gompertz  model  is  a  parametric  propor¬ 
tional  hazards  model  but  not  an  AFT  model. 
The  model  can  be  expressed  in  a  form  similar 
to  that  of  a  Cox  PH  model  except  that  the 
baseline  hazard  is  specified  as  the  hazard  of 
a  Gompertz  distribution  containing  a  shape 
parameter  y  (see  left). 

If  y  >  0  then  the  hazard  exponentially  increases 
over  time.  If  y  <  0  then  the  hazard  exponen¬ 
tially  decreases  over  time.  If  y  =  0  then  the 
hazard  is  constant  and  reduces  to  the  exponen¬ 
tial  model. 


AFT  model:  multiplicative  The  AFT  model  is  a  multiplicative  model  (i.e., 

a  multiplicative  scaling  of  failure  time).  It 
T  =  exp(a0  +  TRT  +  e)  becomes  an  additive  model  on  the  log  scale 

=  exp(ao)  x  expla,  TRT)  x  exp(e)  (see  left  side). 

but 

additive  on  log  scale: 
ln(T)  =  oc0  +  oci  TRT  +  e 


Additive  failure  time  model 
T  =  ao  +  otiTRT  +  e 

/ 

T  rather  than  log(T)  is  linear  with 
TRT 


An  alternative  parametric  model  is  to  define  an 
additive  failure  time  model  in  terms  of  T, 
rather  than  ln(T).  Consider  the  model:  T  = 
a0  +  cci  TRT  +  e.  Now  T,  rather  than  ln(T),  is 
expressed  as  a  linear  function  of  the  regression 
parameters.  SAS  supports  such  an  additive  fail¬ 
ure  time  model  (see  Computer  Appendix). 


Modeling  the  Shape  Parameter 
(e.g.,  Weibull  and  log-logistic) 

Typical  Weibull  model 
h(t)  =  A pf-1 

where  A  =  exp(  (!0  +  (1,  TRT) 

p  unaffected  by  predictors 


Many  parametric  models  contain  an  extra 
shape  (or  ancillary)  parameter  beyond  the 
regression  parameters.  For  example,  the  Wei¬ 
bull  and  log-logistic  models  contain  a  shape 
parameter  p.  Typically,  this  parameter  is  con¬ 
sidered  fixed,  unaffected  by  changes  in  the 
values  of  predictor  variables. 
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Alternative  Weibull  model 

models  the  ancillary  parameter  p 

h(t)  =  Xptp~l 

where  X  =  exp(  |!0  +  |i,  TRT) 
p  =  exp(§0  +  §!  TRT) 

Not  a  PH  or  AFT  model  if  §!  0 

but  still  a  Weibull  model 


An  alternative  approach  is  to  model  the  shape 
parameter  in  terms  of  predictor  variables  and 
regression  coefficients.  In  the  Weibull  model 
shown  on  the  left,  both  X  and  p  are  modeled 
as  functions  of  treatment  status  (TRT).  If  §j  is 
not  equal  to  zero,  then  the  value  of  p  differs  by 
TRT.  For  that  situation,  the  PH  (and  thus  the 
AFT)  assumption  is  violated  because  tp~l  will 
not  cancel  in  the  hazard  ratio  for  TRT  (see 
Practice  Exercises  15  to  17). 


Choosing  appropriate  model 

•  Evaluate  graphically 
o  Exponential 

o  Weibull 
o  Log-logistic 

•  Akaike’s  information  criterion 
o  Compares  model  fit 

o  Uses  —2  log  likelihood 


Choosing  the  most  appropriate  parametric 
model  can  be  difficult.  We  have  provided 
graphical  approaches  for  evaluating  the  appro¬ 
priateness  of  the  exponential,  Weibull,  and  log- 
logistic  models.  Akaike’s  information  crite¬ 
rion  (AIC)  provides  an  approach  for  compar¬ 
ing  the  fit  of  models  with  different  underlying 
distributions,  making  use  of  the  —2  log  likeli¬ 
hood  statistic  (described  in  Practice  Exercises 
1 1  and  14). 


X.  The  Parametric 
Likelihood 

•  Function  of  observed  data  and 
unknown  parameters 

•  Based  on  outcome  distribution 
«t) 

•  Censoring  complicates  survival 
data 

o  Right-censored 
o  Left-censored 
o  Interval-censored 


The  likelihood  for  any  parametric  model  is 
a  function  of  the  observed  data  and  the 
model’s  unknown  parameters.  The  form  of  the 
likelihood  is  based  on  the  probability  density 
function  f(t)  of  the  outcome  variable.  A  compli¬ 
cation  of  survival  data  is  the  possible  inclusion 
of  censored  observations  (i.e.,  observations 
in  which  the  exact  time  of  the  outcome  is 
unobserved).  We  consider  three  types  of 
censored  observations:  right-censored,  left- 
censored,  and  interval-censored. 


Examples  of  Censored  Subjects  Right-censored.  Suppose  a  subject  is  lost  to 

follow-up  after  10  years  of  observation.  The 
time  of  event  is  not  observed  because  it  hap- 

Right-censored:  - ^  X  time  pened  after  the  10th  year.  This  subject  is  right- 

censored  at  1 0  years  because  the  event  happened 
to  the  right  of  10  on  the  time  line  (i.e.,  t  >  10). 


Left-censored:  X 


time 


10 


Interval-censored:  I  X  1 _  time 

8  10 


Left-censored.  Suppose  a  subject  had  an  event 
before  the  10th  year  but  the  exact  time  of  the 
event  is  unknown.  This  subject  is  left-censored 
at  10  years  (i.e.,  t  <  10). 


Interval-censored.  Suppose  a  subject  had  an 
event  between  the  8th  and  10th  year  (exact 
time  unknown).  This  subject  is  interval- 
censored  (i.e.,  8  <  t  <  10). 
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Formulating  the  Likelihood 


Barry,  Gary,  Lany, . . .,  Outcome 
Distribution  fit) 


Subject 

Event 

Time 

Likelihood 

Contribution 

Barry 

t  =  2 

m 

Gary 

t  >  8 

(right-censored) 

OO 

I  f{t)dt 

8 

Harry 

t  =  6 

m 

Carrie 

t  <  2 

(left-censored) 

0 

Larry 

4  <  t  <  8 

(interval-censored) 

4 

Likelihood  (L) 

Product  of  individual  contributions 

OO 

L=f(2)x  J  f(t)dt  x/(6) 

8 

x  [  x  f  f(t)dt 

Jo  Jo 

(Barry  x  Gaiy  x  Harry 
x  Carrie  x  Larry) 

Assumptions  for  formulating  L 

•  No  competing  risks 

o  Competing  event  does  not 
prohibit  event  of  interest 
o  Death  of  all  causes  is  classic 
example  of  no  competing 
risk 


•  Subjects  independent 

o  Allows  L  to  be  formulated  as 
product  of  subjects’ 
contributions 


The  table  on  the  left  illustrates  how  the  likeli¬ 
hood  is  formulated  for  data  on  five  subjects. 
We  assume  a  probability  density  function  f(t) 
for  the  outcome.  Barry  gets  the  event  at  time 
t  =  2.  His  contribution  to  the  likelihood  is  f(2). 
Gary  is  right-censored  at  t  =  8.  The  probability 
that  Gary  gets  the  event  after  t  =  8  is  found  by 
integrating  f(t)  from  8  to  infinity.  This  is  Gary’s 
contribution  to  the  likelihood.  Hany  gets  the 
event  at  time  t  =  6.  His  contribution  to  the 
likelihood  is  f(6).  Carrie  is  left-censored  at  t  = 
2.  Her  contribution  to  the  likelihood  is 
obtained  by  integrating  f(t)  from  zero  to  2. 
Finally,  Lany  is  interval-censored  between  t  = 
4  and  t  =  8.  His  contribution  to  the  likelihood  is 
found  by  integrating  f(t)  from  4  to  8. 

The  full  likelihood  (L)  is  found  by  taking  the 
product  of  each  subject’s  independent  contri¬ 
bution  to  the  likelihood.  The  likelihood  for  this 
example  is  shown  on  the  left. 


The  formulation  of  this  likelihood  uses  the 
assumption  of  no  competing  risks.  In  other 
words,  we  assume  that  no  competing  event 
will  prohibit  any  subject  from  eventually  get¬ 
ting  the  event  of  interest  (see  Chapter  9).  Death 
from  all  causes  is  the  classic  example  of  an 
outcome  that  in  reality  has  no  competing  risk. 
For  other  outcomes,  the  no  competing  risk 
assumption  is  more  of  a  theoretical  construct. 

Another  assumption  is  that  individual  contri¬ 
butions  to  the  likelihood  are  independent.  This 
assumption  allows  the  full  likelihood  to  be  for¬ 
mulated  as  the  product  of  each  individual’s 
contribution. 
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•  Follow-up  time  continuous 
o  No  gaps  in  follow-up 


Revisit  example  with  Barry,  Gary, 

Larry,  . . . 

f(t)  is  Weibull 

SMOKE  is  only  predictor 

1  =  Smoker 
0  =  Nonsmoker 

Weibull:  h(t)  =  Xptp~I , 

S(t)  =  exp(— ktp) 

f(t)  =  h(t)S(t) 

f(t)  =  Apt  p — 1  exp(-ltp) 

where  X  =  exp((30  +  pi  SMOKE) 
(PH  form  of  the  model) 


Data  Layout  for  Right-,  Left-,  and 
Interval-Censoring  Using  SAS 


ID 

LOWER 

UPPER 

SMOKE 

Barry 

2 

2 

1 

Gary 

8 

- 

0 

Harry 

6 

6 

0 

Carrie 

- 

2 

0 

Larry 

4 

8 

1 

Right-censored:  UPPER  missing 
Left-censored:  LOWER  missing 
Interval-censored:  LOWER  < 
UPPER 

Not  censored:  LOWER  =  UPPER 


A  third  assumption  is  that  each  subjects 
follow-up  time  is  continuous  without  gaps  (i.e., 
once  subjects  are  out  of  the  study,  they  do  not 
return).  If  gaps  are  allowed,  the  likelihood  can  be 
modified  to  accommodate  such  a  scenario. 

In  the  last  example,  we  did  not  specify  the 
probability  density  f(t),  nor  did  we  specify  any 
covariates.  We  revisit  this  example,  assuming 
f(t)  is  Weibull  with  one  predictor  SMOKE  in 
the  model  (coded  1  for  smokers  and  0  for 
nonsmokers). 


The  Weibull  hazard  and  survival  functions  are 
shown  on  the  left.  The  probability  density  func¬ 
tion  f(t)  is  the  product  of  the  hazard  and  sur¬ 
vival  functions.  The  parameterization  will  use 
the  proportional  hazards  (PH)  form  of  the 
Weibull  model:  X  =  p0  +  p,  SMOKE. 


On  the  left  is  the  data  layout  for  running  para¬ 
metric  models  containing  right-,  left-,  and 
interval  censored  data  in  a  form  suitable  for 
using  the  SAS  procedure  PROC  LIFETEST 
(version  8.2).  There  are  two  time  variables 
LOWERand  UPPER.  Barry  got  the  event  at 
t  =  2,  so  both  LOWER  and  UPPER  get  the 
value  2.  Gary  was  right-censored  at  8  (t  <  8) 
so  LOWER  gets  the  value  8  and  UPPER  is  set  to 
missing.  Carrie  is  left-censored  at  2  (t  >  2)  so 
LOWER  is  set  to  missing  and  UPPER  gets  the 
value  2.  Larry  was  interval-censored  with 
LOWER  =  4  and  UPPER  =  8.  Barry  and  Larry 
are  smokers  whereas  Gary,  Harry,  and  Carrie 
are  nonsmokers. 
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Weibull  Likelihood  (L) 

Product  of  individual  contributions 

oo  2 

L=f( 2)X  J f(t)d,  x/(6)  x  Jf(t)dt 
8  0 
8 

X  J 

4 

L  =  exp(/?0  +  P,)p(2)P~'  exp(-  exp(/?0  +  P,)2P) 


The  full  likelihood  using  the  Weibull  distribu¬ 
tion  can  now  be  formulated  as  a  product  of 
each  individual’s  contribution  (shown  on  the 
left).  We  have  used  a  small  dataset  (five  sub¬ 
jects)  for  ease  of  illustration  but  the  process 
can  be  generalized  for  any  number  of  subjects. 


x  J  exp(/?0 )p(tf  1  exp(- exp(/?0 )f)dt 
8 

x  exp(fS0)p(6)p~'  exp(—  exp(/?0)6p) 

2 

x  J  exp(fS0)p(f)p~l  exp(-  exp(f!0)f)dt 
0 


X 


8 


4 


exp(/?0  +  Pi)p(t)p  ' 

x  exp(—  exp(p0  +  Pi)tp)dr 


Obtaining  maximum  likelihood 
estimates 


Solve  system  of  equations: 
«  =  0j  =  l,2,...,N 

dfh 

where  N  =  #  of  parameters 


Once  the  likelihood  is  formulated,  the  question 
becomes:  which  values  of  the  regression  para¬ 
meters  would  maximize  L?  The  process  of  max¬ 
imizing  the  likelihood  is  typically  carried  out 
by  setting  the  partial  derivative  of  the  natural 
log  of  L  to  zero  and  then  solving  the  system  of 
equations  (called  the  score  equations).  The 
parameter  estimates  (e.g.,  p,  fS0,  jf  )  that  maxi¬ 
mize  L  are  the  maximum  likelihood  estimates. 


XI.  Interval-Censored  Data 

Parametric  likelihood 

•  Handles  right-,  left-,  or  interval- 
censored  data 

Cox  likelihood 


One  advantage  of  a  parametric  model  com¬ 
pared  to  a  Cox  model  is  that  the  parametric 
likelihood  easily  accommodates  right-,  left-,  or 
interval-censored  data.  The  Cox  likelihood,  by 
contrast,  easily  handles  right-censored  data 
but  does  not  directly  accommodate  left-  or 
interval-censored  data. 


Designed  to  handle  right- 
censored  data. 
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Interval-censored  study  design 

•  Check  for  nonsymptomatic 
outcome  once  a  year 

•  If  outcome  newly  detected, 
exact  time  occurred  during 
previous  year 

•  Left-censoring  special  case  of 
interval-censoring 

o  Zero  the  lower  boundary  of 
the  interval 


Sometimes  the  design  of  a  study  is  such  that  all 
the  data  are  interval-censored.  For  example, 
consider  a  study  in  which  healthcare  workers 
examine  subjects  once  a  year,  checking  for  a 
nonsymptomatic  outcome.  If  an  event  was  first 
detected  in  the  beginning  of  the  third  year,  then 
the  exact  time  of  the  outcome  occurred  some¬ 
time  between  the  second  and  third  years.  In 
this  framework  left-censoring  can  be  consid¬ 
ered  a  special  case  of  interval-censoring  with 
zero  as  the  lower  boundary  of  the  interval. 


Parametric  model  can  be  fitted 

•  f(t)  specified 

•  Contribution  to  likelihood  for 
each  subject 

o  Integrate  f(t)  over  event 
interval 


A  parametric  model  can  easily  be  fitted  using 
the  methods  described  in  the  previous  section. 
Once  a  distribution  for  the  outcome,  f(t),  is 
specified,  each  subject’s  contribution  to  the 
likelihood  is  obtained  by  integrating  f(t)  over 
the  interval  in  which  he  or  she  had  the  event. 


Binary  regression 

•  Alternative  approach  for 
interval-censored  data 

•  Outcome  coded 

o  0  if  subject  survives  interval 
o  1  if  subject  gets  event  during 
interval 

•  Useful  approach  if 

o  Ample  number  of  events  in 
each  interval 

o  Prefer  not  to  specify  f(t) 


A  binary  regression  (e.g.,  logistic  regression) 
is  an  alternative  approach  that  may  be  consid¬ 
ered  if  all  the  data  are  interval-censored.  With 
this  method,  the  outcome  variable  can  be 
coded  zero  if  the  subject  survives  the  interval 
and  coded  one  if  the  subject  gets  the  event 
during  the  interval.  This  approach  is  particu¬ 
larly  useful  if  there  are  an  ample  number  of 
events  in  each  interval  and  the  analyst  prefers 
not  to  specify  a  distribution  f(t)  for  continuous 
survival  time. 


Information  on  Three  Subjects 

Subject  1:  Gets  event  in  first 
interval 

Subject  2:  Survives  first  interval 
Survives  second 
interval  Gets  event  in 
third  interval 

Subject  3:  Survives  first  interval 
Gets  event  in  second 
interval 


For  illustration,  consider  a  small  dataset  con¬ 
taining  three  subjects.  Subject  1  gets  the  event 
in  the  first  interval  of  follow-up,  subject  2  gets 
the  event  in  the  third  interval,  and  subject  3  gets 
the  event  in  the  second  interval  of  follow-up. 
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Data  Layout  for  Binary  Regression 


SUBJECT 

EVENT 

Di 

d2 

d3 

TRT 

1 

1 

1 

0 

0 

1 

2 

0 

1 

0 

0 

0 

2 

0 

0 

1 

0 

0 

2 

1 

0 

0 

1 

0 

3 

0 

1 

0 

0 

1 

3 

1 

0 

1 

0 

1 

EVENT:  dichotomous  outcome 
coded  1  if  event,  0  for  no  event  dur¬ 
ing  the  interval 


Dj,  D2,  D3:  dummy  variables  for 
intervals  1,2,  and  3  coded  1  if  in  the 
corresponding  interval,  0  otherwise 

TRT:  Treatment  coded  1  for  new 
treatment,  0  for  placebo 


Logistic  Model 

Logit  P(Y  =  1)  =  piDj  +  p2D2 

+  p3D3  +  p4TRT 

where  P(Y  =  1)  is  the  probability 
of  event  for  a  given  interval 
conditioned  on  survival  of  previous 
intervals 

Interpretation  of  Parameters 

pp  Log  odds  of  event  in  1st 
interval  among  TRT  =  0 
p2:  Log  odds  of  event  in  2nd 
interval  given  survival  of  1st 
interval  among  TRT  =  0 
p3:  Log  odds  of  event  in  3rd 

interval  given  survival  of  first 
two  intervals  among  TRT  =  0 
p4:  Log  odds  ratio  for  TRT 


The  data  layout  is  shown  on  the  left.  Each  obser¬ 
vation  represents  one  interval  of  follow-up  time 
allowing  multiple  observations  per  subject. 
EVENT  is  the  dichotomous  outcome  variable. 
Subject  1  had  the  event  in  the  first  interval 
(EVENT  =1)  and  thus  has  one  observation. 
Subject  2  has  three  observations  because  she 
survived  the  first  two  intervals  (EVENT  =  0) 
but  got  the  event  in  the  third  interval.  Di  is  a 
dummy  variable  coded  1  if  the  observation 
represents  the  first  interval  and  0  otherwise. 
Similarly,  D2  is  coded  1  for  the  second  interval 
and  D3  is  coded  1  for  the  third  interval. 

TRT  is  the  predictor  of  interest,  coded  1  for 
the  new  treatment  and  0  for  the  placebo. 
TRT  could  be  coded  as  a  time-independent 
or  time-dependent  variable.  In  this  example, 
TRT  is  time-independent  because  TRT  does 
not  change  values  over  different  intervals 
corresponding  to  the  same  subject. 

A  logistic  model  (shown  at  left)  containing  the 
three  dummy  variables  and  TRT  can  be  formu¬ 
lated  with  the  data  in  this  form. 


Care  must  be  taken  with  the  interpretation  of 
the  parameters:  pi  is  the  log  odds  of  the  event 
occurring  in  the  first  interval  among  the  pla¬ 
cebo  group;  p2  is  the  log  odds  of  the  event 
occurring  in  the  second  interval  conditioned 
on  survival  of  the  first  interval  among  the  pla¬ 
cebo  group;  p3  is  the  log  odds  of  the  event 
occurring  in  the  third  interval  conditioned  on 
survival  of  the  first  and  second  intervals  among 
the  placebo  group;  and  p4  is  the  log  odds  ratio 
for  TRT. 
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D1(  D2,  D3  play  similar  role  as  inter¬ 
cept 

•  Baseline  measure  when 
covariates  are  zero 

•  3  parameters  rather  than  1 
intercept 

o  Baseline  measure  may  differ 
for  each  interval 

Odds  Ratio  (TRT  =  1  vs.  TRT  =  0) 
=  exp(p4) 

Model  uses  PO  assumption 

•  OR  constant  over  time 

•  PO  assumption  can  be  tested 
o  Include  interaction  terms 

with  TRT  and  dummy 
variables 

o  Significant  interaction 
suggests  PO  violation 
o  Need  ample  data  to 
practically  carry  out  test 

Alternative  Binary  Model 

log(  log(  1  -  P(Y  =  1))) 

=  piDj  +  p2D2  +  P3D3  +  p4TRT 

where  1  —  P(Y  =  1 )  is  the  probabil¬ 
ity  of  surviving  a  given  interval 
conditioned  on  survival  of  previous 
intervals 

Complementary  log-log  link 

•  Log-log  survival  modeled  as 
linear  function  of  regression 
parameters 

Logit  link 

•  Log  odds  of  failure  modeled  as 
linear  function  of  regression 
parameters 


The  dummy  variables  play  a  similar  role  to  that 
of  the  intercept  in  a  conventional  regression, 
providing  a  baseline  outcome  measure  for 
the  case  in  which  all  predictors  are  zero  (e.g., 
TRT  =  0).  In  general,  the  baseline  measure  may 
differ  for  each  interval,  which  is  the  reason  that 
the  model  contains  3  dummy  variables  rather 
than  1  intercept. 


The  odds  ratio  comparing  TRT  =  1  to  TRT  =  0 
is  obtained  by  exponentiating  p4.  This  model 
uses  the  proportional  odds  (PO)  assumption 

in  that  the  odds  ratio  is  assumed  constant  over 
time  (or  at  least  constant  at  the  end  of  each 
interval).  This  assumption  can  be  tested  by 
including  interaction  (product)  terms  with 
TRT  and  two  of  the  dummy  variables  in  the 
model.  A  statistically  significant  product  term 
would  suggest  a  violation  of  the  PO  assump¬ 
tion.  However,  if  there  are  sparse  data 
corresponding  to  particular  intervals,  it  will 
not  be  practical  to  cany  out  such  a  test  on 
those  intervals. 

Logistic  regression  is  not  the  only  type  of 
binary  regression  that  may  be  considered  for 
interval-censored  data.  An  alternative  binary 
model  (shown  on  the  left)  uses  the  comple¬ 
mentary  log-log  link  function  rather  than 
the  logit  link  function  that  is  used  for  the 
more  familiar  logistic  regression. 


A  model  using  a  complementary  log-log  link 
function  expresses  the  log  negative  log  survival 
probability  as  a  linear  function  of  regression 
parameters.  By  contrast,  a  model  using  a  logit 
link  function  expresses  the  log  odds  of  failure 
(i.e.,  getting  the  event)  as  a  linear  function  of 
regression  parameters. 
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Complementary  log-log  model  is 
PH  model 

•  HR  (TRT  =  1  vs.  TRT  =  0) 

=  exp(p4) 

•  HR  constant  over  time 

Log-log  survival  curves: 
parallel  =>  additive  effects 
=>  PH 

Complementary  log-log  link: 
additive  effects  on  log-log  scale 
=^>PH 


In  theory 

•  Survival  time  is  continuous 
In  practice 

•  Survival  time  measured  in 
intervals 

o  If  event  occurred  in  month  7 
then  event  occurred  in  an 
interval  of  time 

Discrete  survival  analysis 

•  Discrete  time 

•  for  example,  number  of 
menstrual  cycles  to  pregnancy 
rather  than  time  to  pregnancy 
o  Fraction  of  cycle  does  not 

make  sense 


The  complementary  log-log  binary  model  is 
a  proportional  hazards  model.  The  hazard 
ratio  comparing  TRT  =  1  to  TRT  =  0  is 
obtained  by  exponentiating  p4. 


Recall  we  can  use  log-log  survival  curves  to 
evaluate  the  PH  assumption  for  a  Cox  model. 
If  the  effects  are  additive  (e.g.,  parallel  for  TRT 
=  1  and  TRT  =  0)  then  the  PH  assumption  is 
assumed  to  hold.  The  underlying  idea  is  similar 
for  the  complementary  log-log  link  function  in 
that  additive  effects  are  assumed  on  the  log-log 
scale  (e.g.,  comparing  TRT  =  1  to  TRT  =  0). 

In  theory,  the  time-to-event  variable  in  survival 
analyses  is  thought  of  as  a  continuous  variable. 
In  practice,  however,  the  time  variable  is  typi¬ 
cally  an  interval  of  time.  For  example,  if  time  is 
measured  in  months  and  an  event  occurs  in 
month  7  then  the  event  is  recorded  as  having 
occurred  in  a  specific  interval  lasting  a  month. 


Discrete  survival  analysis  is  a  survival  analy¬ 
sis  in  which  the  outcome  variable  is  discrete, 
both  in  theory  and  in  practice.  For  example, 
consider  a  study  in  which  women  who  stop 
using  oral  contraception  are  followed  until 
pregnancy.  The  outcome  is  defined  as  the  num¬ 
ber  of  menstrual  cycles  until  pregnancy.  The 
number  of  cycles  rather  than  the  time  to  preg¬ 
nancy  is  used  because  the  cycle  length  varies 
among  women  and  a  woman  ovulates  only 
once  per  menstrual  cycle  (i.e.,  one  opportunity 
per  cycle  to  become  pregnant).  The  number  of 
cycles  is  a  discrete  outcome.  A  fraction  of  a 
cycle  does  not  make  sense. 
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Analyzing  discrete  survival  data 

•  Can  use  binary  regression 

•  Analogous  to  interval-censored 
data 

o  Discrete  outcome  — 
subjects  survive  discrete 
units  of  time 
o  Interval  outcomes  — 

subjects  survive  intervals  of 
time 


Binary  regression,  as  described  in  this  section, 
can  be  applied  for  discrete  survival  outcomes 
in  a  similar  manner  to  that  described  for 
interval-censored  outcomes.  With  this  method, 
subjects  can  be  conceptualized  as  surviving 
discrete  units  of  time  analogously  as  subjects 
surviving  continuous  intervals  of  time. 


XII.  Frailty  Models 

Frailty 

•  Random  component 

•  Accounts  for  extra  variability 
from  unobserved  factors 


In  this  section  we  consider  the  inclusion  of 
frailty  to  a  survival  model.  Frailty  is  a  random 
component  designed  to  account  for  variability 
due  to  unobserved  individual-level  factors  that 
is  otherwise  unaccounted  for  by  the  other  pre¬ 
dictors  in  the  model. 


Conceptualize  S(t)  two  ways: 

•  For  an  individual 

•  Averaging  over  a  theoretical 
large  population 


Consider  a  survival  model  with  a  continuous 
age  variable  and  dichotomous  smoking  status 
variable  as  the  only  predictors.  Under  this 
model  the  survival  function  for  a  33-year-old 
smoker  might  be  conceptualized  in  different 
ways.  One  way  is  as  the  survival  function  for 
an  individual  33-year-old  smoker.  The  second 
way  is  as  some  kind  of  averaging  over  a  theoret¬ 
ical  large  population  of  33-year-old  smokers. 


With  Frailty  Component 

Jake  and  Blake 

1 .  May  have  different  S(t)  due  to 
unobserved  factors 

2.  Extra  source  of  variability  in 
outcome  (e.g.,  more  variation 
than  expected  under  Weibull) 

Without  Frailty  Component 

Jake  and  Blake 

1 .  Have  same  S(t) 

2.  May  have  different  event  times 
because  event  time  is  random, 
following  some  distribution 
(e.g.,  Weibull) 


Now  suppose  a  "frailty”  component  is  included 
in  the  model.  Under  this  model,  we  can  concep¬ 
tualize  survival  functions  specific  to  each  indi¬ 
vidual.  If  Jake  and  Blake  are  both  33-year-old 
smokers,  not  only  might  their  observed  failure 
times  be  different,  but  under  this  model  their 
individual  survival  functions  could  also  be 
different.  Jake  may  be  more  "frail”  than  Blake 
due  to  unobserved  factors  accounting  for  indi¬ 
vidual  level  differences  in  his  hazard  and  sur¬ 
vival  functions.  These  unobserved  factors  may 
contribute  an  extra  layer  of  heterogeneity, 
leading  to  greater  variability  in  survival  times 
than  might  be  expected  under  the  model  (e.g., 
Weibull)  without  the  frailty  component. 
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The  frailty  component  a  (a  >  0) 

•  Unobserved  multiplicative 
effect  on  hazard 

•  Follows  distribution  g(a)  with 
p  =  E(a)  =  1 

•  Var(a)  =  0,  parameter  to  be 
estimated 

Hazard  and  survival  conditioned 

on  frailty 
h(t[a)  =  ah(t) 

S(t|a)  =  S(t)“ 


a  >  1 

•  Increased  hazard:  ah(t)  >  h(t) 

•  Decreased  survival:  S(t)a  <  S(t) 


a  <  1 

•  Decreased  hazard:  ah(t)  <  h(t) 

•  Increases  survival:  S(t)a  >  S(t) 


a  =  1  (average  frailty):  ah(t)  =  h(t) 

Survival  functions 
(with  frailty  models) 


1.  Conditional,  S(t|a),  individual 
level 

2.  Unconditional,  Su(t), 
population  level 


Unconditional  survival  function 

Su(t) 


Su(0 

hv{t) 


f  S(f|a)g{a}da 
o 

—d[Su(t)\/dt 

Su(0 


The  frailty  a  is  an  unobserved  multiplicative 
effect  on  the  hazard  function  assumed  to  fol¬ 
low  some  distribution  g (a)  with  a  >  0  and  the 
mean  of  a  equal  to  1.  The  variance  of  a  is  a 
parameter  0  (theta)  that  is  typically  estimated 
from  the  data. 


An  individual’s  hazard  function  conditional  on 
the  frailty  can  be  expressed  as  a  multiplied  by 
h(t).  Using  the  relationship  between  the  sur¬ 
vival  and  hazard  functions,  the  corresponding 
conditional  survival  function  can  be  expressed 
as  S(t)  raised  to  the  a  power. 

Individuals  with  a  >  1  have  an  increased  haz¬ 
ard  and  decreased  probability  of  survival  com¬ 
pared  to  those  of  average  frailty  (a  =  1). 
Similarly,  individuals  with  a  <  1  have  a 
decreased  hazard  and  increased  probability  of 
survival  compared  to  those  of  average  frailty. 


With  frailty  models,  we  distinguish  the  individ¬ 
ual  level  or  conditional  survival  function 

S(t|a)  discussed  above,  from  the  population 
level  or  unconditional  survival  function 

Su(t),  which  represents  a  population  average. 
Once  the  frailty  distribution  g(a)  is  chosen,  the 
unconditional  survival  function  is  found  by 
integrating  over  the  conditional  survival  func¬ 
tion  S(t|a)  times  g(a),  with  respect  to  a.  The 
corresponding  unconditional  hazard  hu(t)  can 
then  be  found  using  the  relationship  between 
the  survival  and  hazard  functions  (see  left). 
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Frailty  distribution  g(a),  a  >  0, 
E(a)  =  1 

Stata  offers  choices  for  g(a) 

1 .  Gamma 

2.  Inverse-Gaussian 

Both  distributions  parameterized 
in  terms  of  0,  where  Var(a)  =  0 


Any  distribution  for  a  >  0  with  a  mean  of  1 
can  theoretically  be  used  for  the  distribution 
of  the  frailty.  Stata  supports  two  distributions: 
the  gamma  distribution  and  the  inverse- 
Gaussian  distribution  for  the  frailty.  With 
the  mean  fixed  at  1,  both  these  distributions 
are  parameterized  in  terms  of  the  variance  0 
and  typically  yield  similar  results. 


EXAMPLE 


Vet  Lung  Cancer  Trial 
Predictors: 

TX  (dichotomous:  1  =  standard,  2  =  test) 
PERF  (continuous:  0  =  worst, 

100  =  best) 

DD  (disease  duration  in  months) 

AGE  (in  years) 

PRIORTX  (dichotomous:  0  =  none, 

10  =  some) 

Model  1.  No  Frailty 


Weibull  regression  (PH  form) 
Log  likelihood  =  —206.20418 


_t 

Coef. 

Std.  Err. 

z 

p>|z| 

tx 

.137 

.181 

0.76 

0.450 

perf 

-.034 

.005 

-6.43 

0.000 

dd 

.003 

.007 

0.32 

0.746 

age 

-.001 

.009 

-0.09 

0.927 

priortx 

-.013 

.022 

-0.57 

0.566 

_cons 

-2.758 

.742 

-3.72 

0.000 

/ln_p 

-.018 

.065 

-0.27 

0.786 

P 

.982 

.064 

1/p 

1.02 

.066 

To  illustrate  the  use  of  a  frailty  model,  we  apply 
the  data  from  the  Veteran’s  Administration 
Lung  Cancer  Trial  described  in  Chapter  5.  The 
exposure  of  interest  is  treatment  status  TX 
(standard  =  1,  test  =  2).  The  control  variables 
are  performance  status  (PERF),  disease  dura¬ 
tion  (DD),  AGE,  and  prior  therapy  (PRIORTX), 
whose  coding  is  shown  on  the  left.  The  out¬ 
come  is  time  to  death  (in  days). 

Output  from  running  a  Weibull  PH  model 
without  frailty  using  Stata  software  is  shown 
on  the  left  (Model  1).  The  model  can  be 
expressed:  h(t)  =  'kptp~l  where 

2  =  exp(ft  +  ft  TX  +  ftPERF  +  ftDD 
+ft  AGE  +  ft  PRIORTX). 

The  estimate  of  the  hazard  ratio  comparing 
TX  =  2  vs.  TX  =  1  is  exp(0.137)  =  1.15 
controlling  for  performance  status,  disease 
duration,  age,  and  prior  therapy.  The  estimate 
for  the  shape  parameter  is  0.982  suggesting  a 
slightly  decreasing  hazard  over  time. 
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1  EXAMPLE:  (continued)  1 

Model  2.  With  Frailty 

Weibull  regression  (PH  form) 

Gamma  frailty 

Log  likelihood  = 

-200.11338 

_t 

Coef. 

Std.  Err. 

z 

p>M 

tx 

.105 

.291 

0.36 

0.719 

perf 

-.061 

.012 

-5.00 

0.000 

dd 

-.006 

.017 

-0.44 

0.663 

age 

-.013 

.015 

-0.87 

0.385 

priortx 

-.006 

.035 

-0.18 

0.859 

_cons 

-2.256 

1.100 

-2.05 

0.040 

/In  p 

.435 

.141 

3.09 

0.002 

/ln_the 

-.150 

.382 

-0.39 

0.695 

P 

1.54 

.217 

1/p 

.647 

.091 

theta 

.861 

.329 

Likelihood  ratio  test  of  theta  =  ( 

): 

chibar2(01)  =  12.18 

Prob>= 

=chibar2= 

=0.000 

Comparing  Model  2  with  Model  1 

•  There  is  one  additional 
parameter  to  estimate  in 
Model  2 

•  The  actual  values  of 
individuals’  frailty  are  not 
estimated  in  Model  2 

•  The  coefficients  for  the 
predictor  variables  in  Models  1 
and  2  have  different  estimates 
and  interpretation 

•  The  estimate  of  the  shape 
parameter  is  <1.0  for  Model  1 
and  >1.0  for  Model  2 


Model  2  (output  on  left)  is  the  same  Weibull 
model  as  Model  1  except  that  a  frailty  compo¬ 
nent  has  been  included.  The  frailty  in  Model 
2  is  assumed  to  follow  a  gamma  distribution 
with  mean  1  and  variance  equal  to  theta  (0). 
The  estimate  of  theta  is  0.861  (bottom  row  of 
output).  A  variance  of  zero  (theta  =  0)  would 
indicate  that  the  frailty  component  does  not 
contribute  to  the  model.  A  likelihood  ratio  test 
for  the  hypothesis  theta  =  0  is  shown  directly 
below  the  parameter  estimates  and  indicates  a 
chi-square  value  of  12.18  with  1  degree  of  free¬ 
dom  yielding  a  highly  significant  p-value  of 
0.000  (rounded  to  three  decimals). 

Notice  how  all  the  parameter  estimates  are 
altered  with  the  inclusion  of  the  frailty.  The 

estimate  for  the  shape  parameter  is  now  1.54, 
quite  different  from  the  estimate  0.982  obtained 
from  Model  1 .  The  inclusion  of  frailty  not  only 
has  an  impact  on  the  parameter  estimates  but 
also  complicates  their  interpretation. 

Before  discussing  in  detail  how  the  inclusion  of 
frailty  influences  the  interpretation  of  the  para¬ 
meters,  we  overview  some  of  the  key  points 
(listed  on  the  left)  that  differentiate  Model  2 
(containing  the  frailty)  and  Model  1 . 

Model  2  contains  one  additional  parameter, 
the  variance  of  the  frailty.  However,  the  actual 
values  of  each  subject’s  frailty  are  not  esti¬ 
mated.  The  regression  coefficients  and  Weibull 
shape  parameter  also  differ  in  their  interpreta¬ 
tions  for  Model  2  compared  to  Model  1.  We 
now  elaborate  on  these  points. 
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Model  2 

Hazard  for  jth  individual: 


For  Model  2  we  can  express  the  Weibull  model 
with  a  gamma  distributed  frailty  in  terms  of  the 
individual  level  hazard  for  the  jth  subject. 


hj(t|ocj)  =  0Cjh(t)  j  =  1,  2,...,  n 

where  h(t)  =  Xptp~l 
with  X  —  exp(/?0  +  /?!  TX 

+  [l2  PERF  +  3  DD 

+  P4  AGE  +  [}5  PRIORTX) 
and  where  a  ~  gamma  (p  =  1 , 
variance  =  9) 


If  Xj  denotes  the  frailty  for  the  jth  subject,  then 
that  subject's  hazard  hj  (t|<Xj)  can  be  expressed 
as  a,  multiplied  by  h(t),  where  h(t)  is  the 
Weibull  hazard  function  parameterized  in 
terms  of  the  predictor  variables  and  their 
regression  coefficients  (see  left). 


otj  not  estimable 

•  An  ocj  associated  with  each 
subject 

•  Too  many  parameters 


The  values  for  each  aj  are  not  estimable  because 
there  is  a  level  of  frailty  associated  with  each 
data  point.  If  we  tried  to  estimate  each  subject’s 
frailty,  then  there  would  be  more  parameters  to 
estimate  than  observations  in  the  dataset  and 
the  model  would  be  overparameterized. 


Rather,  var[g(a)]  is  estimated 
•  Gamma  is  2-parameter 
distribution 
o  Mean  set  at  1.0 
o  6  =  Var(a)  is  estimated 


Rather,  the  variance  of  the  frailty  is  estimated. 
The  gamma  distribution  is  a  two-parameter 
distribution.  Because  the  mean  is  set  at  1,  we 
need  only  estimate  its  variance  to  fully  specify 
the  frailty  distribution. 


Interpreting  coefficients  in  Model  2 

HR  =  exp^j  =  1.11 

Estimates  HR  comparing  two  indi¬ 
viduals 

•  With  same  a 

•  One  with  TX  =  2,  other  with 
TX  =  1 

•  With  same  levels  of  other 
predictors 


The  estimated  coefficient  for  TX  using  Model 
2  is  0.105.  By  exponentiating,  we  obtain  exp 
(0.105)  =  1.11.  This  is  the  estimated  hazard 
ratio  for  two  individuals  having  the  same 
frailty  in  which  one  takes  the  test  treatment 
and  the  other  takes  the  standard  treatment 
controlling  for  the  other  co-variates  in  the 
model.  Thus,  for  two  individuals  with  the 
same  frailty,  we  can  use  the  coefficient  esti¬ 
mates  from  Model  2  to  estimate  the  ratio  of 
conditional  hazards. 
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Recall:  h(t|a)  =  ah(t) 


TX  =  1:  h!(t|cxi)  =  oqhi(t) 
TX  =  2:  h2(tja2)  =  a2hj(t) 


If  ^44  =  exp(^) 


then 


hi{t) 

8^/?i  (r) 


*2^2 (0 


=  exp(ft) 


only  if  oq  =  a2 


Another  interpretation  for  exp([h) 

•  Ratio  of  conditional  hazards 
from  the  same  individual 

•  Effect  for  individual  taking  test 
rather  than  standard  treatment 

Model  1  (p=  0.982) 

Decreasing  hazard  for  individual 
and  population  because  (/5<  1) 

Model  2  (p=  1.54) 

Complication: 

Individual  level  hazard 
vs 

Population  level  hazard 


For  Model  2 

Conditional  hazard  increases 
but 

unconditional  hazard  unimodal 


To  clarify,  recall  that  the  individual  level  or 
conditional  hazard  function  can  be  expressed 
as  a  multiplied  by  h(t).  Suppose  fqfrlai)  and 
h2(t|a2)  are  the  conditional  hazard  functions 
for  individuals  who  use  the  standard  and  test 
treatments,  respectively,  at  the  mean  levels  of 
the  other  covariates.  If  the  ratio  of  h2(t)  and 
h | (t)  equals  exp((h),  then  the  ratio  of  h2(t|a2) 
and  hiltloq)  equals  exp(|h)  only  if  the  indivi¬ 
duals  have  the  same  level  of  frailty  (i.e.,  oq  = 
a2;  see  left). 

Another  way  to  interpret  the  exponentiated 
coefficient  for  TRT,  exp^),  is  as  a  ratio  of 
conditional  hazards  from  the  same  individual. 
This  measure  can  be  used  to  estimate  an  effect 
for  an  individual  taking  the  test  treatment 
instead  of  the  standard  treatment. 

A  somewhat  striking  difference  in  the  output 
from  Model  1  and  Model  2  is  the  estimate  of  the 
shape  parameter.  The  hazard  estimated  from 
Model  1  (without  the  frailty)  is  estimated  to 
decrease  over  time  because  p  <  1 .  By  contrast, 
the  estimated  individual  level  hazard  from 
Model  2  is  estimated  to  increase  over  time 
because  p  >  1.  However,  the  interpretation  of 
the  shape  parameter  in  Model  2  has  an  addi¬ 
tional  complication  that  should  be  considered 
before  making  direct  comparisons  with  Model  1 . 
For  frailty  models,  we  have  to  distinguish 
between  the  individual  level  and  population 
level  hazards. 

Although  the  estimated  individual  level  or  con¬ 
ditional  hazard  from  Model  2  is  estimated  to 
increase  over  time,  the  estimated  population 
level  or  unconditional  hazard  does  not  strictly 
increase.  The  unconditional  hazard  first 
increases  but  then  decreases  to  zero,  resulting 
in  a  unimodal  shape  due  to  the  effect  of  the 
frailty,  as  we  will  now  explain. 
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Estimated  unconditional  hazard 
Model  2  (TX  =  1,  mean  level  for 
other  covariates,  p  =  1.54) 


1 

analysis  time 
Weibull  regression 


On  the  left  is  a  plot  (from  Model  2)  of  the 
estimated  unconditional  hazard  for  those  on 
standard  treatment  (TX  =  1 )  with  mean  values 
for  the  other  covariates.  The  graph  is  unimo- 
dal,  with  the  hazard  first  increasing  and  then 
decreasing  over  time.  So  each  individual  has  an 
estimated  increasing  hazard  (p  =  1.54),  yet  the 
hazard  averaged  over  the  population  is  unim- 
odal,  rather  than  increasing.  How  can  this  be? 

The  answer  is  that  the  population  is  comprised 
of  individuals  with  different  levels  of  frailty. 
The  more  frail  individuals  (a  >  1)  have  a 
greater  hazard  and  are  more  likely  to  get  the 
event  earlier.  Consequently,  over  time,  the  “at 
risk  group’’  has  an  increasing  proportion  of  less 
frail  individuals  (a  >  1),  decreasing  the  popula¬ 
tion  average,  or  unconditional,  hazard. 


Four  increasing  individual  level 
hazards,  but  average  hazard  de¬ 
creases  from  ti  to  t2 


To  clarify  the  above  explanation,  consider  the 
graph  on  the  left  in  which  the  hazards  for  four 
individuals  increase  linearly  over  time  until 
their  event  occurs.  The  two  individuals  with 
the  highest  hazards  failed  between  times  ti 
and  t2  and  the  other  two  failed  after  t2.  Conse¬ 
quently,  the  average  hazard  (h2)  of  the  two 
individuals  still  at  risk  at  t2  is  less  than  the 
average  hazard  (/q)  of  the  four  individuals  at 
risk  at  ti.  Thus  the  average  hazard  of  the  “at 
risk”  population  decreased  from  tj  to  t2  (i.e., 
h2<h\)  because  the  individuals  surviving  past 
t2  were  less  frail  than  the  two  individuals  who 
failed  earlier. 


Frailty  Effect 

hu(t)  eventually  decreases 
because 

“at  risk  group”  becoming  less  frail 
over  time 


This  property,  in  which  the  unconditional  haz¬ 
ard  eventually  decreases  over  time  because  the 
“at  risk  group”  has  an  increasing  proportion  of 
less  frail  individuals,  is  called  the  frailty  effect. 
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Unconditional  hazard  hu(t)  with 
gamma  frailty 


hu(t)  l-01n[S(f)] 


The  unconditional  hazard  function  hu(t),  with 
gamma  frailty  is  shown  on  the  left. 


If  0  =  0  then  hu(t)  =  h(t)  If  0  =  0,  then  hu(t)  reduces  to  h(t)  indicating 

(no  frailty)  that  there  is  no  frailty. 


For  Model  2: 

•  h(t)  and  S(t)  are  Weibull 

•  At  t  =  0 

o  hu(t)  =  h(t)  (increasing) 

•  As  t  gets  large 

o  If  0  >  0  then  hu(t)  — >  0 

•  So  hu(t)  increases  and  then 
decreases  (unimodal) 


An  examination  of  the  expression  for  hv(t ) 
gives  further  insight  into  how  we  obtained  an 
estimated  unconditional  hazard  of  unimodal 
shape.  S(t)  and  h(t)  represent  the  survival  and 
hazard  functions  ignoring  the  frailty,  which  for 
Model  2  corresponds  to  a  Weibull  distribution. 
If  t  =  0  then  hu(t)  =  h(t),  which  for  Model 
2  yields  an  estimated  increasing  hazard.  As 
t  gets  larger,  and  if  9  >  0,  the  denominator 
gets  larger  (because  ln[S(t)]  is  negative)  until 
eventually  hu(t)  approaches  zero.  So  hu(t)  is 
increasing  at  t  =  0  but  eventually  decreases  to 
zero,  which  means  at  some  point  in  time,  hu(t) 
changes  direction. 


Population  level  hazards  (with 
gamma  frailty) 


hui(t) 

hu2(t) 


MO 

1  -  0  ln[M0] 
MO 

1  -  0  ln[s2(0] 


for  TX  =  1 


for  TX  =  2 


Ratio  of  unconditional  hazards 
(not  PH) 

hui(t)  _  MO  1  —  0  ln[M0] 

M(0  MO  X  1  -  0  MMO] 


A  consequence  of  the  frailty  effect  is  the  need  to 
distinguish  between  the  ratio  of  individual 
level  hazards  and  the  ratio  of  population  level 
hazards.  For  the  population  level  hazards,  the 
PH  assumption  is  violated  when  a  gamma  (or 
inverse-Gaussian)  distributed  frailty  is  added 
to  a  PH  model.  To  see  this  for  gamma  frailty, 
let  hMt)  and  hU2(t)  be  the  unconditional  haz¬ 
ard  functions  representing  the  standard  and 
test  treatments,  respectively,  at  the  mean  levels 
of  the  other  covariates.  The  ratio  of  these 
hazards  is  shown  on  the  left. 
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If 


h2(t) 

h\{t) 


exp(|3i) 


then 


limit) 

limit) 


exp(  (3  j )  x 


1  -9  ln[Si  (f )] 
1  -  0ln[S2(t)] 


not  constant  over  time, 
PH  violated 


Plots  of  $(i) 

•  Generally  averaged  over 
population 

o  An  important  consideration 
for  frailty  models 


Suppose  ln[-ln  S  (/)]  curves  for  TX 
start  parallel  but  then  converge  over 
time: 


1. 


2. 


It  may  be  effect  of  TX  weakens 
over  time || 

PH  model  not  appropriate 


It  may  be  effect  of  TX  is 
constant  over  time  but 
unobserved  heterogeneity  is  in 
population  || 


PH  model  with  frailty  is 
appropriate 


Model  2  (Weibull  with  frailty) 


•  Used  PH  parameterization 

•  Can  equivalently  use  AFT 
parameterization 


If  the  ratio  of  h2(t)  and  hi(t)  equals  expCffr), 
then  the  ratio  of  the  unconditional  hazards 
equals  expipj  times  the  ratio  of  1  -  0  ln[Si(7)] 
and  1—0  ln[S2(t)].  This  latter  ratio  is  a  function 
of  time  and  only  cancels  when  t  equals  zero. 
Therefore  the  ratio  of  the  unconditional 
hazards  is  not  constant  over  time,  thus  violat¬ 
ing  the  PH  assumption. 


Generally,  survival  plots  are  estimated  over  a 
population  average  (e.g.,  Kaplan-Meier). 
When  considering  PH  models  without  frailty, 
we  do  not  need  to  distinguish  between  the  con¬ 
ditional  and  unconditional  survival  functions. 
However,  this  distinction  needs  to  be  consid¬ 
ered  with  frailty  models. 

Suppose  we  plot  Kaplan-Meier  log-log  survival 
estimates  evaluating  the  PH  assumption  for 
treatment  (TX  =  2  vs.  TX  =  1),  and  the  plots 
start  out  parallel  but  then  begin  to  converge 
over  time.  One  interpretation  is  that  the  effect 
of  the  treatment  weakens  over  time.  For  this 
interpretation,  a  PH  model  is  not  appropriate. 
Another  interpretation  is  that  the  effect  of  the 
treatment  remains  constant  over  time  but  the 
plots  converge  due  to  unobserved  heterogene¬ 
ity  in  the  population.  For  this  interpretation,  a 
PH  model  with  frailty  would  be  appropriate. 


Recall,  from  Section  VI  of  this  chapter,  that  a 
Weibull  PH  model  is  also  an  AFT  model.  The 
only  difference  is  in  the  way  the  model  is  para¬ 
meterized.  We  next  present  the  AFT  form  of 
Model  2. 
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Unconditional  survival  function 
Su(t)  with  gamma  frailty  g(a) 

OO 

Su(t)  =  J  S(t\cc)g  (a.)d a 

o 

=  [[1  -01n  S(r)]-1/e 

Model  3  (Weibull  AFT  with  gamma 
frailty) 

Su(t)  =  [l-OlnS(t)]~l/0 

where  S(t)  =  exp( -/./'')  (Weibull) 
and 

YJp  =  exp(a0  +  aiTX 

+  a.2  PERF  +  ot3  DD 
+a4  AGE  +  0C5  PRIORTX) 

Model  3  Output 


Weibull  regression  (AFT  form) 

Gamma  frailty 

Log  likelihood  =  —200. 11338 


_t 

Coef. 

Std.  Err. 

z 

p>  N 

tx 

-.068 

.190 

-0.36 

0.721 

perf 

.040 

.005 

8.37 

0.000 

dd 

.004 

.009 

0.44 

0.661 

age 

.008 

.009 

0.89 

0.376 

priortx 

.004 

.023 

0.18 

0.860 

_cons 

1.460 

.752 

1.94 

0.052 

/ln_p 

.435 

.141 

3.09 

0.002 

/ln_the 

-.150 

.382 

-0.39 

0.695 

P 

1.54 

.217 

1/p 

.647 

.091 

theta 

.861 

.329 

Likelihood  ratio  test  of  theta  =  0: 
chibar2(01)  =  12.18 
Prob>=chibar2  =  0.000 

y(TX  =  2  vs.  1)  =  exp(— 0.068) 

=  0.93 

Comparing  individuals  with  same  a 


Before  stating  the  model,  we  show  the  uncon¬ 
ditional  survival  function  using  gamma  frailty. 
Recall  that  the  unconditional  survival  function 
is  obtained  by  integrating  over  the  frailty,  as 
shown  on  the  left. 


Model  3  (the  AFT  form  of  Model  2)  is  presented 
in  terms  of  the  unconditional  survival  function 
Su(t).  The  unconditional  survival  function  is  a 
function  of  S(t),  which  represents  the  Weibull 
survival  function.  The  Weibull  survival  func¬ 
tion,  in  turn,  is  parameterized  in  terms  of  the 
shape  parameter  p  and  regression  coefficients 
using  AFT  parameterization  (see  left). 


The  output  for  Model  3,  shown  on  the  left,  is 
similar  to  that  obtained  from  Model  2.  The 
estimates  for  theta  and  p  are  identical  to  those 
obtained  from  Model  2.  The  difference  is  that 
the  regression  coefficients  obtained  with 
Model  3  use  AFT  parameterization,  i.e.,  multi¬ 
ply  by  — p  =  -1.54  to  get  the  PH  coefficient 
estimates  in  Model  2. 

An  estimated  acceleration  factor  of  0.93  com¬ 
paring  two  individuals  with  the  same  level  of 
frailty,  for  the  effect  of  treatment  (TX  =  2  vs. 
TX  =1)  and  controlling  for  the  other  covari¬ 
ates,  is  obtained  by  exponentiating  the  esti¬ 
mated  coefficient  (—0.068)  of  the  TX  variable. 
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Interpreting  y 

•  Taking  test  treatment  reduces 
individual’s  median  survival 
time  by  factor  of  0.93 

•  Suggests  slightly  harmful  effect 

•  ot!  is  not  significant  (p  =  0.721) 


Another  interpretation  for  this  estimate  is  that 
an  individual  taking  the  test  treatment  instead 
of  the  standard  treatment  reduces  her  median 
survival  time  (i.e.,  contracts  her  individual  level 
survival  function)  by  an  estimated  factor  of 
0.93.  This  estimate  suggests  a  slight  harmful 
effect  from  the  test  treatment  compared  to  the 
standard  treatment.  However,  the  estimated 
coefficient  for  TX  is  not  significant,  with  a  p- 
value  of  0.721. 


PH  assumption 

Individual  level  PH  A-  Population 
level  PH 

AFT  assumption 

Individual  level  AFT  =>■  Population 
level  AFT 


A  key  difference  between  the  PH  and  AFT 
formulations  of  this  model  is  that  if  the 
AFT  assumption  holds  at  the  individual 
level,  then  it  will  also  hold  at  the  population 
level  using  the  gamma  (or  inverse-Gaussian) 
distributed  frailty. 


Population  level  survival 
gamma  frailty) 


Sm(t)=[[l  -einSiCt)]-170 
Su2(t)=[[l  -  0  In  S2(t)]- 1/0 


(with 


To  see  this  for  gamma  frailty,  let  Sui(t)  and  SU2(t) 
be  the  unconditional  survival  functions  repre¬ 
senting  the  standard  and  test  treatments  respec¬ 
tively,  at  the  mean  levels  of  the  other  covariates. 


Also  let  y  represent  the  individual  level  acceler- 
If  Si(t)  =  S2(yt)  ation  factor  for  treatment;  that  is,  Si(t)  = 

then  S2(yt).  Then  Sui(t)  =  Su2(yt)  (see  left). 

Sui(f)  =  [[1  -6InSi(t)]~1/e 
=  [[1  -9InS2(yt)]-1/e 
=  Sm(yt) 


Thus, 

Individual  level  AFT 
=>  Population  level  AFT 


Thus,  for  models  with  gamma  frailty,  if  the 
AFT  assumption  holds  at  the  individual  level 
then  it  also  holds  at  the  population  level. 
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Coefficient  estimates  from  Model  3 

•  Applies  to  individual  or 
population 

•  Interpretation  of  explo^)  =  0.93 
o  Median  survival  time  for 

individual  reduced  by  factor 
of  0.93 

o  Median  survival  time 
reduced  in  population  by 
factor  of  0.93 

Models  2  and  3: 

Same  model,  different 
parameterization 
Same  estimates  for 
S(t),  Su(t),  h(t),  hu(t) 

Models  2  and  3:  Weibull  with 

gamma  frailty 

•  Unimodal  unconditional 
hazard 

Log-logistic  model 

•  Accommodates  unimodal 
hazard  without  a  frailty 
component 

Parametric  likelihood  with  frailty 

•  Uses  fu(t),  where  fu(t)  =  hu(t) 
Su(t) 

•  Formulated  similarly  to  that 
described  in  Section  X  with 
fu(t)  replacing  f(t) 

•  Additional  parameter  9 


The  coefficient  estimates  obtained  from  Model 
3  can  therefore  be  used  at  the  population  level 
as  well  as  the  individual  level.  So  another  inter¬ 
pretation  for  the  estimated  acceleration  factor 
for  treatment  is  that  the  test  treatment  reduces 
the  median  survival  time  in  the  population  by 
an  estimated  factor  of  0.93. 


Model  2  and  Model  3  are  the  same  model  but 
use  different  parameterizations.  The  models 
provide  identical  estimates  for  the  hazard  and 
survival  functions. 


Recall  that  the  estimated  unconditional  hazard 
function  obtained  from  this  frailty  model  is  of 
unimodal  shape.  Alternatively,  a  log-logistic 
(or  lognormal)  model,  which  accommodates  a 
unimodal-shaped  hazard  function,  could  have 
been  run  without  the  frailty  (see  Practice  Exer¬ 
cises  8  to  11  for  comparison). 


The  likelihood  for  Model  3  can  be  formulated 
using  the  unconditional  probability  density 
function  fu(t)  which  is  the  product  of  the 
unconditional  hazard  and  survival  functions. 
The  likelihood  is  constructed  in  a  similar  man¬ 
ner  to  that  described  previously  in  this  chapter 
except  that  fu(t)  is  used  for  the  likelihood 
rather  than  f(t)  (see  Section  X).  The  main  dif¬ 
ference  is  that  there  is  one  additional  parame¬ 
ter  to  estimate,  the  variance  of  the  frailty. 
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Shared  Frailty 

•  Clusters  share  same  frailty 

•  For  example,  subjects  from 
same  family  may  share 
unobserved  factors 

o  Shared  frailty  designed  to 
account  for  such  similarities 


Another  type  of  frailty  model  is  the  shared 
frailty  model.  With  this  model,  clusters  of  sub¬ 
jects  are  assumed  to  share  the  same  frailty.  For 
example,  subjects  from  the  same  family  may  be 
similar  with  respect  to  some  unobserved 
genetic  or  environmental  factors.  Allowing 
family  members  to  share  the  same  frailty  is 
designed  to  account  for  such  similarities. 


Unshared  Frailty  By  contrast,  the  frailty  described  previous  to 

this  point  (unshared  frailty)  has  been  assumed 

•  The  type  of  frailty  we  have  to  be  distributed  independently  among  subjects, 

described  previous  to  this  point 

•  Frailty  distributed 
independently  among  subjects 


Shared  Frailty  Models 

•  Similar  to  random  effect 
regression  models 

•  Accounts  for  within-cluster 
correlation 

•  6  is  a  measure  of  the  degree  of 
correlation 


Adding  shared  frailty  to  a  survival  model  plays 
an  analogous  role  to  that  of  adding  a  random 
effect  to  a  linear  regression  as  a  way  to  account 
for  correlation  between  clusters  of  observa¬ 
tions  (Kleinbaum  and  Klein  2010).  The  esti¬ 
mate  for  the  variance  parameter  6  in  a  shared 
frailty  model  can  be  thought  of  as  a  measure  of 
the  degree  of  correlation,  where  0  =  0  indicates 
no  within-cluster  correlation. 


Hazard  conditional  on  shared 
frailty  (for  jth  subject  in  kth  cluster) 


hjk(t|ak)  =  akhjk(t) 
where 

hjk(t)  =  h(t|Xjk) 
for  j  =  1,  2,. . .,  nk 
and  total  nk  subjects  in  kth  cluster 

If  family  is  the  cluster  variable, 
then 

subjects  of  same  family  have  same 
ak 


For  a  shared  frailty  model,  the  conditional 
hazard  function  for  the  jth  subject  from  the 
kth  cluster  can  be  expressed  as  ak  multiplied 
by  hjk(t)  where  lijk(t)  depends  on  the  subject’s 
covariates  Xjk.  Notice  that  the  frailty  ak  is  sub¬ 
scripted  by  k,  but  not  by  j.  This  indicates  that 
subjects  from  the  same  cluster  share  the  same 
frailty.  If,  for  example,  subjects  are  clustered  by 
family,  then  subjects  from  the  same  family  are 
assumed  to  have  the  same  frailty. 
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Shared  and  unshared  frailty 

•  Fundamentally  the  same 

o  Accounts  for  variation  due 
to  unobservable  factors 

•  Difference  in  data  to  which 
they  are  applied 

o  Affects  interpretation  and 
methods  of  estimation 


The  frailty  in  a  shared  frailty  model  or 
unshared  frailty  model  is  fundamentally  the 
same,  a  random  effect  to  account  for  a  source 
of  variation  due  to  unobservable,  or  latent,  fac¬ 
tors.  However,  the  data  to  which  the  shared 
and  unshared  frailty  is  applied  are  different, 
affecting  differences  in  interpretation  and 
methods  of  estimation. 


Unshared  frailty  models 

•  Subjects  assumed  independent 

Shared  frailty  models 

•  Accounts  for  dependence 
among  subjects  who  share 
frailty 


For  unshared  frailty  models,  a  subject’s  sur¬ 
vival  is  assumed  to  be  independent  of  the  sur¬ 
vival  of  other  subjects  in  the  study  population. 
For  shared  frailty  models,  however,  the  frailty 
accounts  for  dependence  among  subjects  who 
share  the  same  frailty.  Shared  frailty  provides 
an  approach  to  account  for  correlation  in  the 
data  due  to  unobservable  factors  common 
within  clusters  of  subjects. 


Likelihood  for  shared  frailty  models 

•  More  complicated  than  for 
unshared  frailty  models 

•  Unconditional  contribution  of 
each  cluster  formulated 
separately  by  integrating  out 
g(a) 

•  Full  likelihood  formed  as 
product  of  unconditional 
contribution  from  each  cluster 


The  formulation  of  the  likelihood  is  more  com¬ 
plicated  for  shared  frailty  models  than  it  is 
for  unshared  frailty  models.  To  construct  the 
shared  frailty  likelihood,  the  unconditional 
contribution  for  each  cluster  of  subjects  is 
formulated  separately  by  integrating  out  the 
frailty  from  the  product  of  each  subject’s  con¬ 
ditional  contribution.  The  full  likelihood  is 
then  formulated  as  the  product  of  the  contri¬ 
butions  from  each  cluster  (see  Gutierrez  2002 
for  details). 
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Shared  frailty  in  Cox  model 

•  Provided  by  Stata 

o  Only  gamma  distributed 
shared  frailty  available 

•  Accounts  for  within-group 
correlation 

Cox  shared  frailty  model 

hij(t|<Xj)  =  akh0(t)  exp(pXjk) 

for  j  =  1,  2,...,  nk 
total  of  nk  subjects  in  kth  cluster 

PH  violation  of  hu(t)  in  Cox  model 

•  if  gamma-distributed  frailty 
included 

•  Interpreting  coefficient 
estimates 

o  Only  used  for  HR  estimates 
among  those  who  share 
same  a 


Recurrent  events 

•  Multiple  events  from  same 
subject 

•  Events  from  same  subject  may 
be  correlated 

•  Clusters  are  formed 
representing  each  subject 

o  Different  subjects  do  not 
share  frailty 

o  Observations  from  same 
subject  share  frailty 

Recurrent  events: 

•  Topic  of  next  chapter 
(Chapter  8) 


Up  to  this  point  we  have  discussed  frailty  in 
terms  of  parametric  models.  Stata  (version  8) 
allows  shared  frailty  to  be  included  in  a  Cox 
model  in  order  to  account  for  within-group 
correlation.  The  conditional  hazard  function 
for  the  jth  subject  from  the  kth  cluster  can  be 
expressed  as  ak  multiplied  by  the  baseline  haz¬ 
ard  h0(t)  multiplied  by  exp(pXjk).  The  frailty 
component  is  assumed  to  follow  some  distribu¬ 
tion  even  though  the  distribution  is  unspecified 
for  the  rest  of  the  model.  Stata  only  allows  a 
gamma  distribution  for  the  frailty  to  be 
included  with  a  Cox  model. 

If  a  gamma-distributed  frailty  component  is 
added  to  the  Cox  model,  then  the  PH  assump¬ 
tion  is  not  satisfied  for  the  unconditional 
hazards.  In  this  framework,  the  frailty  in  a 
Cox  model  can  be  thought  of  as  a  source  of 
random  error  that  causes  violation  of  the  PH 
assumption  at  the  population  level.  Conse¬ 
quently,  care  must  be  taken  in  the  interpreta¬ 
tion  of  the  coefficient  estimates.  They  can  only 
be  used  to  obtain  estimates  for  hazard  ratios 
conditioned  on  the  same  level  of  frailty. 

Shared  frailty  models  can  also  be  applied  to 
recurrent  event  data.  It  is  reasonable  to  expect 
that  multiple  events  occurring  over  follow-up 
from  the  same  individual  would  be  correlated. 
To  handle  within-subject  correlation,  clusters 
are  formed,  each  containing  observations  from 
the  same  subject.  In  this  setting,  it  is  not  the 
case  that  different  subjects  share  the  same 
frailty.  Rather,  multiple  observations  repre¬ 
senting  the  same  subject  share  the  same  frailty. 


Survival  analyses  on  recurrent  events  are 
the  focus  of  the  next  chapter  (Chapter  8)  of 
this  text.  An  example  of  a  Weibull  model  with 
shared  frailty  applied  to  recurrent  event  data 
is  presented  in  the  next  chapter. 
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XIII.  Summary 

Parametric  Models 

•  Assume  distribution  for 
survival  time 

•  Distribution  specified  in  terms 
of  parameters 

•  Parameters  estimated  from  data 


In  this  chapter  we  presented  parametric  survival 
models  as  an  alternative  to  the  Cox  model.  They 
are  called  parametric  models  because  the  dis¬ 
tribution  of  the  time-to-event  variable  is  speci¬ 
fied  in  terms  of  unknown  parameters,  which  are 
estimated  from  the  data.  Distributions  that 
are  commonly  utilized  are  the  exponential,  the 
Weibull,  the  log-logistic,  the  lognormal,  and  the 
generalized  gamma. 


f(t)  specified  ^corresponding  S(t), 
h(t)  also  determined 


Moreover, 

Specifying  one  of  f(t),  S(t),  or  h 
(t)  determines  all  three 
functions 


More  precisely,  for  parametric  survival  models, 
it  is  the  probability  density  function  f(t)  of  the 
distribution  that  is  specified  in  terms  of  the 
parameters.  Once  f(t)  is  specified,  the  corres¬ 
ponding  survival  and  hazard  functions  S(t)  and 
h(t)  can  also  be  determined.  Moreover,  specify¬ 
ing  any  one  of  the  probability  density  function, 
survival  function,  or  hazard  function  allows 
the  other  two  functions  to  be  determined. 


Parametric  models 

•  Need  not  be  PH  models 

•  Many  are  AFT  models 


The  proportional  hazards  (PH)  assumption  is 
the  underlying  assumption  for  a  Cox  PH  model. 
However,  parametric  survival  models  need  not 
be  proportional  hazards  models.  Many  para¬ 
metric  models  are  acceleration  failure  time 
(AFT)  models  rather  than  proportional  hazards 
models. 


Acceleration  factor  (y) 

•  Key  measure  of  association  in 
AFT  models 

•  Describes  stretching  or 
contraction  of  S(t) 

AFT  assumption 


The  acceleration  factor  (y)  is  the  key  measure 
of  association  obtained  in  an  AFT  model. 
It  describes  the  “stretching  out”  or  contraction 
of  survival  functions  when  comparing  one 
group  to  another.  If  Sfrt)  and  S2(t)  are  the 
survival  functions  for  Group  1  and  Group  2, 
respectively,  then  the  AFT  assumption  can  be 
expressed  as  S2(t)  =  Sfryt). 


S2(t)  =  S1(yt) 

T  T 

Group  2  Group  1 


Detailed  examples  presented:  We  presented  detailed  examples  of  the  expo¬ 

nential,  Weibull,  and  log-logistic  model  using 

•  Exponential  model  the  remission  dataset. 

•  Weibull  model 

•  Log-logistic  model 


342  7.  Parametric  Survival  Models 


Exponential  Model 

•  h(t)  =  X  (constant  hazard) 

•  Special  case  of  Weibull  model 

Weibull  Model 

•  AFT  PH 

Log-logistic  Model 

•  Not  a  PH  model 

•  AFT  PO 


The  underlying  assumption  for  an  exponential 
model,  a  special  case  of  the  Weibull  model,  is 
that  the  hazard  function  is  constant  over  time 
(i.e.,  h(t)  =  X).  The  Weibull  model  is  unique 
in  that  if  the  PH  assumption  holds  then  the 
AFT  assumption  also  holds  (and  vice  versa). 
The  log-logistic  model  does  not  satisfy  the  PH 
assumption.  However,  if  the  AFT  assumption 
holds  in  a  log-logistic  model,  then  the  pro¬ 
portional  odds  (PO)  assumption  also  holds 
(and  vice  versa). 


PO  assumption 

S{t,x)/[l-S{t,x)] 

OR  is  constant  over  time 


The  idea  underlying  the  proportional  odds 
assumption  is  that  the  survival  (or  failure) 
odds  ratio  comparing  two  specifications  of 
covariates  remains  constant  over  time. 


Graphical  Evaluation 
Weibull  and  Exponential 
•  Plot  ln[— In  §(t)]  against  ln(t) 
Log-logistic: 


Plot  In 


S(t) 

(i-s(O) 


against  ln(t). 


Check  for  linearity 


We  presented  graphical  approaches  for  evalu¬ 
ating  the  appropriateness  of  the  exponential, 
Weibull,  and  log-logistic  model  by  plotting  a 
function  of  the  Kaplan-Meier  survival  esti¬ 
mates  S(t)  against  the  log  of  time  and  then 
checking  for  linearity. 

For  evaluation  of  the  exponential  and  Weibull 
assumptions,  the  ln[— In  S(t)]  is  plotted  against 
ln(t)  and  for  evaluation  of  the  log-logistic 
assumption  the  log  odds  of  S(t)  is  plotted 
against  ln(t). 


Presented  other  parametric  models  We  briefly  discussed  other  parametric  models 

such  as  the  generalized  gamma,  lognormal, 

•  Generalized  gamma  model  and  Gompertz  models  and  showed  additional 

•  Lognormal  model  parametric  approaches  such  as  modeling  ancil- 

•  Gompertz  model  lary  (shape)  parameters  as  a  function  of  pre¬ 

dictor  variables. 
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Contributions  to  Likelihood 

If  event  at  t,  contributes  f(t) 

If  censored,  integrate  over  f(t) 
h 

J f(t)dt :  left  -  censored  at  ti 
o 

OO 

J  f{t)dt :  right  —  censored  at  t\ 

h 

t2 

j  f(t)dt  dnterval  —  censored 
/  from  t\  to  ?2 

Full  likelihood  (L) 

N 

L  =  l[Lj  j  =  1,2, ...  ,N 

j=  i 

where  L,  is  the  contribution  from 
jth  subject  and  N  =  #  of  subject’s 

Binary  regression  for  interval- 
censored  data 

•  Follow-up  divided  into 
intervals 

o  Allows  for  multiple 

observations  per  subject 

•  Binary  outcome  variable 
defined 

o  Indicates  survival  or  failure 
over  each  interval 

Binary  regression  for  discrete  sur¬ 
vival  analysis 

•  Analogous  to  interval-censored 
data 

o  Discrete  outcome-subjects 
survive  discrete  units  of 
time 

o  Interval  outcomes-subjects 
survive  intervals  of  time 


The  parametric  likelihood  was  developed  and 
includes  a  discussion  of  left-,  right-,  and  interval- 
censored  data.  If  a  subject  has  an  event  at  time  t, 
then  that  subject’s  contribution  to  the  like¬ 
lihood  is  f(t).  On  the  other  hand,  if  a  subject  is 
censored  (i.e.,  exact  time  of  event  unknown), 
then  the  subject’s  contribution  to  the  likelihood 
is  found  by  integrating  over  f(t).  The  integra¬ 
tion  limits  are  determined  by  the  time  and  type 
of  censorship  (see  left). 


Assuming  independence  among  subjects,  the 
full  likelihood  can  be  formulated  as  a  product 
of  each  subject’s  contribution. 


We  showed  how  binary  regression  could  be 
applied  to  interval-censored  data  by  defining 
a  dichotomous  outcome  variable  indicating 
subjects’  survival  or  failure  over  each  interval 
of  their  follow-up.  The  data  layout  for  this  type 
of  analysis  allows  multiple  observations  per 
subject,  representing  intervals  of  survival  prior 
to  failure  (or  censorship). 


Binary  regression  can  also  be  used  for  discrete 
survival  analysis  in  which  the  “time-to-event” 
variable  is  considered  discrete  rather  than 
continuous.  The  data  layout  is  similar  to  that 
for  interval-censored  data  except  subjects  are 
conceptualized  as  surviving  discrete  units  of 
time  rather  than  continuous  intervals  of  time. 
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Frailty,  a 

h(t|a)  =  a  h(t) 


multiplicative  from  effect  on  h(t) 
mean  =  1,  varience  =  0 

6  estimated  from  data 


We  concluded  with  a  discussion  of  frailty 
models.  The  frailty  a  is  a  multiplicative  random 
effect  on  the  hazard  designed  to  account  for 
individual-level  unobserved  factors  that  add 
an  extra  layer  of  variability  beyond  what  has 
already  been  specified  in  the  model.  The  frailty 
is  generally  assumed  to  follow  a  distribution 
with  mean  equal  to  1  and  is  typically  para¬ 
meterized  in  terms  of  the  variance  0  which  is 
estimated  from  the  data. 


Chapters 

1 .  Introduction  to  Survival 
Analysis 

2.  Kaplan-Meier  Curves  and  the 
Log-Rank  Test 

3.  The  Cox  Proportional  Hazard 
Model 

4.  Evaluating  the  Proportional 
Hazards  Assumption 

5.  The  Stratified  Cox  Procedure 

6.  Extension  of  the  Cox 
Proportional  Hazards  Model 
for  Time-Dependent 
Covariates 

SI.  Parametric  Survival  Models 


The  presentation  is  now  complete.  The  reader 
can  review  the  detailed  outline  that  follows  and 
then  answer  the  practice  exercises  and  test. 

In  the  next  chapter  (8)  entitled  “Recurrent  Event 
Survival  Analysis,”  we  consider  approaches  for 
analyzing  data  in  which  individuals  may  have 
more  than  one  event  over  the  course  of  their 
follow-up. 


Next: 

8.  Recurrent  Event  Survival 
Analysis 
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Detailed 

Outline 


I.  Overview  (pages  292-294) 

A.  Parametric  Survival  Models 

i.  Outcome  assumed  to  follow  specified 
distribution 

ii.  Weibull,  exponential  (a  special  case  of  the 
Weibull),  log-logistic,  lognormal,  and 
generalized  gamma  are  supported  with 
popular  software  (SAS  and  Stata) 

iii.  Contrasts  with  Cox  model  in  which  baseline 
hazard  and  survival  functions  are  not 
specified 

II.  Probability  Density  Function  in  Relation  to  the 

Hazard  and  Survival  Function  (pages  294-295) 

A.  If  any  one  of  the  hazard  h(t),  survival  S(t),  or 
probability  density  f(t)  functions  is  known  then 
the  other  two  functions  can  be  determined. 

OO 

B.  If  f(t)  is  specified,  then  S(t)  =  f  f(u)du 

t 

C.  If  S(t)  is  specified,  then 

hit)  =  (-d[S(t)]/dt)/S(t)  and 
m  =  i-dism/dt  , 

D.  If  h(t)  is  specified,  then  S(t )  =  exp  I  —  f  h(u)du 

\  o 

and  f(t)  =  h(t)Sit) 

III.  Exponential  Example  (pages  295-297) 

A.  Hazard  is  constant  (i.e.,  not  a  function  of  time) 
in  an  exponential  model 

i.  Stronger  assumption  than  the  PH 
assumption  that  the  HR  is  constant 

B.  Exponential  PH  model  (one  predictor  Xi) 

i.  In  terms  of  the  hazard:  h(t)  =  X  where 
X  =  exp(p0  +  PjXO 

ii.  Hazard  ratio:  HR  (Xi  =  1  vs.  Xi  =  0)  = 
exp(fh) 

IV.  Accelerated  Failure  Time  Assumption 

(pages  298-300) 

A.  Underlying  assumptions 

i.  AFT  —  effect  of  covariates  is  multiplicative 
with  respect  to  survival  time 

ii.  PH  —  effect  of  covariates  is  multiplicative 
with  respect  to  the  hazard 
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B.  The  acceleration  factor  (y)  is  the  key  measure  of 
association  in  an  AFT 

i.  Acceleration  factor  is  a  ratio  of  survival 
times  corresponding  to  any  fixed  value  of 
S(t);  that  is,  tA/tB  where  A  and  B  denote  two 
individuals  for  which  S(tA)  =  S(tB) 

ii.  S2(t)  =  S|(yt),  survival  function  for  Group  1 , 
Si(t)  is  stretched  (or  contracted)  by  a  factor 
of  y  compared  to  survival  function  for 
Group  2,  S2(t) 

C.  AFT  illustration 

i.  Dogs  are  said  to  grow  older  7  times  faster 
than  humans,  SD(t)  =  SH(7t) 

V.  Exponential  Example  Revisited  (pages  300-304) 

A.  Exponential  AFT  model  (one  predictor  Xi) 

i.  In  terms  of  survival:  S(t)  =  exp(— /.t)  where 
X  =  exp[-(a0  +  ai  Xi)] 

ii.  In  terms  of  time: 

t  =  [ — ln(S(t)]  x  exp(a0  +  aA) 

iii.  Acceleration  factor  (Xi  =  1  vs.  X,  =  0), 
y  =  expfaj) 

B.  An  exponential  PH  model  is  an  exponential  AFT 
model  (but  uses  different  parameterization) 

i.  Pj  =  —  aj,  where  Pj  and  y.f  are  PH  and  AFT 
parameterization  for  the  jth  covariate 

ii.  a  >  1  for  (Xj  =  1  vs.  Xj  =  0)  implies  effect  of 
Xj  =  1  is  beneficial  to  survival 

iii.  HR  >  1  for  (Xj  =  1  vs.  Xj  =  0)  implies  effect 
of  Xj  =  1  is  harmful  to  survival 

C.  Exponential  model  is  a  special  case  of  a  Weibull 
model 

i.  Graphical  approach  for  evaluating 

appropriateness  of  exponential  model  is 
described  in  the  section  on  the  Weibull 
example 

VI.  Weibull  Example  (pages  304-309) 

A.  PH  form  of  the  Weibull  model  (one  predictor  Xi) 

i.  In  terms  of  the  hazard:  h(t)  =  Xptp~l  where 
X  =  exp(p0  +  piXj) 

ii.  Hazard  ratio:  HR  (Xj  =  1  vs.  Xj  =  0)  = 
exp(pi) 
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iii.  Weibull  hazard  is  monotonic  with  its 
direction  determined  by  the  value  of  the 
shape  parameter  p 

a.  p  >  1  hazard  increases  over  time 

b.  p  =  1  constant  hazard  (exponential 
model) 

c.  p  <  1  hazard  decreases  over  time 

A.  Graphical  approach  for  evaluating 
appropriateness  of  Weibull  model 

i.  Plot  the  log  negative  log  of  the 

Kaplan-Meier  survival  estimates  against 
the  log  of  time  for  each  pattern  of  covariates 

a.  If  Weibull  assumption  is  correct  then 
plots  should  be  straight  lines  of  slope  p 

b.  If  exponential  assumption  is  correct 
then  plots  should  be  straight  lines  with 
slope  equal  to  one  (p  =  1) 

c.  If  plots  are  parallel  straight  lines  then 
Weibull  PH  and  AFT  assumptions  are 
reasonable 

B.  AFT  form  of  the  Weibull  model  (one  predictor  X,) 

i.  In  terms  of  survival: 

S(t)  =  exp(— )4P)  =  expf-a^f]  where 
X1,p  =  exp[— (a0  +  aiXJ] 

ii.  In  terms  of  time: 

t  =  [ — ln(S(t)] 1/p  x  exp(a0  +  oqXi) 

iii.  Acceleration  factor  (X]  =  1  vs.  X!  =  0), 

Y  =  expfoq) 

C.  A  Weibull  PH  model  is  a  Weibull  AFT  model 
(but  uses  different  parameterization) 

i.  Unique  property  of  Weibull  model 
(exponential  is  special  case,  p  =  1) 

ii.  pj  =  p  where  Pj  and  aj  are  PH  and  AFT 
parameterization,  respectively,  for  the  jth 
covariate 

VII.  Log-Logistic  Example  (pages  309-314) 

A.  Log-logistic  hazard  function: 
h(t)  =  Xptp~ll{\  +  Xtp). 

i.  p  <  1  hazard  decreases  over  time 

ii.  p  >  1  hazard  first  increases  and  then 
decreases  over  time  (unimodal) 
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B.  Graphical  approach  for  evaluating 
appropriateness  of  log-logistic  model 

i.  Plot  the  log  of  the  survival  odds  (using  KM 
estimates)  against  the  log  of  time  for  each 
pattern  of  covariates 

a.  if  log-logistic  assumption  is  correct  then 
plots  should  be  straight  line  of  slope  -p 

b.  If  plots  are  parallel  straight  lines  then 
log-logistic  proportional  odds  (PO)  and 
AFT  assumptions  are  reasonable 

C.  Log-logistic  AFT  model  (one  predictor  Xj): 

i.  In  terms  of  survival: 

S(t)  =  1/(1+L/P)  =  l/(l+(L/1/pt)p)  where 
X1,p  =  exp(— (a0  +  o^XO) 

ii.  In  terms  of  time: 


t  = 


m 


x  exp(  ao  +  aiXi) 


iii.  Acceleration  factor  (X|  =  1  vs.  X,  =  0), 

Y  =  expfaj) 

D.  Log-logistic  proportional  odds  (PO)  model  (one 
predictor  X]) 

i.  In  terms  of  survival:  S(t)  =  1/(1  +  Xtp)  where 
X  =  exp(p0  +  PA) 

ii.  Odds  of  an  event  (failure  odds)  by  time  t: 

(1  -  S(t))/S(t )  =  Xf 

iii.  Odds  of  surviving  event  (survival  odds) 
beyond  t:  S(t)/(1  -  S{t))  =  1  /Xtp 

iv.  Failure  odds  ratio: 

HR  (X:  =  1  vs.  Xj  =  0)  =  explpj) 
a.  PO  assumption  is  that  the  odds  ratio  is 
constant  over  time 

v.  Survival  odds  ratio: 

HR  (X:  =  1  vs.  Xj  =  0)  =  explpj) 
a.  Survival  odds  ratio  is  reciprocal  of 
failure  odds  ratio 


E.  A  log-logistic  AFT  model  is  a  log-logistic 

PO  model  (but  uses  different  parameterization) 

i.  Log-logistic  model  is  not  a  proportional 
hazards  (PH)  model 

ii.  pj  =  --'i-p  where  Pj  and  xj  are  PO  and  AFT 
parameterization  for  the  jth  covariate 

a.  Shape  parameter  with  Stata  is 
parameterized  as  gamma  =  1  Ip 
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VIII.  A  More  General  Form  of  the  AFT  Model 

(pages  314-316) 

A.  General  form  with  one  predictor  (Xj): 
ln(T)  —  c(q  T  tXiXi  r 

B.  Include  additional  parameter,  a: 
ln(T)  =  a0  +  aiXi  +  ere 

C.  Let  a  =  1/p  =>  ln(T)  =  a0  +  aiXt  +  (1/p)  e 

D.  Additive  in  terms  of  ln(T)  but  multiplicative  in 
terms  of  T: 

T  =  exp  (  ao  +  oci  Xi  +  -  e  ) 

V  p  J 

=  exp[a0  +  a]  X[]  x  exp^-e^ 

E.  Collapse  a0  into  baseline  term, let 
T0  =  exp(a0)  exp^e): 

so  T  =  expfajXi)  x  T0 

IX.  Other  Parametric  Models  ( pages  316-318) 

A.  Generalized  gamma  model 

i.  Additional  shape  parameters  give  flexibility 
in  distribution  shape 

ii.  Weibull  and  lognormal  are  special  cases 

B.  Lognormal  model 

i.  ln(T)  follows  a  normal  distribution 

ii.  Accommodates  AFT  model 

C.  Gompertz  model 

i.  PH  model,  not  AFT  model 

D.  Modeling  failure  time  as  an  additive  model 
i.  Additive  model  with  one  predictor: 

T  =  a0  +  ajTRT  +  e  (no  log  link) 

E.  Modeling  ancillary  parameters 

i.  Typically  shape  parameter  p  is  considered  a 
fixed  constant 

ii.  Can  reparameterize  shape  parameter  in 
terms  of  predictor  variables  and  regression 
coefficients 

X.  The  Parametric  Likelihood  (pages  318-321) 

A.  Product  of  each  subject  contribution  (assuming 
independence) 

B.  Subject’s  contribution  uses  probability  density 
function  f(t) 

i.  Subject  contributes  f(t)  if  event  is  observed 
at  time  t 
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ii.  Integrate  over  f(t)  if  subject  is  censored 

a.  Integrate  from  0  to  t  if  subject  is  left- 
censored  at  t 

b.  Integrate  from  t  to  infinity  if  subject  is 
right-censored  at  t 

c.  Integrate  over  interval  of  censorship  if 
subject  is  interval-censored 

XI.  Interval-Censored  Data  (pages  321-326) 

A.  Binary  regression  is  alternative  approach  if  data 
are  interval-censored 

B.  Binary  outcome  variable  represents  survival 
or  failure  over  each  subinterval  of  subject’s 
follow-up 

C.  Specify  a  link  function  when  using  binary 
regression 

i.  Logit  link  for  logistic  regression 

ii.  Complementary  log-log  link  is  an 
alternative  to  logistic  regression 

D.  Discrete  survival  analysis 

i.  Time-to-event  variable  is  discrete 

ii.  Binary  regression  can  be  applied  in  a 
similar  manner  to  that  of  interval-censored 
data 

XII.  Frailty  Models  (pages  326-340) 

A.  The  frailty  a  is  an  unobserved  multiplicative 
effect  on  the  hazard  function 

i.  Hazard,  conditioned  on  the  frailty, 
h(t|a)  =  ah(t) 

ii.  Survival,  conditioned  on  the  frailty, 

S(t|a)  =  S(t)a 

B.  Frailty  assumed  to  follow  some  distribution 
g(a)  of  mean  1  and  variance  9 

i.  The  variance  6  is  a  parameter  estimated  by 
the  data 

ii.  Gamma  distribution  offered  by  Stata  and 
R  software 

C.  Designed  to  account  for  unobserved  individual- 
level  factors  that  influence  survival 

i.  Distinction  is  made  between  the  individual- 
level  and  population-level  hazards.  PH 
assumption  may  hold  on  individual  level 
but  not  on  population  level 
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D.  Shared  frailty  models  allow  individuals  to  share 
the  same  frailty 

i.  Play  similar  role  as  adding  a  random  effect 
to  a  linear  regression 

ii.  Can  account  for  within-group  correlation. 
XIII.  Summary  (pages  341-344) 


Practice  Answer  questions  1  to  5  as  true  or  false  (circle  T  or  F) 

Exercises 

T  F  1.  The  acceleration  factor  comparing  exposed  and 
unexposed  subjects,  (E  =  1  vs.  E  =  0),  is  a  ratio  of 
their  median  survival  times  (time  to  S(t)  =  0.5),  or 
more  generally  the  ratio  of  their  times  to  any  fixed 
value  of  S(t)  =  q. 

T  F  2.  Let  S0(t)  be  the  survival  function  for  unexposed 
subjects  (E  =  0)  and  let  Si(t)  be  the  survival  func¬ 
tion  for  exposed  subjects  (E  =  1).  If  S0(t)  =  Si(3t) 
then  the  median  survival  time  for  the  unexposed 
subjects  is  3  times  longer  than  the  median  sur¬ 
vival  time  for  the  exposed  subjects. 

T  F  3.  The  Cox  proportional  hazards  model  is  a  para¬ 
metric  model. 

T  F  4.  if  the  acceleration  failure  time  (AFT)  assumption 
holds  in  a  Weibull  model  then  the  proportional 
hazards  assumption  also  holds. 

T  F  5.  The  hazard  is  assumed  constant  in  a  log-logistic 
model. 

Questions  6  and  7  make  use  of  the  output  (copied  below) 
presented  in  Sections  III  and  V  containing  an  example  of 
the  exponential  model.  This  example  used  the  remission 
data  with  treatment  status  (coded  TRT  =  1  for  the  experi¬ 
mental  treatment  and  TRT  =  0  for  the  placebo).  The  expo¬ 
nential  survival  and  hazard  functions  are,  respectively, 
S(t)  =  exp(— At)  and  h(t)  =  A  where  A,  =  exp[-(a0  +  oqTRT)] 
for  the  AFT  parameterization  and  A  =  exp(p0  +  P  i  TRT )  for 
the  PH  parameterization.  The  output  for  both  the  AFT  and 
PH  forms  of  the  model  are  presented. 


Exponential  regression  Exponential  regression  log 

accelerated  failure-time  form  relative-hazard  form 

A  =  exp[— (a0  +  ajTRT)]  A  =  exp(p0  +  PiTRT) 


_t 

Coef. 

Std.  Err. 

z 

p>|z| 

_t 

Coef. 

Std.  Err. 

z  P>lzl 

trt 

1.527 

.398 

3.83 

0.00 

trt 

-1.527 

.398 

3.83 

0.00 

_cons 

2.159 

.218 

9.90 

0.00 

_cons 

-2.159 

.218 

-9.90 

0.00 
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6.  In  this  chapter  it  was  shown  in  an  exponential  model 
that  the  time  to  event  is  t  =  [—  log(S(t)]  x  (1/k)  given  a 
fixed  value  of  S(t).  Use  the  output  from  the  AFT  form 
of  the  model  to  estimate  the  median  survival  time 
(in  weeks)  for  the  treated  group  (TRT  =1)  and  the 
placebo  group  (TRT  =  0). 

7.  Use  the  output  from  the  PH  form  of  the  model  to 
estimate  the  median  survival  time  for  the  treated 
group  (TRT  =1)  and  the  placebo  group  (TRT  =  0). 
Notice  the  answers  from  Questions  6  and  7  are  the 
same,  illustrating  that  the  AFT  and  PH  forms  of  the 
exponential  model  are  just  different  parameterizations 
of  the  same  model. 

Questions  8  to  1 1  refer  to  a  log-logistic  AFT  model  using 
the  data  from  the  Veteran’s  Administration  Lung  Cancer 
Trial.  The  exposure  of  interest  is  treatment  status  TX  (stan¬ 
dard  =  1,  test  =  2).  The  control  variables  are  performance 
status  (PERF),  disease  duration  (DD),  AGE,  and  prior  ther¬ 
apy  (PRIORTX).  These  predictors  are  used  in  the  section 
on  frailty  models.  The  outcome  is  time  to  death  (in  days). 
The  output  is  shown  below. 

Log-logistic  regression  —  accelerated  failure-time  form 

LR  chi2(5)  =  61.31 


Log  likelihood  =  —200.196  Prob  >  chi2  =  0.0000 


_t 

Coef. 

Std.  Err. 

z 

P>|z| 

tx 

-.054087 

.1863349 

-0.29 

0.772 

perf 

.0401825 

.0046188 

8.70 

0.000 

dd 

.0042271 

.0095831 

0.44 

0.659 

age 

.0086776 

.0092693 

0.94 

0.349 

priortx 

.0032806 

.0225789 

0.15 

0.884 

_cons 

1.347464 

.6964462 

1.93 

0.053 

/ln_gam 

-.4831864 

.0743015 

-6.50 

0.000 

gamma 

.6168149 

.0458303 

8.  State  the  AFT  log-logistic  model  in  terms  of  S(t)  (note 
gamma  =  1  Ip). 

9.  Estimate  the  acceleration  factor  y  with  a  95%  confi¬ 
dence  interval  comparing  the  test  and  standard  treat¬ 
ment  (TX  =  2  vs.  TX  =1).  Interpret  your  answer. 
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10.  The  AFT  log-logistic  model  is  also  a  proportional  odds 
model.  Use  the  output  to  estimate  the  odds  ratio  (odds 
of  death)  comparing  the  test  and  standard  treatment. 
Also  estimate  the  survival  odds  ratio  comparing  the 
test  and  standard  treatment. 

11.  The  Akaike  Information  Criterion  (AIC)  is  a  method 
designed  to  compare  the  fit  of  different  models.  For 
this  question,  three  models  are  compared  using  the 
same  5  predictors: 

1.  A  Weibull  model  without  frailty  (presented  as 
Model  1  in  the  section  on  frailty  models); 

2.  A  Weibull  model  containing  a  frailty  component 
(presented  as  Model  2  in  the  section  on  frailty 
models);  and 

3.  The  log-logistic  model  presented  above. 

Below  is  a  table  containing  the  log  likelihood  statistic  for 
each  model. 


Model 

Frailty 

Number  of 
parameters 

Log  likelihood 

1.  Weibull 

No 

7 

-206.204 

2.  Weibull 

Yes 

8 

-200.193 

3.  Log-logistic 

No 

7 

-200.196 

The  goal  for  this  question  is  to  calculate  the  AIC  statistic 
for  each  model  and  select  the  model  based  on  this  crite¬ 
rion.  The  AIC  statistic  is  calculated  as:  —2  log  likelihood 
+  2p  (where  p  is  the  number  of  parameters  in  the  model). 
A  smaller  AIC  statistic  suggests  a  better  fit.  The  addition  of 
2  times  p  can  be  thought  of  as  a  penalty  if  nonpredictive 
parameters  are  added  to  the  model.  Each  model  contains 
the  5  predictors,  an  intercept,  and  a  shape  parameter. 
Model  2  contains  an  additional  variance  parameter  (theta) 
because  a  frailty  component  is  included  in  the  model.  The 
log  likelihood  was  unchanged  when  a  frailty  component 
was  added  to  the  log-logistic  model  (not  shown  in  table). 

Note  that  if  we  are  just  comparing  Models  1  and  2  we  could 
use  the  likelihood  ratio  test  because  Model  1  is  nested 
(contained)  in  Model  2.  The  likelihood  ratio  test  is  consid¬ 
ered  a  superior  method  to  the  AIC  for  comparing  models 
but  cannot  be  used  to  compare  the  log-logistic  model  to  the 
other  two,  because  that  model  uses  a  different  distribution. 

Which  of  the  three  models  should  be  selected  based  on 
the  AIC? 
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Questions  12  to  14  refer  to  a  generalized  gamma  model 
using  the  Veterans’  data  with  the  same  five  predictor  vari¬ 
ables  that  were  used  in  the  model  for  Questions  8  to  10.  The 
generalized  gamma  distribution  contains  two  shape  para¬ 
meters  (kappa  and  sigma)  that  allow  great  flexibility  in  the 
shape  of  the  hazard.  If  kappa  =  1,  the  model  reduces  to  a 
Weibull  distribution  with  p  =  1/sigma.  If  kappa  =  0  the 
model  reduces  to  a  lognormal  distribution.  The  output  is 
shown  below. 

Gamma  regression  —  accelerated  failure-time  form 


Log  likelihood 

=  -200.626 

LR  chi2(5)  = 
Prob  >  chi2 

=  52.86 
=  0.0000 

_t 

Coef. 

Std.  Err. 

z 

p>|z| 

tx 

-.131 

.1908 

-0.69 

0.491 

perf 

.039 

.0051 

7.77 

0.000 

dd 

.0004 

.0097 

0.04 

0.965 

age 

.008 

.0095 

0.89 

0.376 

priortx 

.004 

.0229 

0.17 

0.864 

_cons 

1.665 

.7725 

2.16 

0.031 

/ln_sig 

.0859 

.0654 

1.31 

0.189 

/kappa 

.2376 

.2193 

1.08 

0.279 

sigma 

1.0898 

.0714 

12.  Estimate  the  acceleration  factor  y  with  a  95%  confi¬ 
dence  interval  comparing  the  test  and  standard  treat¬ 
ment  (TX  =  2  vs.  TX  =  1). 

13.  Use  the  output  to  test  the  null  hypothesis  that  a  log¬ 
normal  distribution  is  appropriate  for  this  model. 

14.  A  lognormal  model  was  run  with  the  same  five  pre¬ 
dictors  (output  not  shown)  and  yielded  very  similar 
parameter  estimates  to  those  obtained  from  the 
generalized  gamma  model  shown  above.  The  value 
of  the  log  likelihood  for  the  lognormal  model  was 
—201.210.  Compare  the  AIC  of  the  generalized  gamma 
model,  the  lognormal  model,  and  the  log-logistic  model 
from  Question  11  and  select  a  model  based  on  that 
criterion.  Note:  each  model  contains  an  intercept  and 
five  predictors.  The  generalized  gamma  distribution 
contains  two  additional  shape  parameters  and  the  log- 
logistic  and  lognormal  distributions  each  contain  one 
additional  shape  parameter  (see  Question  1 1  for  further 
details  on  the  AIC). 
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Questions  15  to  17  refer  to  a  Weibull  model  using  the 
remission  data  with  treatment  as  the  only  predictor 
(coded  TRT  =  1  for  the  test  treatment  and  TRT  =  0  for 
the  placebo).  In  this  model  both  X  and  p  are  modeled  as 
functions  of  the  predictor  TRT.  The  model  can  be  stated  in 
terms  of  the  hazard  function:  h(t)  =  Xptp~  1  where  X  =  exp 
(Po  +  Pi  TRT)  and  p  =  exp(<50  +  <h  TRT).  Typically,  the 
shape  parameter  in  a  Weibull  model  is  assumed  constant 
(i.e.,  t>i  =  0)  across  levels  of  covariates.  This  model  is 
discussed  in  the  section  of  this  chapter  called  “Other  Para¬ 
metric  Models.”  The  output  obtained  using  Stata  is  shown 
below. 

Weibull  regression  —  log  relative-hazard  form 

LR  chi2(l)  =  1.69 

Log  likelihood  =  —47.063396  Prob  >  chi2  =  0.1941 


_t 

Coef. 

Std.  Err. 

z 

P>M 

_t 

trt 

-1.682 

1.374 

-1.22 

0.221 

_cons 

-3.083 

.646 

-4.77 

0.000 

ln_p 

trt 

-.012 

.328 

-0.04 

0.970 

_cons 

.315 

.174 

1.82 

0.069 

15.  Even  though  X  is  parameterized  similarly  to  that  in  a 
PH  Weibull  model,  this  model  is  not  a  PH  model 
because  the  shape  parameter  p  varies  across  treat¬ 
ment  groups.  Show  the  PH  assumption  is  violated 
in  this  model  by  estimating  the  hazard  ratios  for 
TRT  =  0  vs.  TRT  =  1  after  10  weeks  and  after  20  weeks 
of  follow-up. 

16.  Perform  a  statistical  test  on  the  hypothesis  §i  =  0  (the 
coefficient  for  the  treatment  term  for  ln(p)).  Note: 
if  we  assume  Sj  =  0,  then  the  model  reduces  to  the 
example  of  the  Weibull  PH  model  presented  in  Section 
VI  of  this  chapter. 

17.  Consider  the  plot  of  the  log  negative  log  of  the 
Kaplan-Meier  survival  estimates  against  the  log  of 
time  for  TRT  =  1  and  TRT  =  0.  How  should  the 
graph  look  if  §!  =  0? 
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JggJ  Answer  the  following  true  or  false  questions  (circle  T 

orF). 

T  F  1.  The  accelerated  failure  time  model  and  propor¬ 
tional  hazards  model  are  both  additive  models. 

T  F  2.  If  the  survival  function  is  known  then  the  hazard 
function  can  be  ascertained  (and  vice  versa). 

T  F  3.  If  survival  time  follows  a  Weibull  distribution 
then  a  plot  of  the  ln[-ln  S(t)]  against  ln(t)  should 
be  a  straight  line. 

T  F  4.  If  the  acceleration  failure  time  (AFT)  assumption 
holds  in  a  log-logistic  model  then  the  propor¬ 
tional  hazards  assumption  also  holds. 

T  F  5.  If  the  acceleration  factor  for  the  effect  of  an  expo¬ 
sure  (exposed  vs.  unexposed)  is  greater  than  one, 
then  the  exposure  is  harmful  to  survival. 

T  F  6.  Let  S0(t)  be  the  survival  function  for  unexposed 
subjects  (E  =  0)  and  let  Si(t)  be  the  survival 
function  for  exposed  subjects  (E  =  1).  If  y  is  the 
acceleration  factor  comparing  E  =  1  vs.  E  = 
0  then  S0(t)  =  Si(yt). 

T  F  7.  Frailty  models  are  designed  to  provide  an 
approach  to  account  for  unobserved  individual- 
level  characteristics. 

T  F  8.  If  you  include  a  gamma  distributed  frailty  com¬ 
ponent  to  the  model,  then  you  will  see  an  addi¬ 
tional  parameter  estimate  for  the  variance  of  the 
frailty  in  the  model  output. 

T  F  9.  If  survival  time  T  follows  a  Weibull  distribution, 
then  ln(T)  also  follows  a  Weibull  distribution. 

T  F  10.  If  a  subject  is  lost  to  follow-up  after  5  years,  then 
the  subject  is  left-censored. 

Questions  11  to  17  refer  to  a  Weibull  model  run  with  the 
“addicts”  dataset.  The  predictor  of  interest  is  CLINIC 
(coded  1  or  2)  for  two  methadone  clinics  for  heroin  addicts. 
Covariates  include  DOSE  (continuous)  for  methadone 
dose  (mg/day),  PRISON  (coded  1  if  patient  has  a  prison 
record  and  0  if  not),  and  a  prison-dose  product  term 
(called  PRISDOSE).  The  outcome  is  time  (in  days)  until 
the  person  dropped  out  of  the  clinic  or  was  censored.  The 
Weibull  survival  and  hazard  functions  are,  respectively, 
S(t)  =  exp(— Xtp)  and  h(t)  =  'kpf  ~  1  where  L1/p  =  exp[— (a0 
+  o^CLINIC  +  a2PRISON  +  a3DOSE  +  04PRISDOSE)]  for 
the  AFT  parameterization  and  X  =  exp[p0  +  PiCLINIC  + 
P2PRISON  +  P3DOSE  +  P4PRISDOSE]  for  the  PH  parame¬ 
terization.  The  Stata  output  for  both  the  AFT  and  PH  forms 
of  the  model  are  presented  as  follows: 


Test  357 


Weibull  regression  Weibull  regression 

accelerated  failure-time  form  log  relative-hazard  form 

Log  likelihood  =  —260.74854  Log  likelihood  =  —260.74854 


_t 

Coef. 

Std. 

Err. 

z 

P  > 

|z| 

_t 

Coef. 

Std. 

Err. 

z 

P  > 

M 

clinic 

.698 

.158 

4.42 

0.000 

clinic 

-.957 

.213 

-4.49 

0.000 

prison 

.145 

.558 

0.26 

0.795 

prison 

-.198 

.765 

-0.26 

0.795 

dose 

.027 

.006 

4.60 

0.000 

dose 

-.037 

.008 

-4.63 

0.000 

prisdose 

-.006 

.009 

-0.69 

0.492 

prisdose 

.009 

.013 

0.69 

0.491 

_cons 

3.977 

.376 

10.58 

0.000 

_cons 

-5.450 

.702 

-7.76 

0.000 

/ln_p 

.315 

.068 

4.67 

0.000 

/ln.p 

.315 

.068 

4.67 

0.000 

P 

1.370467 

P 

1.370467 

1/p 

.729678 

1/p 

.729678 

1 1 .  Estimate  the  acceleration  factor  with  a  95%  confidence 
interval  comparing  CLINIC  =  2  vs.  CLINIC  =  1 .  Inter¬ 
pret  this  result. 

12.  Estimate  the  hazard  ratio  with  a  95%  confidence 
interval  comparing  CLINIC  =  2  vs.  CLINIC  =  1.  Inter¬ 
pret  this  result. 

13.  Estimate  the  coefficient  for  CLINIC  in  the  PH  Weibull 
model  using  the  results  reported  in  the  output  from 
the  AFT  form  of  the  model.  Hint:  the  coefficients  for 
a  Weibull  PH  and  AFT  model  are  related  (3 iy  =  -v.jp  for 
the  jth  covariate. 

14.  Is  the  product  term  PRISDOSE  included  in  the  model 
to  account  for  potential  interaction  or  potential  con¬ 
founding  of  the  effect  of  CLINIC  on  survival? 

15.  Use  the  output  to  estimate  the  median  survival  time 
for  a  patient  from  CLINIC  =  2  who  has  a  prison  record 
and  receives  a  methadone  dose  of  50  mg/day.  Hint: 
use  the  relationship  that  t  =  [-In  S(t)]1/p  x  ( l/X1/p)  for 
a  Weibull  model. 

16.  Use  the  output  to  estimate  the  median  survival  time 
for  a  patient  from  CLINIC  =  1  who  has  a  prison  record 
and  receives  a  methadone  dose  of  50  mg/day. 

17.  What  is  the  ratio  of  your  answers  from  Questions 
1 5  and  1 6  and  how  does  this  ratio  relate  to  the  accel¬ 
eration  factor? 
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Questions  18  and  19  refer  to  the  Weibull  model  (in  AFT 
form)  that  was  used  for  the  previous  set  of  questions 
(Questions  11  to  17).  The  only  difference  is  that  a  frailty 
component  is  now  included  in  the  model.  A  gamma  distri¬ 
bution  of  mean  1  and  variance  theta  is  assumed  for  the 
frailty.  The  output  shown  on  in  the  following  contains  one 
additional  parameter  estimate  (for  theta). 

Weibull  regression 
accelerated  failure-time  form 
Gamma  frailty 


Log  likelihood  =  —260.74854 


_t 

Coef. 

Std.  Err. 

z 

p  >  M 

clinic 

.698 

.158 

4.42 

0.000 

prison 

.145 

.558 

0.26 

0.795 

dose 

.027 

.006 

4.60 

0.000 

prisdose 

-.006 

.009 

-0.69 

0.492 

_cons 

3.977 

.376 

10.58 

0.000 

/ln_p 

.315 

.068 

4.67 

0.000 

P 

1.370467 

1/p 

.729678 

theta 

.00000002 

.0000262 

Likelihood  ratio  test  of  theta=0: 
chibar2(01)  =  0.00 
Prob>=chibar2  =  1.000 

18.  Did  the  addition  of  the  frailty  component  change  any 
of  the  other  parameter  estimates  (besides  theta)?  Did 
it  change  the  log  likelihood? 

19.  A  likelihood  ratio  test  for  the  hypothesis  H0:  theta  = 
0  yields  a  p-value  of  1.0  (bottom  of  the  output).  The 
parameter  estimate  for  theta  is  essentially  zero.  What 
does  it  mean  if  theta  =  0? 
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Answers  to 

Practice 

Exercises 


1.  T 

2.  F:  The  median  survival  time  for  the  unexposed  is  1/3  of 
the  median  survival  time  for  the  exposed. 

3.  F:  The  Cox  model  is  a  semiparametric  model.  The 
distribution  of  survival  time  is  unspecified  in  a  Cox 
model. 

4.  T 

5.  F:  The  hazard  is  assumed  constant  in  an  exponential 
model. 

6.  t  =  [ — log(S(t)]  x  (1/A.),  where  S(t)  =  0.5,  and  1/A  =  exp 
(a0  +  «i  TRT). 

For  TRT  =  0:  estimated  median  survival  =  [— ln(0.5)] 
exp(2.159)  =  6.0  weeks. 

For  TRT  =  1:  estimated  median  survival  =  [— ln(0.5)] 
exp(2.159  +  1.527)  =  27.6  weeks. 

7.  t  =  [ — log(S(t)]  (1/A.),  where  S(t)  =  0.5,  and 

A.  =  exp(p0  +  P!  TRT)  =>  1/A,  =  exp[-(p0  +  piTRT)]. 
For  TRT  =  0:  estimated  median  survival  =  [— ln(0.5)] 
exp[— (— 2.159)]  =  6.0  weeks. 

For  TRT  =  1:  estimated  median  survival  =  [— ln(0.5)] 
exp[— (— 2.159  -  1.527)]  =  27.6  weeks. 

8.  S(t)  =  1/(1  +  A tp)  where  A1/p  =  exp[-(a0  +  <x.\  TX  + 
a2  PERF  +  a3DD  +  a4AGE  +  a5PRIORTX)]. 

g  exp[a0  +  (2)  +  a2  PERF  +  a3  DD  +  a  4  AGE  +  a5  PRIORTX] 

'  exp[  a0  +  ai(l)  +  a  2PERF  +  ot3Z)£>+  a  a,  AGEE  0(5  PRIORTX] 
=  exp(  aj) 

f  =  exp(— 0.054087)  =  0.95 

95%  Cl  =  exp[— 0.054087  ±  1.96(0.1863349)]  =  (0.66,  1.36) 

The  point  estimate  along  with  the  95%  Cl  suggests  a 
null  result. 

10.  The  coefficients  for  a  log-logistic  proportional  odds 
(PO)  and  AFT  model  are  related  pi  =  — ai  p  =  —  pi/ 
gamma,  where  p3  is  the  coefficient  for  TX  in  a  PO 
model. 

OR  =  exp(— «]/gamma) 

estimated  OR  =  exp(-0.054087/0.6168149)  =  0.92 
estimated  survival  OR  =  l/[exp(— 0.054087/0.6168149)] 
=  1.09. 
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1 1 .  The  AIC  statistic  is  calculated  as  —2  log  likelihood  +2p 
(where  p  is  the  number  of  parameters  in  the  model).  A 
smaller  AIC  statistic  suggests  a  better  fit.  The  AIC 
statistic  is  shown  below  for  each  of  the  three  models. 


Model 

Number  of 
Frailty  parameters 

Log 

likelihood 

AIC 

1.  Weibull 

No 

7 

—206.204 

426.408 

2.  Weibull 

Yes 

8 

-200.193 

416.386 

3.  Log-logistic  No 

7 

-200.196 

414.392 

Based  on  the  AIC,  the  log-logistic  model  is  selected 
yielding  the  smallest  AIC  statistic  at  414.392. 


12  exp  [o?o  +  ai(2)  +  a  2PERF  +  a  3DD  +  a  4AGE  +  a  ^PRIORTX] 

exp[  ao  +  «i(l)  +  CL2PERF  +  0C3 DD  +  04 AGE  +  a$PRIORTX\ 
=  exp(  ai) 

y  =  exp(— 0.131)  =  0.88 

95%  Cl  =  exp[(— 0.131  ±  1.96(0.1908)]  =  (0.60,  1.28) 

13.  The  generalized  gamma  distribution  reduces  to  a  log¬ 
normal  distribution  if  kappa  =  0. 

Ho  :  kappa  =  0 

0.2376 

Wald  test  statistic:  z  = - =  1.08  (from  output) 

0.2193  V  P 

p-value:  0.279  (from  output) 

Conclusion:  p-value  not  significant  at  a  significance 
level  of  0.05.  Not  enough  evidence  to  reject  H0.  The 
lognormal  distribution  may  be  appropriate. 

14.  The  AIC  statistic  is  shown  below  for  the  generalized 
gamma,  lognormal,  and  log-logistic  models. 


Model 

Number  of 
parameters 

Log 

likelihood 

AIC 

Generalized  Gamma 

8 

-200.626 

417.252 

Lognormal 

7 

-201.210 

416.420 

Log-logistic 

7 

200.196 

414.392 

As  in  Question  1 1 ,  the  log-logistic  model  is  selected 
yielding  the  smallest  AIC  at  414.392. 
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15.  h(t)  =  Xptp  1  where  /.  =  exp(p0  +  Pi  TRT)  and  /;  =  exp 
(50  +  §iTRT) 

let  Xq  =  exp[p0  +  pi(0)],  X.!  =  exp[p0  +  Pi(l)]letp0  =  exp 
[§o  +  8 1.(0)],  p i  =  exp[80  +  5i(l)]l0  =0.0458,  i\  = 
0.0085,  po  =1.3703,  px  =  1.3539  (calculated  using  out¬ 
put) 


HR  (TRT  =  0  vs.  TRT  =  1)  = 


A oPotp°  1 


///has  a  function  of  t)  = 


hpitp'~l 

(0.0458)  (1.3703)/0'3703 
(0.0085)  (1.3539)/0-3539 


mu  -  mi  -  (Q-0458)(i.3703)(io°-3703)  _ 
^  ^  (0.0085)  (1. 3539)  (10°-3539) 


HR{  t  =  20)  = 


(0.0458)  (1. 3703)  (20°'3703) 
(0.0085)  (1. 3539)  (20°-3539) 


=  5.73 


The  estimated  hazard  ratios  for  RX  at  10  weeks  and  at 
20  weeks  are  different,  demonstrating  that  the 
hazards  are  not  constrained  to  be  proportional  in 
this  model.  However,  the  estimated  hazard  ratios  are 
just  slightly  different,  suggesting  that  the  PH  assump¬ 
tion  is  probably  reasonable. 

16.  H0:  §!  =  0 

—0  0123083 

Wald  test  statistic  :  z  =  — - — - =  —0.04  (from  output) 

0.328174  v  '  ’ 


p-value:  0.970  (from  output) 

Conclusion:  p-value  is  not  significant.  No  evidence  to 
reject  H0.  The  PH  assumption  is  reasonable. 

17.  If  the  Weibull  assumption  is  met,  then  the  plots 
should  be  straight  lines  with  slope  p.  If  82  =  0,  then 
the  slope  p  is  the  same  for  TRT  =  1  and  TRT  =  0  and 
the  lines  are  parallel. 


Recurrent 

Event 

Survival 

Analysis 
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Introduction 


Abbreviated 

Outline 


This  chapter  considers  outcome  events  that  may  occur 
more  than  once  over  the  follow-up  time  for  a  given  subject. 
Such  events  are  called  “recurrent  events.”  Modeling  this 
type  of  data  can  be  carried  out  using  a  Cox  PH  model  with 
the  data  layout  constructed  so  that  each  subject  has  a  line 
of  data  corresponding  to  each  recurrent  event.  A  variation 
of  this  approach  uses  a  stratified  Cox  PH  model,  which 
stratifies  on  the  order  in  which  recurrent  events  occur. 
Regardless  of  which  approach  is  used,  the  researcher 
should  consider  adjusting  the  variances  of  estimated 
model  coefficients  for  the  likely  correlation  among  recur¬ 
rent  events  on  the  same  subject.  Such  adjusted  variance 
estimates  are  called  “robust  variance  estimates.”  A  para¬ 
metric  approach  for  analyzing  recurrent  event  data  that 
includes  a  frailty  component  (introduced  in  Chapter  7)  is 
also  described  and  illustrated. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Overview  (page  366) 

II.  Examples  of  Recurrent  Event  Data 

(pages  366-368) 

III.  Counting  Process  Example  (pages  368-369) 

IV.  General  Data  Layout:  Counting  Process 

Approach  (pages  370-371) 

V.  The  Counting  Process  Model  and  Method 

(pages  372-376) 

VI.  Robust  Estimation  (pages  376-378) 

VII.  Results  for  CP  Example  (pages  378-379) 

VIII.  Other  Approaches  Stratified  Cox  (pages  379-385) 

IX.  Bladder  Cancer  Study  Example  (Continued) 

(pages  385-389) 

X.  A  Parametric  Approach  Using  Shared  Frailty 

(pages  389-391) 

XI.  A  Second  Example  (pages  391-395) 

XII.  Survival  Curves  with  Recurrent  Events 

(pages  395-398) 

XIII.  Summary  (pages  398-401) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  examples  of  recurrent  event  data. 

2.  State  or  recognize  the  form  of  the  data  layout  used  for 
the  counting  process  approach  for  analyzing  correlated 
data. 

3.  Given  recurrent  event  data,  outline  the  steps  needed  to 
analyze  such  data  using  the  counting  process 
approach. 

4.  State  or  recognize  the  form  of  the  data  layout  used  for 
the  Stratified  Cox  (SC)  approaches  for  analyzing 
correlated  data. 

5.  Given  recurrent  event  data,  outline  the  steps  needed  to 
analyze  such  data  using  the  SC  approaches. 
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Presentation 


I.  Overview 


In  this  chapter  we  consider  outcome  events  that 
may  occur  more  than  once  over  the  follow¬ 
up  time  for  a  given  subject.  Such  events  are 
called  “recurrent  events.”  We  focus  on  the 
Counting  Process  (CP)  approach  for  analysis 
of  such  data  that  uses  the  Cox  PH  model,  but 
we  also  describe  alternative  approaches  that  use 
a  Stratified  Cox  (SC)  PH  model  and  a  frailty 
model. 


II.  Examples  of  Recurrent 
Event  Data 

1 .  Multiple  relapses  from 
remission  -  leukemia  patients 

2.  Repeated  heart  attacks  - 
coronary  patients 

3 .  Recurrence  of  tumors  -  bladder 
cancer  patients 

4.  Deteriorating  episodes  of  visual 
acuity  -  macular  degeneration 
patients 


Up  to  this  point,  we  have  assumed  that  the  event 
of  interest  can  occur  only  once  for  a  given 
subject.  However,  in  many  research  scenarios 
in  which  the  event  of  interest  is  not  death,  a 
subject  may  experience  an  event  several  times 
over  follow-up.  Examples  of  recurrent  event 
data  include: 

1.  Multiple  episodes  of  relapses  from 
remission  comparing  different  treatments 
for  leukemia  patients. 

2.  Recurrent  heart  attacks  of  coronary 
patients  being  treated  for  heart  disease. 

3.  Recurrence  of  bladder  cancer  tumors  in  a 
cohort  of  patients  randomized  to  one  of  two 
treatment  groups. 

4.  Multiple  events  of  deteriorating  visual 
acuity  in  patients  with  baseline  macular 
degeneration,  where  each  recurrent  event  is 
considered  a  more  clinically  advanced  stage 
of  a  previous  event. 


Objective 

Assess  relationship  of  predictors  For  each  of  the  above  examples,  the  event  of 
to  rate  of  occurrence,  allowing  for  interest  differs,  but  nevertheless  may  occur 
multiple  events  per  subject  more  than  once  per  subject.  A  logical  objective 

for  such  data  is  to  assess  the  relationship  of 
relevant  predictors  to  the  rate  in  which  events 
are  occurring,  allowing  for  multiple  events  per 
subject. 
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LEUKEMIA  EXAMPLE 


Do  treatment  groups  differ  in  rates  of 
relapse  from  remission? 


In  the  leukemia  example  above,  we  might  ask 
whether  persons  in  one  treatment  group  are 
experiencing  relapse  episodes  at  a  higher  rate 
than  persons  in  a  different  treatment  group. 


HEART  ATTACK  EXAMPLE 


Do  smokers  have  a  higher  heart  attack 
rate  than  nonsmokers? 


If  the  recurrent  event  is  a  heart  attack,  we 
might  ask,  for  example,  whether  smokers  are 
experiencing  heart  attack  episodes  at  a  higher 
rate  than  nonsmokers. 


LEUKEMIA  AND  HEART  ATTACK 
EXAMPLES 


All  events  are  of  the  same  type 
The  order  of  events  is  not  important 
Heart  attacks:  Treat  as  identical 
events; 

Don’t  distinguish  among  1st,  2nd,  3rd, 
etc.  attack 


For  either  of  the  above  two  examples,  we  are 
treating  all  events  as  if  they  were  the  same 
type.  That  is,  the  occurrence  of  an  event  on  a 
given  subject  identifies  the  same  disease  with¬ 
out  considering  more  specific  qualifiers  such 
as  severity  or  stage  of  disease.  We  also  are  not 
taking  into  account  the  order  in  which  the 
events  occurred. 


For  example,  we  may  wish  to  treat  all  heart 
attacks,  whether  on  the  same  or  different  sub¬ 
jects,  as  identical  types  of  events,  and  we  don’t 
wish  to  identify  whether  a  given  heart  attack 
episode  was  the  first,  or  the  second,  or  the  third 
event  that  occurred  on  a  given  subject. 


BLADDER  CANCER  EXAMPLE 


Compare  overall  tumor  recurrence 
rate  without  considering  order  or  type 
of  tumor 


The  third  example,  which  considers  recurrence 
of  bladder  cancer  tumors,  can  be  considered 
similarly.  That  is,  we  may  be  interested  in  asses¬ 
sing  the  “overall”  tumor  recurrence  rate  without 
distinguishing  either  the  order  or  type  of  tumor. 


MACULAR  DEGENERATION  OF 
VISUAL  ACUITY  EXAMPLE 


A  second  or  higher  event  is  more 
severe  than  its  preceding  event 


Order  of  event  is  important 


The  fourth  example,  dealing  with  macular  degen¬ 
eration  events,  however,  differs  from  the  other 
examples.  The  recurrent  events  on  the  same  sub¬ 
ject  differ  in  that  a  second  or  higher  event  indi¬ 
cates  a  more  severe  degenerative  condition  than 
its  preceding  event. 

Consequently,  the  investigator  in  this  scenario 
may  wish  to  do  separate  analyses  for  each 
ordered  event  in  addition  to  or  instead  of  treat¬ 
ing  all  recurrent  events  as  identical. 
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Use  a  different  analysis  depending 
on  whether 

a.  recurrent  events  are  treated  as 
identical 

b.  recurrent  events  involve 
different  disease  categories 
and/or  the  order  of  events  is 
important 


We  have  thus  made  an  important  distinction  to 
be  considered  in  the  analysis  of  recurrent  event 
data.  If  all  recurrent  events  on  the  same  sub¬ 
ject  are  treated  as  identical,  then  the  analysis 
required  of  such  data  is  different  than  what  is 
required  if  either  recurrent  events  involve  dif¬ 
ferent  disease  categories  and/or  the  order  that 
events  reoccur  is  considered  important. 


Recurrent  events  identical 

11 

Counting  Process  Approach 

(Andersen  et  al.,  1993) 


The  approach  to  analysis  typically  used  when 
recurrent  events  are  treated  as  identical  is 
called  the  Counting  Process  Approach 
(Andersen  et  al.,  1993). 


Recurrent  events:  different  disease 
categories  or  event  order  important 

11 

Stratified  Cox  (SC)  Model 
Approaches 


When  recurrent  events  involve  different  dis¬ 
ease  categories  and/or  the  order  of  events  is 
considered  important,  a  number  of  alternative 
approaches  to  analysis  have  been  proposed 
that  involve  using  stratified  Cox  (SC)  models. 


In  this  chapter,  we  focus  on  the  Counting 
Process  (CP)  approach,  but  also  describe  the 
other  stratified  Cox  approaches  (in  a  later 
section). 


III.  Counting  Process 
Example 


Table  8.1.  2  Hypothetical  Subjects 
Bladder  Cancer  Tumor  Events 


Time 

interval 

Event 

indicator 

Treatment 

group 

Al 

0  to  3 

1 

i 

3  to  9 

1 

i 

9  to  21 

1 

i 

21  to  23 

0 

i 

Hal 

0  to  3 

1 

0 

3  to  15 

1 

0 

15  to  25 

1 

0 

To  illustrate  the  counting  process  approach, 
we  consider  data  on  two  hypothetical  subjects 
(Table  8.1),  Al  and  Hal,  from  a  randomized 
trial  that  compares  two  treatments  for  bladder 
cancer  tumors. 

Al  gets  recurrent  bladder  cancer  tumors  at 
months  3,9,  and  21,  and  is  without  a  bladder 
cancer  tumor  at  month  23,  after  which  he  is 
no  longer  followed.  Al  received  the  treatment 
coded  as  1 . 

Hal  gets  recurrent  bladder  cancer  tumors  at 
months  3,  15,  and  25,  after  which  he  is  no 
longer  followed.  Hal  received  the  treatment 
coded  as  0. 
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Al 

Hal 

No.  recurrent 
events 

3 

3 

Follow-up  time 

23  months 

25  months 

Event  times 
from  start  of 
follow-up 

3,  9,  21 

3,  15,  25 

Additional 
months  of 
follow-up 
after  last 
event 

2  months 

0  months 

Table  8.2.  Example  of  Data  Layout  for 


Counting  Process  Approach 


Subj 

Interval 

Number 

Time 

Start 

Time 

Stop 

Event 

Status 

Treatment 

Group 

Al 

1 

0 

3 

1 

1 

Al 

2 

3 

9 

1 

1 

Al 

3 

9 

21 

1 

1 

Al 

4 

21 

23 

0 

1 

Hal 

1 

0 

3 

1 

0 

Hal 

2 

3 

15 

1 

0 

Hal 

3 

15 

25 

1 

0 

Counting  process:  Start  and  Stop 
times 

Standard  layout:  only  Stop 
(survival)  times  (no  recurrent 
events) 


Interval 

Time 

Time 

Event 

Treatment 

Subj 

Number 

Start 

Stop 

Status 

Group 

Sal 

1 

0 

17 

1 

0 

Mai 

1 

0 

12 

0 

1 

A1  has  experienced  3  events  of  the  same  type 
(i.e.,  recurrent  bladder  tumors)  over  a  follow-up 
period  of  23  months.  Hal  has  also  experienced  3 
events  of  the  same  type  over  a  follow-up  period 
of  25  months. 

The  three  events  experienced  by  A1  occurred  at 
different  survival  times  (from  the  start  of  initial 
follow-up)  from  the  three  events  experienced 
by  Hal. 

Also,  Al  had  an  additional  2  months  of  follow¬ 
up  after  his  last  recurrent  event  during  which 
time  no  additional  event  occurred.  In  contrast, 
Hal  had  no  additional  event-free  follow-up 
time  after  his  last  recurrent  event. 

In  Table  8.2,  we  show  for  these  2  subjects,  how 
the  data  would  be  set  up  for  computer  analyses 
using  the  counting  process  approach.  Each 
subject  contributes  a  line  of  data  for  each 
time  interval  corresponding  to  each  recurrent 
event  and  any  additional  event-free  follow-up 
interval.  We  previously  introduced  this  format 
as  the  Counting  Process  (CP)  data  layout  in 
section  VI  of  Chapter  1 . 

A  distinguishing  feature  of  the  data  layout  for 
the  counting  process  approach  is  that  each  line 
of  data  for  a  given  subject  lists  the  start  time 
and  stop  time  for  each  interval  of  follow-up. 
This  contrasts  with  the  standard  layout  for 
data  with  no  recurrent  events,  which  lists  only 
the  stop  (survival)  time. 

Note  that  if  a  third  subject,  Sal,  failed  without 
further  events  or  follow-up  occurring,  then  Sal 
contributes  only  one  line  of  data,  as  shown  at 
the  left.  Similarly,  only  one  line  of  data  is  con¬ 
tributed  by  a  (fourth)  subject,  Mai,  who  was 
censored  without  having  failed  at  any  time 
during  follow-up. 
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IV.  General  Data  Layout: 
Counting  Process 
Approach 


The  general  data  layout  for  the  counting  pro¬ 
cess  approach  is  presented  in  Table  8.3  for  a 
dataset  involving  N  subjects. 


'  N  subjects 

F;  time  intervals  for  subject  i 
dij  event  staus  (0  or  1)  for  subject 
i  in  interval  j 

tyo  start  time  for  subject  i  in 
interval  j 

tiji  stop  time  for  subject  i  in 
interval  j 

Xyk  value  of  kth  predictor  for 
subject  i  in  interval  j 
i=  1,  2, . . . ,  N;  j  =  1,2,...,  ni5 
.  k=  1,2,...,  p 


The  ith  subject  has  r,  recurrent  events.  d;  j  denotes 
the  event  status  ( 1  =  failure,  0  =  censored)  for  the 
ith  subject  in  the  jth  time  interval,  tyo  and  tyi 
denote  the  start  and  stop  times,  respectively,  for 
the  ith  subject  in  the  jth  interval.  Xyt  denotes  the 
value  of  the  kth  predictor  for  the  ith  subject  in 
the  jth  interval. 


Table  8.3.  General  Data  Layout:  CP 
Approach 
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j 

dij 

tijO 

hji 

Xiji  ■  ■  •  Xjjp 

i 

i 

dn 

tllo 

till 

Xi ii  . .  .Xnp 

i 

2 

d12 

tl20 

tl21 

Xi21  ■  •  -Xi2p 

1 

ri 

dir. 

tlr,0 

tlr,l 

Xlr,l  • 

■  •  Xirip 

i 

i 

d„ 

tiio 

till 

XiU  . 

•Xi,p 

i 

2 

dl2 

ti20 

ti21 

Xi21  • 

•  Xi2p 

i 

n 

dir, 

tir,0 

tir,l 

Xir,l  ■ 

■  Xir,p 

Subjects  are  not  restricted  to  have  the  same 
number  of  time  intervals  (e.g.,  n  does  not 
have  to  equal  r2)  or  the  same  number  of  recur¬ 
rent  events.  If  the  last  time  interval  for  a  given 
subject  ends  in  censorship  (dy  =  0),  then  the 
number  of  recurrent  events  for  this  subject  is 
Tj  —  1 ;  previous  time  intervals,  however,  usually 
end  with  a  failure  (dy  =1). 

Also,  start  and  stop  times  may  be  different  for 
different  subjects.  (See  the  previous  section’s 
example  involving  two  subjects.) 

As  with  any  survival  data,  the  covariates  (i.e.,  Xs) 
may  be  time-independent  or  time-dependent  for 
a  given  subject.  For  example,  if  one  of  the  Xs  is 
“gender”  (1  =  female,  0  =  male),  the  values  of 
this  variable  will  be  all  Is  or  all  Os  over  all  time 
intervals  observed  for  a  given  subject.  If  another 
X  variable  is,  say,  a  measure  of  daily  stress  level, 
the  values  of  this  variable  are  likely  to  vary  over 
the  time  intervals  for  a  given  subject. 


N  1  dN1  tN10  tNii  XNn  . . .  XNlp 
N  2  dN2  tN2o  tN2i  XN2i  . . .  XN2p 


N  Tfj  dNl.N  tNrN0  bjrNl  XNfni  .  .  .  XNrNp 


The  second  column  (“Interval  j”)  in  the  data 
layout  is  not  needed  for  the  CP  analysis,  but  is 
required  for  other  approaches  described  later. 
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Table  8.4  First  26  Subjects:  Bladder 
Cancer  Study 

id  int  event  start  stop  tx  num  size 


1  1 
2  1 

3  1 

4  1 

5  1 

6  1 
6  2 

7  1 

8  1 
9  1 
9  2 

10  1 
10  2 

10  3 

11  1 
12  1 
12  2 

12  3 

13  1 
13  2 

13  3 

14  1 
14  2 
14  3 

14  4 

15  1 
15  2 
15  3 

15  4 

16  1 
16  2 

16  3 

17  1 

18  1 
18  2 
19  1 

19  2 

20  1 
20  2 
21  1 
22  1 

23  1 

24  1 

24  2 

25  1 
25  2 
25  3 

25  4 

26  1 
26  2 
26  3 
26  4 
26  5 


0  0 

0  0 

0  0 

0  0 

0  0 

1  0 

0  6 

0  0 

0  0 

1  0 

0  5 

1  0 

1  12 

0  16 

0  0 

1  0 

1  10 

0  15 

1  0 

1  3 

1  16 

1  0 

1  3 

1  9 

0  21 

1  0 

1  7 

1  10 

1  16 

1  0 

1  3 

1  15 

0  0 

1  0 

0  1 

1  0 

1  2 

1  0 

0  25 

0  0 

0  0 

0  0 

1  0 

1  28 

1  0 

1  2 

1  17 

0  22 

1  0 

1  3 

1  6 

1  8 

0  12 


0  0 
1  0 

4  0 

7  0 

10  0 
6  0 
10  0 

14  0 
18  0 

5  0 

18  0 
12  0 
16  0 
18  0 
23  0 

10  0 

15  0 

23  0 

3  0 

16  0 

23  0 

3  0 

9  0 

21  0 

23  0 

7  0 
10  0 
16  0 

24  0 

3  0 

15  0 

25  0 

26  0 
1  0 

26  0 
2  0 
26  0 
25  0 

28  0 
29  0 

29  0 

29  0 
28  0 

30  0 
2  0 

17  0 

22  0 
30  0 

3  0 

6  0 

8  0 

12  0 
30  0 


1  1 

1  3 

2  1 

1  1 

5  1 

4  1 

4  1 

1  1 

1  1 

1  3 

1  3 

1  1 

1  1 

1  1 

3  3 

1  3 

1  3 

1  3 

1  1 

1  1 

1  1 

3  1 

3  1 

3  1 

3  1 

2  3 

2  3 

2  3 

2  3 

1  1 

1  1 

1  1 

1  2 

8  1 

8  1 

1  4 

1  4 

1  2 

1  2 

1  4 

1  2 

4  1 

1  6 

1  6 

1  5 

1  5 

1  5 

1  5 

2  1 

2  1 

2  1 

2  1 

2  1 


To  illustrate  the  above  general  data  layout, 
we  present  in  Table  8.4  the  data  for  the  first 
26  subjects  from  a  study  of  recurrent  bladder 
cancer  tumors  (Byar,  1980  and  Wei,  Lin,  and 
Weissfeld,  1989).  The  entire  dataset  contained 
86  patients,  each  followed  for  a  variable  amount 
of  time  up  to  64  months. 

The  repeated  event  being  analyzed  is  the  recur¬ 
rence  of  bladder  cancer  tumors  after  trans¬ 
urethral  surgical  excision.  Each  recurrence  of 
new  tumors  was  treated  by  removal  at  each 
examination. 

About  25%  of  the  86  subjects  experienced  four 
events. 

The  exposure  variable  of  interest  is  drug  treat¬ 
ment  status  (tx,  0  =  placebo,  1  =  treatment  with 
thiotepa).  The  covariates  listed  here  are  initial 
number  of  tumors  (num)  and  initial  size  of 
tumors  (size)  in  centimeters.  The  paper  by 
Wei,  Lin,  and  Weissfeld  actually  focuses  on  a 
different  method  of  analysis  (called  “marginal”), 
which  requires  a  different  data  layout  than 
shown  here.  We  later  describe  the  “marginal” 
approach  and  its  corresponding  layout. 

In  these  data,  it  can  be  seen  that  16  of  these 
subjects  (id  #s  1,  2,  3,  4,  5,  6,  7,  8,  9,  11,  17,  18, 
20, 2 1 , 22, 23)  had  no  recurrent  events,  4  subjects 
had  2  recurrent  events  (id  #s  10,  12,  19,  24), 
4  subjects  (id  #s  13,  14,  16,  25)  had  3  recurrent 
events,  and  2  subjects  (id  #s  15,  26)  had  4  recur¬ 
rent  events. 

Moreover,  9  subjects  (id  #s  6,  9,  10,  12,  14,  18, 
20,  25,  26)  were  observed  for  an  additional 
event-free  time  interval  after  their  last  event. 
Of  these,  4  subjects  (id  #s  6,  9,  18,  20)  experi¬ 
enced  only  one  event  (i.e.,  no  recurrent  events). 
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V.  The  Counting  Process 
Model  and  Method 


Cox  PH  Model 

h(t,  X)  =  h0(t)exp[SPiXi] 

Need  to 

Assess  PH  assumption  for  X; 
Consider  stratified  Cox  or  extended 
Cox  if  PH  assumption  not 
satisfied 

Use  extended  Cox  for  time- 
dependent  variables 


Recurrent  event  Nonrecurrent 

data  event  data 

(Likelihood  function  formed  differently) 


Subjects  with  >  1 
time  interval 
remain  in  the 
risk  set  until  last 
interval  is 
completed 
Different  lines  of 
data  are  treated 
as  independent 
even  though 
several 

outcomes  on  the 
same  subject 


Subjects  removed 
from  risk  set  at 
time  of  failure  or 
censorship 

Different  lines  of 
data  are  treated 

as  independent 

because  they 
come  from 

different 

subjects 


The  model  typically  used  to  carry  out  the 
Counting  Process  approach  is  the  standard 
Cox  PH  model,  once  again  shown  here  at  the  left. 

As  usual,  the  PH  assumption  needs  to  be  evalu¬ 
ated  for  any  time-independent  variable.  A  stra¬ 
tified  Cox  model  or  an  extended  Cox  model 
would  need  to  be  used  if  one  or  more  time- 
independent  variables  did  not  satisfy  the  PH 
assumption.  Also,  an  extended  Cox  model 
would  be  required  if  inherently  time-dependent 
variables  were  considered. 


The  primary  difference  in  the  way  the  Cox 
model  is  used  for  analyzing  recurrent  event 
data  versus  nonrecurrent  (one  time  interval 
per  subject)  data  is  the  way  several  time  inter¬ 
vals  on  the  same  subject  are  treated  in  the  for¬ 
mation  of  the  likelihood  function  maximized 
for  the  Cox  model  used. 

To  keep  things  simple,  we  assume  that  the  data 
involve  only  time-independent  variables  satis¬ 
fying  the  PH  assumption.  For  recurrent  sur¬ 
vival  data,  a  subject  with  more  than  one  time 
interval  remains  in  the  risk  set  until  his  or  her 
last  interval,  after  which  the  subject  is  removed 
from  the  risk  set.  In  contrast,  for  nonrecurrent 
event  data,  each  subject  is  removed  from  the 
risk  set  at  the  time  of  failure  or  censorship. 

Nevertheless,  for  subjects  with  two  or  more 
intervals,  the  different  fines  of  data  contributed 
by  the  same  subject  are  treated  in  the  analysis 
as  if  they  were  independent  contributions  from 
different  subjects,  even  though  there  are  several 
outcomes  on  the  same  subject. 

In  contrast,  for  the  standard  Cox  PH  model 
approach  for  nonrecurrent  survival  data,  dif¬ 
ferent  lines  of  data  are  treated  as  independent 
because  they  come  from  different  subjects. 
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Cox  PH  Model  for  CP  Approach: 
Bladder  Cancer  Study 

h(t,  X)  =  h0(t)exp[p  tx  +  Yj  num 
+  Y2  size] 

where 

tx  =  1  if  thiotepa,  0  if  placebo 
num  =  initial  #  of  tumors 
size  =  initial  size  of  tumors 


No-interaction  Model 


Interaction  model  would  involve 
product  terms 

tx  x  num  and/or  tx  x  size 


For  the  bladder  cancer  study  described  in 
Table  8.4,  the  basic  Cox  PH  model  fit  to  these 
data  takes  the  form  shown  at  the  left. 

The  primary  (exposure)  variable  of  interest  in 
this  model  is  the  treatment  variable  tx.  The 
variables  num  and  size  are  considered  as 
potential  con-founders.  All  three  variables  are 
time-independent  variables. 

This  is  a  no-interaction  model  because  it  does 
not  contain  product  terms  of  the  form  tx  x 
num  or  tx  x  size.  An  interaction  model  involv¬ 
ing  such  product  terms  could  also  be  consid¬ 
ered,  but  we  only  present  the  no-interaction 
model  for  illustrative  purposes. 


Table  8.5.  Ordered  Failure  Time  and 
Risk  Set  Information  for  First  26  Subjects 
in  Bladder  Cancer  Study 


Ordered 
failure 
times  t(f) 

#  in 
risk 

set 

nf 

# 

failed 

m, 

# 

censored 

in 

[t(D,  t(f+i>) 

Subject 
ID  #s  for 

outcomes 

in 

t(f+u) 

0 

26 

— 

1 

i 

1 

25 

i 

1 

2,  18 

2 

24 

2 

0 

19,  25 

3 

24 

4 

1 

3,  13,  14, 
16,  26 

5 

23 

1 

0 

9 

6 

23 

2 

0 

6,  26 

7 

23 

1 

1 

4,  15 

8 

22 

1 

0 

26 

9 

22 

1 

0 

14 

10 

22 

2 

2 

5,6,  12,  15 

12 

20 

2 

1 

7,  10,  26 

15 

19 

2 

0 

12,  16 

16 

19 

3 

0 

10,  13,  15 

17 

19 

1 

3 

8,9,  10,25 

21 

16 

1 

0 

14 

22 

16 

1 

0 

25 

23 

16 

1 

3 

11,  12,  13, 
14 

24 

12 

1 

0 

15 

25 

11 

2 

0 

16,  20 

26 

10 

1 

2 

17,  18,  19 

28 

7 

1 

4 

20,  21, 22, 
23,  24 

30 

3 

1 

2 

24,  25,  26 

Table  8.5  at  the  left  provides  ordered  failure 
times  and  corresponding  risk  set  information 
that  would  result  if  the  first  26  subjects  that  we 
described  in  Table  8.4  comprised  the  entire 
dataset.  (Recall  that  there  are  86  subjects  in 
the  complete  dataset.) 

Because  we  consider  26  subjects,  the  number  in 
the  risk  set  at  ordered  failure  time  t(0>  is  n0  =  26. 
As  these  subjects  fail  (i.e.,  develop  a  bladder 
cancer  tumor)  or  are  censored  over  follow-up, 
the  number  in  the  risk  set  will  decrease  from  the 
fth  to  the  f  +  1th  ordered  failure  time  provided 
that  no  subject  who  fails  at  time  t(f)  either  has  a 
recurrent  event  at  a  later  time  or  has  additional 
follow-up  time  until  later  censorship.  In  other 
words,  a  subject  who  has  additional  follow-up 
time  after  having  failed  at  t^  does  not  drop  out 
of  the  risk  set  after  t (p. 


32 


21 
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Table  8.6.  Focus  on  Subject  #s  19  and  25 
from  Table  8.5 


t(f) 

n(f) 

m(f) 

9(0 

Subject  ID  #s 

0 

26 

— 

1 

1 

1 

25 

1 

1 

2,  18 

2 

24 

2 

0 

19,  25 

3 

24 

4 

1 

• 

3,  13,  14,  16,  26 

17 

19 

1 

3 

8,  9,  10,  25 

21 

16 

1 

0 

14 

22 

16 

1 

0 

25 

23 

16 

1 

3 

11,  12,  13,  14 

24 

12 

1 

0 

15 

25 

11 

2 

0 

16,  20 

26 

10 

1 

2 

17,  18,  19 

28 

7 

1 

4 

20,  21,  22,  23,  24 

30 

3 

1 

2 

24,  25,  26 

For  example,  at  month  t(f)  =  2,  subject  #s  19 
and  25  fail,  but  the  number  in  the  risk  set  at 
that  time  (nf  =  24)  does  not  decrease  (by  2) 
going  into  the  next  failure  time  because  each 
of  these  subjects  has  later  recurrent  events.  In 
particular,  subject  #19  has  a  recurrent  event  at 
month  t(p  =  26  and  subject  #25  has  two  recur¬ 
rent  events  at  months  t(p  =  17  and  t(Q  =  22  and 
has  additional  follow-up  time  until  censored 
month  30. 


Table  8.7.  Focus  on  Subject  #s  3,  13,  14, 
16,  26  from  Table  8.5 


As  another  example  from  Table  8.5,  subject  #s 


t(f) 

n(f) 

m(f) 

qrn 

Subject  ID  #s 

ordered  failure  time  L(q  =  3,  but  the  number 
in  the  risk  set  only  drops  from  24  to  23  even 

0 

26 

_ 

l 

1 

1 

25 

1 

l 

2,  18 

though  the  last  four  of  these  subjects  all  fail  at 

2 

24 

2 

0 

19,  25 

t(f)  =  3.  Subject  #3  is  censored  at  month  4  (see 

3 

24 

4 

l 

3,  13,  14,  16,  26 

Table  8.4),  so  this  subject  is  removed  from  the 

5 

23 

1 

0 

9 

risk  set  after  failure  time  t(f)  =  3.  However, 

6 

23 

2 

0 

6,  26 

subjects  13,  14,  16,  and  26  all  have  recurrent 

7 

23 

1 

1 

4,  15 

events  after  t(f)  =  3,  so  they  are  not  removed 

8 

22 

1 

0 

26 

from  the  risk  set  after  t(f)  =  3. 

9 

22 

1 

0 

14 

10 

22 

2 

2 

5,  6,  12,  15 

Subject  #26  appears  in  the  last  column  5  times. 

12 

20 

2 

1 

7,  10,  26 

This  subject  contributes  5  (start,  stop)  time  inter- 

15 

19 

2 

0 

12,  16 

vals,  fails  at  months  3,  6,  8,  and  12,  and  is  also 

16 

19 

3 

0 

10,  13,  15 

followed  until  month  30,  when  he  is  censored. 

17 

19 

1 

3 

8,  9,  10,  25 

21 

16 

1 

0 

14 

22 

16 

1 

0 

25 

23 

16 

1 

3 

11,  12,  13,  14 

24 

12 

1 

0 

15 

25 

11 

2 

0 

16,  20 

26 

10 

1 

2 

17,  18,  19 

28 

7 

1 

4 

20,  21,  22,  23,  24 

30 

3 

1 

2 

24,  25,  26 

Presentation:  V.  The  Counting  Process  Model  and  Method  375 


“Gaps”  in  follow-up  time: 

0  10  g^p  25  50 

lost  re-enter 

No  Interaction  Cox  PH  Model 

h(t,X)  =  h0(t)exp[p  tx  +  Yj  num 

+  Y2  size] 

Partial  likelihood  function: 

L  =  L]  x  L2  x  •  •  •  x  L22 


Lf  =  individual  likelihood  at  t® 

=  Pr[failing  at  t®  |  survival  up  to 
t(f)l 

f  =  1,  2,  22 

^  _  exp(ptx(f)  +r1num(f)  +y2size(£)) 

exp(ptxs{f)  +  Vjnum^f)  +  y2sizes(f)) 

s  in  R(t(f) ) 


tx®,  num(f),  and  size(f)  values  of  tx, 
num,  and  size  at  t(f) 


txs®,  nums(f),  and  sizes(f)  values  of  tx, 
num,  and  size  for  subject  s  in  R(t(f>) 


Data  for  Subject  #25 


id 

int 

event 

start 

stop 

tx 

num 

size 

25 

i 

i 

0 

2 

0 

i 

5 

25 

2 

i 

2 

17 

0 

i 

5 

25 

3 

i 

17 

22 

0 

i 

5 

25 

4 

0 

22 

30 

0 

i 

5 

f  =  15th  ordered  failure  time 
n15  =  16  subjects  in  risk  set  at 
t(is)  =  22: 


R(t(i5)  =  22)  =  [subject  #s  11,  12, 
13,  14,  15,  16,  17,  18,  19,  20,  21, 
22,  23,  24,  25,  26} 


Another  situation,  which  is  not  illustrated  in 
these  data,  involves  “gaps”  in  a  subject’s  fol¬ 
low-up  time.  A  subject  may  leave  the  risk  set 
(e.g.,  lost  to  follow-up)  at,  say,  time  =  10  and 
then  re-enter  the  risk  set  again  and  be  followed 
from,  say,  time  =  25  to  time  =  50.  This  subject 
has  a  follow-up  gap  during  the  period  from 
time  =  10  to  time  =  25. 

The  (partial)  likelihood  function  (L)  used  to  fit 
the  no-interaction  Cox  PH  model  is  expressed 
in  typical  fashion  as  the  product  of  individual 
likelihoods  contributed  by  each  ordered  failure 
time  and  corresponding  risk  set  information  in 
Table  8.5.  There  are  22  such  terms  in  this  prod¬ 
uct  because  there  are  22  ordered  failure  times 
listed  in  Table  8.5. 

Each  individual  likelihood  Lf  essentially  gives 
the  conditional  probability  of  failing  at  time 
t(f),  given  survival  (i.e.,  remaining  in  the  risk 
set)  at  t(f). 

If  there  is  only  one  failure  at  the  jth  ordered 
failure  time,  Lf  is  expressed  as  shown  at  the  left 
for  the  above  no-interaction  model.  In  this 
expression  tx®,  num(f),  and  size®  denote  the 
values  of  the  variables  tx,  num,  and  size  for  the 
subject  failing  at  month  t®. 

The  terms  txs®,  nums®,  and  sizes®  denote  the 
values  of  the  variables  tx,  num,  and  size  for  the 
subject  s  in  the  risk  set  R(t®).  Recall  that  R(t®) 
consists  of  all  subjects  remaining  at  risk  at 
failure  time  t®. 

For  example,  subject  #25  from  Table  8.4  failed 
for  the  third  time  at  month  22,  which  is  the  f  = 
15th  ordered  failure  time  listed  in  Table  8.5.  It 
can  be  seen  that  nf  =  1 6  of  the  initial  26  subjects 
are  still  at  risk  at  the  beginning  of  month  22. 
The  risk  set  at  this  time  includes  subject  #25 
and  several  other  subjects  (#s  12,  13,  14,  15,  16, 
18,  19,  26)  who  already  had  at  least  one  failure 
prior  to  month  22. 
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Ll5  = 

_ e*p(P(Q)  +  y1(i)  +  rz(5)) _ 

E  exP(Ptxs(i5)  +  y^umsps)  +  5>jSizes(15)) 

s  in  R(t(i5)) 


The  corresponding  likelihood  L15  at  t(i5)  =  22 
is  shown  at  the  left.  Subject  #2 5’s  values 
tx25(i5)  =  0,  num25(i5)  =  1,  and  size25(i5)  =  5, 
have  been  inserted  into  the  numerator  of  the 
formula.  The  denominator  will  contain  a  sum 
of  16  terms,  one  for  each  subject  in  the  risk  set 
at  t(i5)  =  22. 


Computer  program  formulates 

partial  likelihood  L 

(See  Computer  Appendix) 


The  overall  partial  likelihood  L  will  be  formu¬ 
lated  internally  by  the  computer  program  once 
the  data  layout  is  in  the  correct  form  and  the 
program  code  used  involves  the  (start,  stop) 
formulation. 


VI.  Robust  Estimation 


Data  for  Subject  #14 


id 

int 

event 

start 

stop 

tx 

num 

size 

14 

i 

i 

0 

3 

0 

3 

i 

14 

2 

i 

3 

9 

0 

3 

i 

14 

3 

i 

9 

21 

0 

3 

i 

14 

4 

0 

21 

23 

0 

3 

i 

As  illustrated  for  subject  #14  at  the  left,  each 
subject  contributes  a  line  of  data  for  each  time 
interval  corresponding  to  each  recurrent  event 
and  any  additional  event-free  follow-up  interval. 


Up  to  this  point:  We  have  also  pointed  out  that  the  Cox  model 

the  4  lines  of  data  for  subject  #14  are  analysis  described  up  to  this  point  treats  differ- 
treated  as  independent  observations  ent  lines  of  data  contributed  by  the  same  sub¬ 
ject  as  if  they  were  independent  contributions 
from  different  subjects. 


Nevertheless, 

•  Observations  of  the  same 
subject  are  correlated 

•  Makes  sense  to  adjust  for  such 
correlation  in  the  analysis 


Nevertheless,  it  makes  sense  to  view  the  dif¬ 
ferent  intervals  contributed  by  a  given  subject 
as  representing  correlated  observations  on  the 
same  subject  that  must  be  accounted  for  in  the 
analysis. 


Robust  (Empirical)  Estimation 

•  Adjusts 

Var(pk) 

where 

Pk 

is  an  estimated  regression 
coefficient 

•  accounts  for  misspecification  of 
assumed  correlation  structure 


A  widely  used  technique  for  adjusting  for  the 
correlation  among  outcomes  on  the  same  sub¬ 
ject  is  called  robust  estimation  (also  referred 
to  as  empirical  estimation).  This  technique 
essentially  involves  adjusting  the  estimated 
variances  of  regression  coefficients  obtained 
for  a  fitted  model  to  account  for  misspecifi¬ 
cation  of  the  correlation  structure  assumed 
(see  Zeger  and  Liang,  1986  and  Kleinbaum 
and  Klein,  2010). 
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CP  approach:  assumes 
independence 

Goal  of  robust  estimation:  adjust 
for  correlation  within  subjects 


In  the  CP  approach,  the  assumed  correlation 
structure  is  independence;  that  is,  the  Cox  PH 
model  that  is  fit  assumes  that  different  out¬ 
comes  on  the  same  subject  are  independent. 
Therefore  the  goal  of  robust  estimation  for  the 
CP  approach  is  to  obtain  variance  estimators 
that  adjust  for  correlation  within  subjects  when 
previously  no  such  correlation  was  assumed. 


Same  goal  for  other  approaches  for 
analyzing  recurrent  event  data 


This  is  the  same  goal  for  other  approaches  for 
analyzing  recurrent  event  data  that  we  describe 
later  in  this  chapter. 


Do  not  adjust 

Pk 

Only  adjust 
Var(pk) 


Note  that  the  estimated  regression  coefficients 
themselves  are  not  adjusted;  only  the  estimated 
variances  of  these  coefficients  are  adjusted. 


Robust  (Empirical)  Variance 
allows 

tests  of  hypotheses  and 
confidence  intervals 
that  account  for  correlated  data 


The  robust  (i.e.,  empirical)  estimator  of  the 
variance  of  an  estimated  regression  coefficient 
therefore  allows  tests  of  hypotheses  and  confi¬ 
dence  intervals  about  model  parameters  that 
account  for  correlation  within  subjects. 


Matrix  formula:  We  briefly  describe  the  formula  for  the  robust 

variance  estimator  below.  This  formula  is  in 
derived  from  ML  estimation  matrix  form  and  involves  terms  that  derive 

from  the  set  of  “score”  equations  that  are  used 
to  solve  for  ML  estimates  of  the  regression 
coefficients.  This  information  may  be  of  inter¬ 
est  to  the  more  mathematically  inclined  reader 
with  some  background  in  methods  for  the 
analysis  of  correlated  data  (Kleinbaum  and 
Klein,  2010). 


Formula  not  essential  for  using  However,  the  information  below  is  not  essen- 
computer  packages  tial  for  an  understanding  of  how  to  obtain 

robust  estimators  using  computer  packages. 
(See  Computer  Appendix.) 
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Extension  (Lin  and  Wei,  1989)  of  The  robust  estimator  for  recurrent  event  data 
information  sandwich  estimator  was  derived  by  Lin  and  Wei  (1989)  as  an  exten- 
(Zeger  and  Liang,  1986)  sion  similar  to  the  "information  sandwich  esti¬ 

mator”  proposed  by  Zeger  and  Liang  (1986)  for 
generalized  linear  models.  SAS  and  Stata  use 
variations  of  this  estimator  that  give  slightly 
different  estimates. 


Matrix  formula 


R(p)  =Var(p) 


RSRS 


where 

Var^p) 

is  the  information  matrix,  and 

Rs 

is  matrix  of  score  residuals 


The  general  form  of  this  estimator  can  be  most 
conveniently  written  in  matrix  notation  as 
shown  at  the  left.  In  this  formula,  the  variance 
expression  denotes  the  information  matrix 
form  of  estimated  variances  and  covariances 
obtained  from  (partial)  ML  estimation  of  the 
Cox  model  being  fit.  The  Rs  expression  in  the 
middle  of  the  formula  denotes  the  matrix  of 
score  residuals  obtained  from  ML  estimation. 


Formula  applies  to  other  appro¬ 
aches  for  analyzing  recurrent 
event  data 


The  robust  estimation  formula  described  above 
applies  to  the  CP  approach  as  well  as  other 
approaches  for  analyzing  recurrent  event  data 
described  later  in  this  chapter. 


VII.  Results  for  CP 
Example 


We  now  describe  the  results  from  using  the  CP 
approach  on  the  Bladder  Cancer  Study  data 
involving  all  85  subjects. 


Table  8.8.  Edited  SAS  Output  from  CP 
Approach  on  Bladder  Cancer  Data  (N  = 
85  Subjects)  Without  Robust  Variances 


Var  DF 

Parameter 

Estimate 

Std 

Error 

Chisq 

P 

HR 

tx  1 

-0.4071 

0.2001 

4.140 

0.042 

0.667 

num  1 

0.1607 

0.0480 

11.198 

0.001 

1.174 

size  1 

-0.0401 

0.0703 

0.326 

0.568 

0.961 

-2  LOG  L  = 

920.159 

Table  8.9.  Robust  Covariance  Matrix, 
CP  Approach  on  Bladder  Cancer  Data 


tx 

num 

size 

tx 

0.05848 

-0.00270 

-0.00051 

num 

-0.00270 

0.00324 

0.00124 

size 

-0.00051 

0.00124 

0.00522 

Table  8.8  gives  edited  output  from  fitting  the 
no-interaction  Cox  PH  model  involving  the 
three  predictors  tx,  num,  and  size.  A  likelihood 
ratio  chunk  test  for  interaction  terms  tx  x  num 
and  tx  x  size  was  nonsignificant,  thus  support¬ 
ing  the  no-interaction  model  shown  here.  The 
PH  assumption  was  assumed  satisfied  for  all 
three  variables. 

Table  8.9  provides  the  covariance  matrix 
obtained  from  robust  estimation  of  the  var¬ 
iances  of  the  estimated  regression  coefficients 
of  tx,  num,  and  size.  The  values  on  the  dia¬ 
gonal  of  this  matrix  give  robust  estimates  of 
these  variances  and  the  off-diagonal  values 
give  covariances. 


Presentation:  VIII.  Other  Approaches  Stratified  Cox  379 


Robust  standard  error  for  tx 

=  square-root  (.05848)  =  0.2418 

Nonrobust  standard  error  for  tx 

=  0.2001 


Because  the  exposure  variable  of  interest  in 
this  study  is  tx,  the  most  important  value  in 
this  matrix  is  0.05848.  The  square  root  of  this 
value  is  0.2418,  which  gives  the  robust  stan¬ 
dard  error  of  the  estimated  coefficient  of  the  tx 
variable.  Notice  that  this  robust  estimator  is 
similar  but  somewhat  different  from  the  non¬ 
robust  estimator  of  0.2001  shown  in  Table  8.8. 


Summary  of  Results  from 
CP  Approach 

Hazard  Ratio  tx:  exp(— 0.407)  =  0.667 

(=  1/1.5) 

Wald  Chi-Square  tx:  robust  nonrobust 
2.83  4.14 

P-value  tx:  .09  .04 

(H0:  no  effect  of  tx,  HA:  two  sided) 

95%  Cl  for  HR  tx  (robust): 

(0.414,  1.069) 

Ha:  one-sided,  both  p-values  <  .05 

We  return  to  the  analysis  of  these 
data  when  we  discuss  other  app¬ 
roaches  for  analysis  of  recurrent 
event  data. 


We  now  summarize  the  CP  results  for  the 
effect  of  the  exposure  variable  tx  on  recurrent 
event  survival  controlling  for  num  and  size. 
The  hazard  ratio  estimate  of  0.667  indicates 
that  the  hazard  for  the  placebo  is  1.5  times 
the  hazard  for  the  treatment. 

Using  robust  estimation,  the  Wald  statistic  for 
this  hazard  ratio  is  borderline  nonsignificant 
(P  =  .09).  Using  the  nonrobust  estimator,  the 
Wald  statistic  is  borderline  significant  (P  =  .04). 
Both  these  P-values,  however,  are  for  a  two-sided 
alternative.  For  a  one-sided  alternative,  both 
P-values  would  be  significant  at  the  .05  level. 
The  95%  confidence  interval  using  the  robust 
variance  estimator  is  quite  wide  in  any  case. 


VIII.  Other  Approaches 
Stratified  Cox 


3  stratified  Cox  (SC)  approaches: 

Stratified  CP  (Prentice, 

Williams  and 

Gap  Time  Peterson,  1981) 

Marginal  (Wei,  Lin,  and 

Weissfeld,  1989) 

Goal:  distinguish  order  of  recur¬ 
rent  events 


We  now  describe  three  other  approaches 
for  analyzing  recurrent  event  data,  each  of 
which  uses  a  Stratified  Cox  (SC)  PH  model. 
They  are  called  Stratified  CP,  Gap  Time,  and 
Marginal.  These  approaches  are  often  used  to 
distinguish  the  order  in  which  recurrent  events 
occur. 


Strata  variable:  time  interval  # 
treated  as 
categorical 


The  “strata”  variable  for  each  approach  treats 
the  time  interval  number  as  a  categorical 
variable. 
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Example: 

maximum  of  4  failures  per  subject 

a- 

Strata  =  1  for  time  interval  #  1 
variable  2  for  time  interval  #  2 

3  for  time  interval  #  3 

4  for  time  interval  #  4 


For  example,  if  the  maximum  number  of  failures 
that  occur  on  any  given  subject  in  the  dataset  is, 
say,  4,  then  time  interval  #1  is  assigned  to  stra¬ 
tum  1 ,  time  interval  #2  to  stratum  2,  and  so  on. 


Time  between  two  events: 

0  50  — > 80 


Stratified  CP 


Gap  Time 


entry 

0->  30 
evl  ev2 


Both  Stratified  CP  and  Gap  Time  approaches 
focus  on  survival  time  between  two  events. 
However,  Stratified  CP  uses  the  actual  times 
of  the  two  events  from  study  entry,  whereas 
Gap  Time  starts  survival  time  at  0  for  the  ear¬ 
lier  event  and  stops  at  the  later  event. 


Marginal 

•  Total  survival  time  from  study 
entry  until  kth  event 

•  Recurrent  events  of  different 
types 


The  Marginal  approach,  in  contrast  to  each 
conditional  approach,  focuses  on  total  survival 
time  from  study  entry  until  the  occurrence  of  a 
specific  (e.g.,  kth)  event;  this  approach  is  sug¬ 
gested  when  recurrent  events  are  viewed  to  be 
of  different  types. 


Stratified  CP  for  Subject  10 


id 

int 

event 

start 

stop 

tx 

num 

size 

10 

i 

i 

0 

12 

0 

i 

i 

10 

2 

i 

12 

16 

0 

i 

i 

10 

3 

0 

16 

18 

0 

i 

i 

Gap  Time  for  Subject  10 

(stop  =  Interval  Length  Since  Previous  Event) 

id  int  event  start  stop 

tx  num 

size 

10  1  1  0  12 

0  1 

i 

10  2  1  0  4 

0  1 

i 

10  3  0  0  2 

0  1 

i 

The  stratified  CP  approach  uses  the  exact 
same  (start,  stop)  data  layout  format  used  for 
the  CP  approach,  except  that  for  Stratified  CP, 
an  SC  model  is  used  rather  than  a  standard 
(unstratified)  PH  model.  The  strata  variable 
here  is  int  in  this  listing. 

The  Gap  Time  approach  also  uses  a  (start, 
stop)  data  layout,  but  the  start  value  is  always 
0  and  the  stop  value  is  the  time  interval  length 
since  the  previous  event.  The  model  here  is  also 
a  SC  model. 


Marginal  approach 

Standard  (nonrecurrent  event) 
layout,  i.e.,  without  (start,  stop) 
columns 


The  Marginal  approach  uses  the  standard 
(nonrecurrent  event)  data  layout  instead  of 
the  (start,  stop)  layout,  as  illustrated  below. 
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Marginal  Approach  for  Subject  10 


id 

int 

event 

stime 

tx 

num 

size 

10 

i 

i 

12 

0 

i 

i 

10 

2 

i 

16 

0 

i 

i 

10 

3 

0 

18 

0 

i 

i 

10 

4 

0 

18 

0 

i 

i 

The  Marginal  approach  layout,  shown  at  the 
left,  contains  four  lines  of  data  in  contrast  to 
the  three  lines  of  data  that  would  appear  for 
subject  #10  using  the  CP,  Stratified  CP,  and 
Gap  Time  approaches 


Marginal  approach 

Each  subject  at  risk  for  all 
failures  that  might  occur 

#  actual  failures  <  #  possible  failures 


The  reason  why  there  are  4  lines  of  data  here  is 
that,  for  the  Marginal  approach,  each  subject 
is  considered  to  be  at  risk  for  all  failures  that 
might  occur,  regardless  of  the  number  of 
events  a  subject  actually  experienced. 


Bladder  cancer  data: 

Maximum  #  (possible)  failures  =  4 

So,  subject  10  (as  well  as  all  other 
subjects)  gets  4  lines  of  data 


Because  the  maximum  number  of  failures  being 
considered  in  the  bladder  cancer  data  is  4  (e.g., 
for  subject  #s  15  and  26),  subject  #10,  who  failed 
only  twice,  will  have  two  additional  lines  of 
data  corresponding  to  the  two  additional  fail¬ 
ures  that  could  have  possibly  occurred  for  this 
subject. 


Fundamental  Difference  Among  the 
3  SC  Approaches 

Risk  set  differs  for  strata  after  first 
event 


The  three  alternative  SC  approaches  (Strati¬ 
fied  CP,  Gap  Time,  and  Marginal)  fundamen¬ 
tally  differ  in  the  way  the  risk  set  is  determined 
for  strata  corresponding  to  events  after  the  first 
event. 


Gap  Time:  time  until  1st  event 
does  not  influence  risk  set  for  later 
events  (i.e.,  clock  reset  to  0  after 
event  occurs) 


With  Gap  Time,  the  time  until  the  first  event 
does  not  influence  the  composition  of  the  risk 
set  for  a  second  or  later  event.  In  other  words, 
the  clock  for  determining  who  is  at  risk  gets 
reset  to  0  after  each  event  occurs. 


Stratified  CP:  time  until  1st  event 
influences  risk  set  for  later  events 


In  contrast,  with  Stratified  CP,  the  time  until 
the  first  event  affects  the  composition  of  the 
risk  set  for  later  events. 


Marginal:  risk  set  determined 
from  time  since  study  entry 


With  the  Marginal  approach,  the  risk  set  for 
the  kth  event  (k  =  1,  2,  . . .)  identifies  those  at 
risk  for  the  kth  event  since  entry  into  the  study. 
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EXAMPLE 


ID 

Days 

Status  Stratum  Start  Stop 

tx 

M 

i 

i 

0 

100 

1 

M 

i 

2 

100 

105 

1 

H 

i 

1 

0 

30 

0 

H 

i 

2 

30 

50 

0 

P 

i 

1 

0 

20 

0 

P 

i 

2 

20 

60 

0 

P 

i 

3 

60 

85 

0 

Suppose,  for  example,  that  Molly  (M),  Holly 
(H),  and  Polly  (P)  are  the  only  three  subjects 
in  the  dataset  shown  at  the  left.  Molly  receives 
the  treatment  (tx  =  1 )  whereas  Holly  and  Polly 
receive  the  placebo  (tx  =  0).  All  three  have 
recurrent  events  at  different  times.  Also,  Polly 
experiences  three  events  whereas  Molly  and 
Holly  experience  two. 


Stratified  CP 


Stratum  1  Stratum  2 


t(f)  nf  R(t(p)  t(f)  nf  R(t(f>) 

0  3  [M,  H,  P]  20  1  [P] 

20  3  [M,  H,  P]  30  2  [H,  P] 

30  2  [M,  H]  50  2  [H,  P] 

100  1  [M]  60  1  [P] 

105  1  [M] 


Gap  Time 


Stratum  1 

Stratum  2 

t(f) 

nf 

R(t(f>) 

t(f) 

nf 

R(t(fj) 

0 

3 

[M,  H,  P] 

0 

3 

[M,  H,  P] 

20 

3 

[M,  H,  P} 

5 

3 

[M,  H,  P] 

30 

2 

[M,  H] 

20 

2 

[H,  P] 

100 

1 

[M] 

40 

1 

[P] 

The  table  at  the  left  shows  how  the  risk  set 
changes  over  time  for  strata  1  and  2  if  the 
Stratified  CP  approach  is  used.  For  stratum 
2,  there  are  no  subjects  in  the  risk  set  until 
t  =  20,  when  Polly  gets  the  earliest  first  event 
and  so  becomes  at  risk  for  a  second  event. 
Holly  enters  the  risk  set  at  t  =  30.  So  at  t  =  50, 
when  the  earliest  second  event  occurs,  the  risk 
set  contains  Holly  and  Polly.  Molly  is  not  at  risk 
for  getting  her  second  event  until  t  =  100.  The 
risk  set  at  t  =  60  contains  only  Polly  because 
Holly  has  already  had  her  second  event  at 
t  =  50.  And  the  risk  set  at  t  =  105  contains 
only  Molly  because  both  Holly  and  Polly  have 
already  had  their  second  event  by  t  =  105. 

The  next  table  shows  how  the  risk  set  changes 
over  time  if  the  Gap  Time  approach  is  used. 
Notice  that  the  data  for  stratum  1  are  identical 
to  those  for  Stratified  CP.  For  stratum  2,  how¬ 
ever,  all  three  subjects  are  at  risk  for  the  second 
event  at  t  =  0  and  at  t  =  5,  when  Molly  gets 
her  second  event  5  days  after  the  first  occurs. 
The  risk  set  at  t  =  20  contains  Holly  and  Polly 
because  Molly  has  already  had  her  second 
event  by  t  =  20.  And  the  risk  set  at  t  =  40 
contains  only  Polly  because  both  Molly  and 
Holly  have  already  had  their  second  event  by 
t  =  40. 
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Marginal 


Stratum  1 

Stratum  2 

t(f) 

nf 

R(t(f>) 

t(f) 

nf 

R(t(f>) 

0 

3 

{M,  H,  P] 

0 

3 

[M,  H,  P] 

20 

3 

[M,  H,  P] 

50 

3 

[M,  H,  P] 

30 

2 

{M,  H] 

60 

2 

[M,  P] 

100 

3 

[M] 

105 

1 

(M] 

Stratum  3  for  Marginal  approach 
follows 


Marginal 
Stratum  3 


t(f) 

nf 

R(t(f>) 

0 

3 

[M,  H,  P] 

85 

2 

[M,  P] 

Note:  H  censored  by  t  =  85 

Basic  idea  (Marginal  approach): 

Each  failure  considered  a  separate 
process 

Allows  stratifying  on 

•  Failure  order 

•  Different  failure  type  (e.g., 
stage  1  vs.  stage  2  cancer) 

Stratified  Cox  PH  (SC)  Model  for 
all  3  alternative  approaches 

Use  standard  computer  program 
for  SC  (e.g.,  SAS’s  PHREG,  Stata’s 
stcox,  SPSS’s  coxreg,  R’s  Coxph) 


No-interaction  SC  model  for  bladder 
cancer  data 

hg(t,X)  =  h0g(t)exp[p  tx  +  Y!  num 

+  Y2  size] 


We  next  consider  the  Marginal  approach.  For 
stratum  1 ,  the  data  are  identical  again  to  those 
for  Stratified  CP.  For  stratum  2,  however, 
all  three  subjects  are  at  risk  for  the  second 
event  at  t  =  0  and  at  t  =  50,  when  Holly  gets 
her  second  event.  The  risk  set  at  t  =  60  contains 
Molly  and  Polly  because  Holly  has  already  had 
her  second  event  at  t  =  50.  And  the  risk  set  at 
t  =  105  contains  only  Molly  because  both  Holly 
and  Polly  have  already  had  their  second  event 
by  t  =  60. 


Because  Polly  experienced  three  events,  there 
is  also  a  third  stratum  for  this  example,  which 
we  describe  for  the  marginal  approach  only. 

Using  the  marginal  approach,  all  three  subjects 
are  considered  at  risk  for  the  third  event  when 
they  enter  the  study  (t  =  0),  even  though  Molly 
and  Holly  actually  experience  only  two  events. 
At  t  =  85,  when  Polly  has  her  third  event,  Holly, 
whose  follow-up  ended  at  t  =  50,  is  no  longer  in 
the  risk  set;  which  still  includes  Molly  because 
Molly's  follow-up  continues  until  t  =  105. 

The  basic  idea  behind  the  Marginal  approach 
is  that  it  allows  each  failure  to  be  considered  as 
a  separate  process.  Consequently,  the  Marginal 
approach  not  only  allows  the  investigator  to 
consider  the  ordering  of  failures  as  separate 
events  (i.e.,  strata)  of  interest,  but  also  allows 
the  different  failures  to  represent  different  types 
of  events  that  may  occur  on  the  same  subject. 


All  three  alternative  approaches,  although  dif¬ 
fering  in  the  form  of  data  layout  and  the  way 
the  risk  set  is  determined,  nevertheless  use  a 
stratified  Cox  PH  model  to  carry  out  the  analy¬ 
sis.  This  allows  a  standard  program  that  fits  a 
SC  model  (e.g.,  SAS’s  PHREG)  to  perform  the 
analysis. 

The  models  used  for  the  three  alternative  SC 
approaches  are  therefore  of  the  same  form.  For 
example,  we  show  on  the  left  the  no-interaction 
SC  model  appropriate  for  the  bladder  cancer 
data  we  have  been  illustrating. 


where  g  =  1,  2,  3,  4 
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Two  types  of  SC  models: 

No-interaction  versus  interaction 
model 

Typically  compared  using  LR  test 


Version  1:  Interaction  SC  Model 

hg(t,X)  =  h0g(t)  exp[(3g  tx 

+  Yig  num  +  y2g  size] 

g=  1,2,  3,4 

Version  2:  Interaction  SC  Model 

hg(t,X)  =  h0g(t)exp[P  tx  +  yl  num 
+  y2  size  +  <>n  (Z\  x  tx) 

+  §12  (Z*2  x  tx)  +  §13  (Z]  x  tx) 

+  §21  (Z\  x  num)  +  §22  (Zj  x  num) 
+  §23  (Zj  x  num)  +  §3i  (Z\  x  size) 
+§32 (Z2  x  size)  +  §33 (Z]  x  size)] 

where  Z\,  Z2,  and  Z2  are  3  dummy 

variables  for  the  4  strata. 

H0  (Version  1) 

Pi  =  P2  =  P3  =  P4  =  P) 

Y11  =  y  12  =  y  13  =  y  14  =  Yu 

Y21  =  Y22  =  Y23  =  Y24  =  Y2 

H0  (Version  2) 

§11  =  §12  =  §13  =  §21  =  §22 
=  §23  =  §31  =  §32  =  §33 
=  0 


As  described  previously  in  Chapter  5  on  the 
stratified  Cox  procedure,  a  no-interaction  stra¬ 
tified  Cox  model  is  not  appropriate  if  there 
is  interaction  between  the  stratified  variables 
and  the  predictor  variables  put  into  the 
model.  Thus,  it  is  necessary  to  assess  whether 
an  interaction  version  of  the  SC  model  is  more 
appropriate,  as  typically  carried  out  using  a 
likelihood  ratio  test. 

For  the  bladder  cancer  data,  we  show  at  the  left 
two  equivalent  versions  of  the  SC  interaction 
model.  The  first  version  separates  the  data  into 
4  separate  models,  one  for  each  stratum. 


The  second  version  contains  product  terms 
involving  the  stratified  variable  with  each  of 
the  3  predictors  in  the  model.  Because  there 
are  4  strata,  the  stratified  variable  is  defined 
using  3  dummy  variables  Z],  Z2,  and  Z\. 


The  null  hypotheses  for  the  LR  test  that  com¬ 
pares  the  interaction  with  the  no-interaction 
SC  model  is  shown  at  the  left  for  each  version. 
The  df  for  the  LR  test  is  9. 


Presentation:  IX.  Bladder  Cancer  Study  Example  385 


Interaction  SC  model  may  be  used 
regardless  of  LR  test  result 

•  Allows  separate  HRs  for  tx  for 
each  stratum 

•  if  no-interaction  SC,  then  only 
an  overall  effect  of  tx  can  be 
estimated 


Even  if  the  no-interaction  SC  model  is  found 
more  appropriate  from  the  likelihood  ratio 
test,  the  investigator  may  still  wish  to  use  the 
interaction  SC  model  in  order  to  obtain  and 
evaluate  different  hazard  ratios  for  each  stra¬ 
tum.  In  other  words,  if  the  no-interaction 
model  is  used,  it  is  not  possible  to  separate 
out  the  effects  of  predictors  (e.g.,  tx)  within 
each  stratum,  and  only  an  overall  effect  of  a 
predictor  on  survival  can  be  estimated. 


Recommend  using 

robust  estimation 


R 


(p)  =Var(p) 


R'sRs 


Varfp 


to  adjust  for  correlation  of  observa¬ 
tions  on  the  same  subject 


For  each  of  the  SC  alternative  approaches,  as 
for  the  CP  approach,  it  is  recommended  to  use 
robust  estimation  to  adjust  the  variances  of 
the  estimated  regression  coefficients  for  the 
correlation  of  observations  on  the  same  sub¬ 
ject.  The  general  form  for  the  robust  estimator 
is  the  same  as  in  the  CP  approach,  but  will  give 
different  numerical  results  because  of  the  dif¬ 
ferent  data  layouts  used  in  each  method. 


IX.  Bladder  Cancer  Study 
Example  (Continued) 


Table  8.10.  Estimated  (3s  and  HRs  for  tx 
from  Bladder  Cancer  Data 


Model 

P 

HR  =  exp(p) 

CP 

-0.407 

0.666  (=1/1.50) 

SCP 

-0.334 

0.716  (=1/1.40) 

GT 

-0.270 

0.763  (=1/1.31) 

M 

-0.580 

0.560  (=1/1.79) 

CP  =  Counting  Process, 

SCP  =  Stratified  CP 

GT  =  Gap  Time,  M  =  Marginal 


HR  for  M:  0.560  (=1/1.79) 
differs  from 

HRs  for  CP:  0.666  (=1/1.50), 
SCP:  0.716  (=1/1.40), 
GT:  0.763  (=1/1.31) 


We  now  present  and  compare  SAS  results  from 
using  all  four  methods  we  have  described  -  CP, 
Stratified  CP,  Gap  Time,  and  Marginal  -  for 
analyzing  the  recurrent  event  data  from  the 
bladder  cancer  study. 

Table  8.10  gives  the  regression  coefficients  for 
the  tx  variable  and  their  corresponding  hazard 
ratios  (i.e.,  exp(p)  for  the  no-interaction  Cox 
PH  models  using  these  four  approaches).  The 
model  used  for  the  CP  approach  is  a  standard 
Cox  PH  model  whereas  the  other  three  models 
are  SC  models  that  stratify  on  the  event  order. 


From  this  table,  we  can  see  that  the  hazard 
ratio  for  the  effect  of  the  exposure  variable 
tx  differs  somewhat  for  each  of  the  four 
approaches,  with  the  Marginal  model  giving  a 
much  different  result  from  that  obtained  from 
the  other  three  approaches. 
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Table  8.11  Estimated  Ps,  SE(P)s,  and  P- 
Values  for  tx  from  No-Interaction  Model 
for  Bladder  Cancer  Data 

Model  |3  SE(NR)  SE(R)  P(NR)  P(R) 

CP  -0.407  0.200  0.242  .042  .092 

SCP  -0.334  0.216  0.197  .122  .090 

GT  -0.270  0.208  00.208  .195  .194 

M  -0.580  0.201  0.303  .004  .056 

CP  =  Counting  Process,  SCP  =  Stratified  CP, 

GT  =  Gap  Time,  M  =  Marginal, 

NR  =  Nonrobust,  R  =  Robust,  P  =  Wald  P-value 

SE(NR)  differs  from  SE(R) 

P(NR)  differs  from  P(R) 
but  no  clear  pattern 

for  example, 

CP:  P(NR)  =  .042  <  P(R)  =  .092 
SCP:  P(NR)  =  .122  >  P(R)  =  .090 
GT:  P(NR)  =  .195  =  P(R)  =  .194 

Wald  test  statistic(s):  The  P-values  shown  in  Table  8.11  are  com- 

Z  —  B/SE(B)  Z2  —  IB/SEfB'll2  Puted  using  the  standard  Wald  test  Z  or  chi- 

square  statistic,  the  latter  having  a  chi-square 
~  N(0, 1)  under  H0:  P  =  0  ~  yA  df  distribution  with  1  df  under  the  null  hypothesis 

that  there  is  no  effect  of  tx. 


The  nonrobust  and  robust  standard  errors  and 
P-values  differ  to  some  extent  for  each  of  the 
different  approaches.  There  is  also  no  clear 
pattern  to  suggest  that  the  nonrobust  results 
will  always  be  either  higher  or  lower  than  the 
corresponding  robust  results. 


Table  8.11  provides,  again  for  the  exposure 
variable  tx  only,  the  regression  coefficients, 
robust  standard  errors,  nonrobust  standard 
errors,  and  corresponding  Wald  Statistic  P- 
values  obtained  from  using  the  no-interaction 
model  with  each  approach. 


Table  8.12  Estimated  Ps  and  Robust 
SE(P)s  for  tx  from  Interaction  SC  Model 
for  Bladder  Cancer  Data 


Interaction  SC  Model 

No 

Strl 

Str2 

Str3 

Str4 

Interaction 

ft 

ft 

ft 

ft 

p 

Model 

(SE) 

(SE) 

(SE) 

(SE) 

(SE) 

CP 

— 

— 

— 

— 

-.407 

(.242) 

SCP 

-.518 

-.459 

.117 

-.041 

-.334 

(.308) 

(.441) 

(.466) 

(.515) 

(.197) 

GT 

-.518 

-.259 

.221 

-.195 

-.270 

(.308) 

(.402) 

(.620) 

(.628) 

(.208) 

M 

-.518 

-.619 

-.700 

-.651 

-.580 

(.308) 

(.364) 

(.415) 

(.490) 

(.303) 

CP  =  Counting  Process,  SCP  =  Stratified  CP 
GT  =  Gap  Time,  M  =  Marginal 


Table  8.12  presents,  again  for  the  exposure 
variable  tx  only,  the  estimated  regression  coef¬ 
ficients  and  robust  standard  errors  for  both  the 
interaction  and  the  no-interaction  SC  models 
for  the  three  approaches  (other  than  the  CP 
approach)  that  use  a  SC  model. 

Notice  that  for  each  of  the  three  SC  modeling 
approaches,  the  estimated  (3s  and  corres¬ 
ponding  standard  errors  are  different  over  the 
four  strata  as  well  as  for  the  no-interaction 
model.  For  example,  using  the  Stratified  CP 
approach,  the  estimated  coefficients  are  —0.518, 
-  0.459,  -  0.117,  -  0.041,  and  -  0.334  for 
strata  1  through  4  and  the  no-interaction 
model,  respectively. 
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Version  1:  Interaction  SC  Model 

hg(t,X)  =  h0g(t)exp[pgtx 

+  Yig  num  +  y2g  size] 

g=  1,2,  3,4 


Such  differing  results  over  the  different  strata 
should  be  expected  because  they  result  from 
fitting  an  interaction  SC  model,  which  by  defi¬ 
nition  allows  for  different  regression  coeffi¬ 
cients  over  the  strata. 


Note:  subscript  g  allows  for  differ¬ 
ent  regression  coefficients  for  each 
stratum 


Stratified  CP  for  Subject  10  Notice  also  that  for  stratum  1,  the  estimated  (I 


id 

int 

event 

start 

stop  tx 

num 

size 

and  its  standard  error  are  identical  (-0.518 

10 

i 

i 

0 

12  0 

i 

1 

and  0.308,  resp.)  for  the  Stratified  CP,  Gap 

Gap  Time  for  Subject  10 

Time,  and  Marginal  approaches.  This  is  as 
expected  because,  as  illustrated  for  subject 
#10  at  the  left,  the  survival  time  information 

id 

int 

event 

start 

stop  tx 

num 

size 

for  first  stratum  is  the  same  for  stratum  1  for 

10 

i 

i 

0 

12  0 

i 

1 

the  three  SC  approaches  and  does  not  start  to 
differ  until  stratum  2. 

Marginal  Approach  for  Subject  10 
id  int  event  stime  tx  num 

size 

10 

i 

i 

12 

0 

i 

1 

Note:  int  =  stratum  # 


Marginal  approach 

start  time  =  0  always 
stop  time  =  stime 

Subject  #10:  (start,  stop)  =  (0,  12) 


Although  the  data  layout  for  the  marginal 
approach  does  not  require  (start, stop)  columns, 
the  start  time  for  the  first  stratum  (and  all  other 
strata)  is  0  and  the  stop  time  is  given  in  the  stime 
column.  In  other  words,  for  stratum  1  on  subject 
#10,  the  stop  time  is  0  and  the  start  time  is  12, 
which  is  the  same  as  for  the  Stratified  CP  and 
Gap  Time  data  for  this  subject. 


Bladder  Cancer  Study  So,  based  on  all  the  information  we  have 

provided  above  concerning  the  analysis  of  the 

1 .  Which  approach  is  best?  bladder  cancer  study, 

2.  Conclusion  about  tx? 

1 .  Which  of  the  four  recurrent  event  analysis 
approaches  is  best? 

2.  What  do  we  conclude  about  the  estimated 
effect  of  tx  controlling  for  num  and  size? 
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Which  of  the  4  approaches  is  best? 

It  depends! 

CP:  Don’t  want  to  distinguish 
recurrent  event  order 
Want  overall  effect 


If  event  order  important: 

Choose  from  the  3  SC  approaches. 


Stratified  CP:  time  of  recurrent 
event  from  entry 
into  the  study 


Gap  Time:  Use  time  from 

previous  event  to 
next  recurrent  event 


Marginal:  Consider  strata  as 

representing  different 
event  types 


Stratified  CP  versus  Marginal 
(subtle  choice) 

Recommend:  Choose  Stratified 
CP  unless  strata 
represent  different 
event  types 

What  do  we  conclude  about  tx? 

Conclusions  based  on  results  from 

CP  and  Stratified  CP  approaches 


The  answer  to  question  1  is  probably  best 
phrased  as,  "It  depends!”  Nevertheless,  if  the 
investigator  does  not  want  to  distinguish 
between  recurrent  events  on  the  same  subject 
and  wishes  an  overall  conclusion  about  the 
effect  of  tx,  then  the  CP  approach  seems  quite 
appropriate,  as  for  this  study. 

If,  however,  the  investigator  wants  to  distin¬ 
guish  the  effects  of  tx  according  to  the  order 
that  the  event  occurs  (i.e.,  by  stratum  #),  then 
one  of  the  three  SC  approaches  should  be  pre¬ 
ferred.  So,  which  one? 

The  Stratified  CP  approach  is  preferred  if  the 
study  goal  is  to  use  time  of  occurrence  of  each 
recurrent  event  from  entry  into  the  study  to 
assess  a  subject’s  risk  for  an  event  of  a  specific 
order  (i.e.,  as  defined  by  a  stratum  #)  to  occur. 

The  Gap  Time  approach  would  be  preferred  if 
the  time  interval  of  interest  is  the  time  (reset 
from  0)  from  the  previous  event  to  the  next 
recurrent  event  rather  than  time  from  study 
entry  until  each  recurrent  event. 

Finally,  the  Marginal  approach  is  recom¬ 
mended  if  the  investigator  wants  to  consider 
the  events  occurring  at  different  orders  as  dif¬ 
ferent  types  of  events,  for  example  different 
disease  conditions. 

We  (the  authors)  consider  the  choice  between 
the  Stratified  CP  and  Marginal  approaches 
to  be  quite  subtle.  We  prefer  Stratified  CP, 
provided  the  different  strata  do  not  clearly  rep¬ 
resent  different  event  types.  If,  however,  the 
strata  clearly  indicate  separate  event  processes, 
we  would  recommend  the  Marginal  approach. 

Overall,  based  on  the  above  discussion,  we 
think  that  the  CP  approach  is  an  acceptable 
method  to  use  for  analyzing  the  bladder  cancer 
data.  If  we  had  to  choose  one  of  the  three  SC 
approaches  as  an  alternative,  we  would  choose 
the  Stratified  CP  approach,  particularly 
because  the  order  of  recurrent  events  that 
define  the  strata  doesn’t  clearly  distinguish 
separate  disease  processes. 
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Table  8.13.  Comparison  of  Results 
Obtained  from  No-Interaction  Models 
Across  Two  Methods  for  Bladder  Cancer 
Data 


Counting 

process 

Stratified 

CP 

Parameter 

-0.407 

-0.334 

estimate 

Robust 

0.2418 

0.1971 

standard 

error 

Wald  chi-square 

2.8338 

2.8777 

p-value 

0.0923 

0.0898 

Hazard  ratio 

0.667 

0.716 

95%  confidence 

(0.414, 

(0.486, 

interval 

1.069) 

1.053) 

Table  8.13  summarizes  the  results  for  the  CP 
and  Stratified  CP  approaches  with  regard  to 
the  effect  of  the  treatment  variable  (tx), 
adjusted  for  the  control  variables  num  and 
size.  We  report  results  only  for  the  no-interac- 
tion  models,  because  the  interaction  SC  model 
for  the  Stratified  CP  approach  was  found 
(using  LR  test)  to  be  not  significantly  different 
from  the  no-interaction  model. 

The  results  are  quite  similar  for  the  two  different 
approaches.  There  appears  to  be  a  small  effect 
of  tx  on  survival  from  bladder  cancer:  HR(CP)  = 
0.667  =  1/1.50,  HR(C1)  =  0.716=  1/1.40.  This 
effect  is  borderline  nonsignificant  (2-sided 
tests):  P(CP)  =  .09  =  P(SCP).  95%  confidence 
intervals  around  the  hazard  ratios  are  quite 
wide,  indicating  an  imprecise  estimate  of  effect. 


Overall,  therefore,  these  results  indicate  that 
there  is  no  strong  evidence  that  tx  is  effective 
(after  controlling  for  num  and  size)  based  on 
recurrent  event  survival  analyses  of  the  bladder 
cancer  data. 


X.  A  Parametric  Approach 
Using  Shared  Frailty 


Compared  4  approaches  in  previ¬ 
ous  section 

•  Each  used  a  Cox  model 

•  Robust  standard  errors 

°  Adjusts  for  correlation  from 
same  subject 


In  the  previous  section  we  compared  results 
obtained  from  using  four  analytic  approaches 
on  the  recurrent  event  data  from  the  bladder 
cancer  study.  Each  of  these  approaches  used  a 
Cox  model.  Robust  standard  errors  were 
included  to  adjust  for  the  correlation  among 
outcomes  from  the  same  subject. 


We  now  present  a  parametric 
approach 

•  Weibull  PH  model 

•  Gamma  shared  frailty 
component 

•  Bladder  Cancer  dataset 

°  Data  layout  for  the  counting 
process  approach 

Can  review  Chapter  7 
Weibull  model  (Section  VI) 

Frailty  models  (Section  XII) 


In  this  section  we  present  a  parametric 
approach  for  analyzing  recurrent  event  data 
that  includes  a  frailty  component.  Specifically, 
a  Weibull  PH  model  with  a  gamma  distributed 
shared  frailty  component  is  shown  using  the 
Bladder  Cancer  dataset.  The  data  layout  is  the 
same  as  described  for  the  counting  process 
approach.  It  is  recommended  that  the  reader 
first  review  Chapter  7,  particularly  the  sections 
on  Weibull  models  (Section  VI)  and  frailty 
models  (Section  XII). 
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Hazard  conditioned  on  frailty  rJy 


hi(t|a,Xj)  =  tXjh(t|Xi) 


where  a  ~  gammafp  =  1,  var  =  0) 
and  where  h(t|Xi)  =  X;  ptv~ 1 
(Weibull)  with  X^  =  exp((!0  + 
pitXj  +  p2numi  +  p3sizei) 

Including  shared  frailty 

•  Accounts  for  unobserved 
factors 

°  Subject  specific 
o  Source  of  correlation 
o  Observations  clustered  by 
subject 


Robust  standard  errors 

•  Adjusts  standard  errors 

•  Does  not  affect  coefficient 
estimates 


Shared  frailty 


•  Built  into  model 

•  Can  affect  coefficient  estimates 
and  their  standard  errors 


Weibull  regression  (PH  form) 
Gamma  shared  frailty 
Log  likelihood  =  —184.73658 


_t 

Coef. 

Std.  Err. 

z 

P>M 

tx 

-.458 

.268 

-1.71 

0.011 

num 

.184 

.072 

2.55 

0.327 

size 

-.031 

.091 

-0.34 

0.730 

_cons 

-2.952 

.417 

-7.07 

0.000 

/ln_p 

-.119 

.090 

-1.33 

0.184 

/ln_the 

-.725 

.516 

-1.40 

0.160 

P 

.888 

.080 

1/p 

1.13 

.101 

theta 

.484 

.250 

Likelihood  ratio  test  of  theta  =  0: 

chibar(Ol)  =  7.34 

Prob  >  =  chibar2  =  0.003 


We  define  the  model  in  terms  of  the  hazard  of 
any  (recurrent)  outcome  on  the  ith  subject  con¬ 
ditional  on  his  or  her  frailty  a,-.  The  frailty  is  a 
multiplicative  random  effect  on  the  hazard 
function  h(t|Xj),  assumed  to  follow  a  gamma 
distribution  of  mean  1  and  variance  theta  0. 
We  assume  h(t|X;)  follows  a  Weibull  distribu¬ 
tion  (shown  at  left). 

The  frailty  is  included  in  the  model  to  account 
for  variability  due  to  unobserved  subject-specific 
factors  that  are  otherwise  unaccounted  for  by 
the  other  predictors  in  the  model.  These  unob¬ 
served  subject-specific  factors  can  be  a  source 
of  wi thin-subject  correlation.  We  use  the  term 
shared  frailty  to  indicate  that  observations 
are  clustered  by  subject  and  each  cluster  (i.e., 
subject)  shares  the  same  level  of  frailty. 

In  the  previous  sections,  we  have  used  robust 
variance  estimators  to  adjust  the  standard 
errors  of  the  coefficient  estimates  to  account 
for  within-subject  correlation.  Shared  frailty  is 
not  only  an  adjustment,  but  also  is  built  into 
the  model  and  can  have  an  impact  on  the  esti¬ 
mated  coefficients  as  well  as  their  standard 
errors. 

The  model  output  (obtained  using  Stata 
version  10)  is  shown  on  the  left.  The  inclusion 
of  frailty  in  a  model  (shared  or  unshared)  leads 
to  one  additional  parameter  estimate  in  the 
output  (theta,  the  variance  of  the  frailty).  A  like¬ 
lihood  ratio  test  for  theta  =  0  yields  a  statisti¬ 
cally  significant  p-value  of  0.003  (bottom  of 
output)  suggesting  that  the  frailty  component 
contributes  to  the  model  and  that  there  is 
within-subject  correlation. 

The  estimate  for  the  Weibull  shape  parameter;? 
is  0.888  suggesting  a  slightly  decreasing  hazard 
over  time  because  p<  1.  However,  the  Wald 
test  for  ln(p)  =  0  (or  equivalently  p  =1)  yields 
a  non-significant  p-value  of  0.184. 
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Comparing  Hazard  Ratios 

Weibull  with  frailty  model 
HR(  tx)  =  exp(-0.458)  =  0.633 
95% Cl  =  exp[— 0.458  ±  1.96(0.268)] 
=  (0.374,  1.070) 

Counting  processes  approach  with 
Cox  model 

HR( tx)  :  exp(— 0.407)  =  0.667 

95%  Cl  for  HR  tx  (robust):  (0.414, 
1.069) 


An  estimated  hazard  ratio  of  0.633  for  the  effect 
of  treatment  comparing  two  individuals  with 
the  same  level  of  frailty  and  controlling  for  the 
other  covariates  is  obtained  by  exponentiat¬ 
ing  the  estimated  coefficient  (-0.458)  for  tx. 
The  estimated  hazard  ratio  and  95%  confi¬ 
dence  intervals  are  similar  to  the  corresponding 
results  obtained  using  a  counting  processes 
approach  with  a  Cox  model  and  robust  standard 
errors  (see  left). 


Interpretations  of  HR  from  frailty 
model 

•  Compares  2  individuals  with 
same  a 

•  Compares  individual  with 
himself 

°  What  is  effect  if  individual 
had  used  treatment  rather 
than  placebo? 


Another  interpretation  for  the  estimated  hazard 
ratio  from  the  frailty  model  involves  the  com¬ 
parison  of  an  individual  to  himself.  In  other 
words,  this  hazard  ratio  describes  the  effect  on 
an  individual’s  hazard  (i.e.,  conditional  hazard) 
if  that  individual  had  used  the  treatment  rather 
than  the  placebo. 


XI.  A  Second  Example  We  now  illustrate  the  analysis  of  recurrent  event 

survival  data  using  a  new  example.  We  con¬ 
sider  a  subset  of  data  from  the  Age-Related 
Eye  Disease  Study  (AREDS),  a  long-term  multi¬ 
center,  prospective  study  sponsored  by  the  U.S. 
National  Eye  Institute  of  the  clinical  course  of 
age-related  macular  degeneration  (AMD)  (see 
AREDS  Research  Group,  2003). 


In  addition  to  collecting  natural  history  data, 
AREDS  included  a  clinical  trial  to  evaluate  the 
effect  of  high  doses  of  antioxidants  and  zinc  on 
the  progression  of  AMD.  The  data  subset  we 
consider  consists  of  43  patients  who  experi¬ 
enced  ocular  events  while  followed  for  their 
baseline  condition,  macular  degeneration. 


Age-Related  Eye  Disease 
Study  (AREDS) 

Outcome 

Age-related  macular 
degeneration  (AMD) 

Clinical  trial 

Evaluate  effect  of  treatment 
with  high  doses  of  antioxidants 
and  zinc  on  progression  of  AMD 

n  =  43  (subset  of  data  analyzed 
here) 
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Exposure 

tx  =  1  if  treatment,  0  if  placebo 
8  years  of  follow-up 

Two  possible  events 

First  event:  visual  acuity  score 
<50  (i.e.,  poor 
vision) 


Second  event:  clinically 
advanced  severe  stage  of 
macular  degeneration 


4  approaches  for  analyzing 
recurrent  event  survival  data 
carried  out  on  macular 
degeneration  data 

Each  model  contains  tx,  age, 
and  sex. 

CP  model 

h(t,X)  =  h0(t)exp[p  tx  +  yj  age 
+  Y2  sex] 

No-interaction  SC  model 

hg(t,X)  =  h0g(t)exp[p  tx  +  yi  age 
+  y2  sex] 

where  g  =  1 ,  2 

Interaction  SC  model: 

hg(t,X)  =  h0g(t)exp[pg  tx  +  Yig 

age 

+  Y2g  sex] 

where  g  =  1 ,  2 


The  exposure  variable  of  interest  was  treat¬ 
ment  group  (tx),  which  was  coded  as  1  for 
patients  randomly  allocated  to  an  oral  combi¬ 
nation  of  antioxidants,  zinc,  and  vitamin  C  ver¬ 
sus  0  for  patients  allocated  to  a  placebo. 
Patients  were  followed  for  8  years. 

Each  patient  could  possibly  experience  two 
events.  The  first  event  was  defined  as  the  sud¬ 
den  decrease  in  visual  acuity  score  below  50 
measured  at  scheduled  appointment  times. 
Visual  acuity  score  was  defined  as  the  number 
of  letters  read  on  a  standardized  visual  acuity 
chart  at  a  distance  of  4  m,  where  the  higher  the 
score,  the  better  the  vision. 

The  second  event  was  considered  a  successive 
stage  of  the  first  event  and  defined  as  a  clini¬ 
cally  advanced  and  severe  stage  of  macular 
degeneration.  Thus,  the  subject  had  to  experi¬ 
ence  the  first  event  before  he  or  she  could  expe¬ 
rience  the  second  event. 

We  now  describe  the  results  of  using  the  four 
approaches  for  analyzing  recurrent  event  sur¬ 
vival  with  these  data.  In  each  analysis,  two  cov¬ 
ariates  age  and  sex  were  controlled,  so  that  each 
model  contained  the  variables  tx,  age,  and  sex. 


The  counting  process  (CP)  model  is  shown 
here  at  the  left  together  with  both  the  no-inter¬ 
action  and  interaction  SC  models  used  for  the 
three  stratified  Cox  (SC)  approaches. 
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Table  8.14  Comparison  of  Parameter 
Estimates  and  Robust  Standard 
Errors  for  Treatment  Variable  (tx) 
Controlling  for  Age  and  Sex  (Macular 
Degeneration  Data) 


“Interaction”  Cox 
stratified  model 

“No- 

interaction” 
SC  model 

Model 

Stratum  1 

Stratum 

Pi  (SE) 

fe  (SE) 

P3  (SE) 

Counting 

n/a 

n/a 

-0.174 

process 

(0.104) 

SCP 

-0.055 

-0.955 

-0.306 

(0.286) 

(0.443) 

(0.253) 

GT 

-0.055 

-1.185 

-0.339 

(0.286) 

(0.555) 

(0.245) 

Marginal 

-0.055 

-0.861 

-0.299 

(0.286) 

(0.465) 

(0.290) 

Interaction  SC  models  are  pre¬ 
ferred  (based  on  LR  test  results) 
to  use  of  no-interaction  SC 
model 


Table  8.15.  Comparison  of  Results  for 
the  Treatment  Variable  (tx)  Obtained 
for  Stratified  CP  and  Marginal 
Approaches  (Macular  Degeneration 
Data) 


Stratified  CP 

Marginal 

Estimate 

Pi 

-0.0555 

-0.0555 

fc 

-0.9551 

-0.8615 

P 

-0.306 

-0.2989 

Robust 

SE(P,) 

0.2857 

0.2857 

std. 

SE(p2) 

0.4434 

0.4653 

error 

SE(P) 

0.2534 

0.2902 

Wald 

H0:Pi  =  0 

0.0378 

0.0378 

chi- 

H0:p2  =  0 

4.6395 

3.4281 

square 

H0:P  =  0 

1.4569 

1.0609 

P-value 

H0:Pi  =  0 

0.8458 

0.8478 

H0:p2  =  0 

0.0312 

0.0641 

H0:P  =  0 

0.2274 

0.3030 

Hazard 

exp(Pi) 

0.946 

0.946 

ratio 

exp(p2) 

0.385 

0.423 

exp(P) 

0.736 

0.742 

95%  Conf. 

exp(Pd 

(0.540,  1.656) 

(0.540,  1.656) 

interval 

exp(p2) 

(0.161,  0.918) 

(0.170,  1.052) 

exp(p) 

(0.448,  1.210) 

(0.420,  1.310) 

In  Table  8.14,  we  compare  the  coefficient  esti¬ 
mates  and  their  robust  standard  errors  for  the 
treatment  variable  (tx)  from  all  four  approaches. 
This  table  shows  results  for  both  the  “interac¬ 
tion”  and  “nointeraction”  stratified  Cox  models 
for  the  three  approaches  other  than  the  counting 
process  approach. 

Notice  that  the  estimated  coefficients  for  (), 
and  their  corresponding  standard  errors  are 
identical  for  the  three  SC  approaches.  This 
will  always  be  the  case  for  the  first  stratum 
regardless  of  the  data  set  being  considered. 

The  estimated  coefficients  for  |)2  are,  as 
expected,  somewhat  different  for  the  three  SC 
approaches.  We  return  to  these  results  shortly. 

LR  tests  for  comparing  the  “no-interaction” 
with  the  “interaction”  SC  models  were  signifi¬ 
cant  (P  <  .0001)  for  all  three  SC  approaches 
(details  not  shown),  indicating  that  an  inter¬ 
action  model  was  more  appropriate  than  a 
no-interaction  model  for  each  approach. 

In  Table  8.15,  we  summarize  the  statistical 
inference  results  for  the  effect  of  the  treatment 
variable  (tx)  for  the  Stratified  CP  and 
Marginal  approaches  only. 

We  have  not  included  the  CP  results  here 
because  the  two  events  being  considered  are  of 
very  different  types,  particularly  regarding 
severity  of  illness,  whereas  the  CP  approach 
treats  both  events  as  identical  replications.  We 
have  not  considered  the  Gap  Time  approach 
because  the  investigators  were  more  likely  inter¬ 
ested  in  survival  time  from  baseline  entry  into 
the  study  than  the  survival  time  “gap"  from  the 
first  to  second  event. 

Because  we  previously  pointed  out  that  the 
interaction  SC  model  was  found  to  be  signi¬ 
ficant  when  compared  to  the  corresponding 
no-interaction  SC  model,  we  focus  here  on  the 
treatment  (tx)  effect  for  each  stratum  (i.e., 
event)  separately. 
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First  event: 

SCP 

Marginal 

HR 

0.946 

0.946 

p-value 

0.85 

0.85 

Second 

event: 

SCP 

Marginal 

HR 

0.385 

0.423 

p-value 

0.03 

0.06 

95%  Cl 

(0.16,  0.92) 

(0.17,  1.05) 

Conclusions  regarding  1st  event: 

•  No  treatment  effect 

•  Same  for  Stratified  CP  and 
Marginal  approaches 


Conclusions  regarding  2nd  event: 

•  Clinically  moderate  and 
statistically  significant 
treatment  effect 

•  Similar  for  Stratified  CP 
and  Marginal  approaches, 
but  more  support  from 
Stratified  CP  approach 

Comparison  of  Stratified  CP 
with  Marginal  Approach 

What  if  results  had  been 
different? 


Based  on  the  Wald  statistics  and  corresponding 
P-values  for  testing  the  effect  of  the  treatment 
on  survival  to  the  first  event  (i.e.,  H0:  pi  =  0), 
both  the  Stratified  CP  and  Marginal  appro¬ 
aches  give  the  identical  result  that  the  esti¬ 
mated  treatment  effect  (HR  =  0.946  =  1/1.06)  is 
neither  meaningful  nor  significant  (P  =  0.85). 

For  the  second  event,  indicating  a  clinically 
severe  stage  of  macular  degeneration,  the 
Wald  statistic  P-value  for  the  Stratified  CP 
approach  is  0.03,  which  is  significant  at  the 
.05  level,  whereas  the  corresponding  P-value 
for  the  Marginal  approach  is  0.06,  border¬ 
line  nonsignificant  at  the  .05  level. 

The  estimated  HR  for  the  effect  of  the  treatment 
is  (HR  =  0.385  =  1/2.60)  using  the  Stratified  CP 
approach  and  its  95%  confidence  interval  is 
quite  wide  but  does  not  contain  the  null  value 
of  1 .  For  the  Marginal  approach,  the  estimated 
HR  is  HR  =  0.423  =  1  /2.36,  also  with  a  wide 
confidence  interval,  but  includes  1 . 

Thus,  based  on  the  above  results,  there  appears 
to  be  no  effect  of  treating  patients  with  high 
doses  of  antioxidants  and  zinc  on  reducing 
visual  acuity  score  below  50  (i.e.,  the  first  event) 
based  on  either  Stratified  CP  or  Marginal 
approaches  to  the  analysis. 

However,  there  is  evidence  of  a  clinically 
moderate  and  statistically  significant  effect  of 
the  treatment  on  protection  (i.e.,  not  failing) 
from  the  second  more  severe  event  of  macular 
degeneration.  This  conclusion  is  more  sup¬ 
ported  from  the  Stratified  CP  analysis  than 
from  the  Marginal  analysis. 


Despite  similar  conclusions  from  both  appro¬ 
aches,  it  still  remains  to  compare  the  two 
approaches  for  these  data.  In  fact,  if  the  results 
from  each  approach  had  been  very  different,  it 
would  be  important  to  make  a  choice  between 
these  approaches. 
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Recommend  Stratified  CP  if 

Can  assume  2nd  event  cannot 
occur  without  1st  event 
previously  occurring 

Should  consider  survival  time  to 
2nd  event  conditional  on 

experiencing  1st  event 

Recommend  Marginal  if 

Can  assume  each  subject  at  risk 
for  2nd  event  whether  or  not 
1st  event  previously  occurred 

2nd  event  considered  a  separate 
event,  that  is,  unconditional 

of  the  1st  event 

If 

Should  consider  survival  times 
to  2nd  event  for  all  subjects 

Macular  degeneration  data: 
recommend  Marginal  approach 

In  general:  carefully  consider 
interpretation  of  each  approach 


Nevertheless,  we  authors  find  it  difficult  to 
make  such  a  decision,  even  for  this  example. 
The  Stratified  CP  approach  would  seem 
appropriate  if  the  investigators  assumed  that 
the  second  event  cannot  occur  without  the  first 
event  previously  occurring.  If  so,  it  would  be 
important  to  consider  survival  time  to  the  sec¬ 
ond  event  only  for  (i.e.,  conditional  on)  those 
subjects  who  experience  a  first  event. 


On  the  other  hand,  the  Marginal  approach 
would  seem  appropriate  if  each  subject  is  con¬ 
sidered  to  be  at  risk  for  the  second  event 
whether  or  not  the  subject  experiences  the 
first  event.  The  second  event  is  therefore  con¬ 
sidered  separate  from  (i.e.,  unconditional  of) 
the  first  event,  so  that  survival  times  to  the 
second  event  need  to  be  included  for  all  sub¬ 
jects,  as  in  the  Marginal  approach. 


For  the  macular  degeneration  data  example,  we 
find  the  Marginal  approach  persuasive.  How¬ 
ever,  in  general,  the  choice  among  all  four 
approaches  is  not  often  clear-cut  and  requires 
careful  consideration  of  the  different  interpreta¬ 
tions  that  can  be  drawn  from  each  approach. 


XII.  Survival  Curves  with 
Recurrent  Events 

Goal:  Plot  and  Interpret 
Survival  Curves 

Types  of  survival  curves: 

KM  (empirical):  Chapter  2 
Adjusted  (Cox  PH):  Chapters  3  and  4 


An  important  goal  of  most  survival  analyses, 
whether  or  not  a  regression  model  (e.g.,  Cox 
PH)  is  involved,  is  to  plot  and  interpret/compare 
survival  curves  for  different  groups.  We  have 
previously  described  the  Kaplan-Meier  (KM) 
approach  for  plotting  empirical  survival  curves 
(Chapter  2)  and  we  have  also  described  how  to 
obtain  adjusted  survival  curves  for  Cox  PH 
models  (Chapters  3  and  4). 


Previously:  1  (nonrecurrent)  event  This  previous  discussion  only  considered  sur- 
how:  vival  data  for  the  occurrence  of  one  (nonre- 

Survival  plots  with  recurrent  events?  current)  event.  So,  how  does  one  obtain 

survival  plots  when  there  are  recurrent  events? 
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Focus  on  one  ordered  event  at  a  time 

Si(t):  1st  event 
S2(t):  2nd  event 

Sk(t):  kth  event 


The  answer  is  that  survival  plots  with  recurrent 
events  only  make  sense  when  the  focus  is  on 
one  ordered  event  at  a  time.  That  is,  we  can  plot 
a  survival  curve  for  survival  to  a  first  event, 
survival  to  a  second  event,  and  so  on. 


Survival  to  a  1st  event 

Sjft)  =  Pr(T,  >  t) 
where 

T i  =  survival  time  up  to  occurrence 
of  1st  event 

(ignores  later  recurrent  events) 


For  survival  to  a  first  event,  the  survival  curve 
describes  the  probability  that  a  subject’s  time 
to  occurrence  of  a  first  event  will  exceed  a  spe¬ 
cified  time.  Such  a  plot  essentially  ignores  any 
recurrent  events  that  a  subject  may  have  after  a 
first  event. 


Survival  to  a  2nd  event  For  survival  to  a  second  event,  the  survival 

curve  describes  the  probability  that  a  subject’s 
S2(t)  =  Pr(T2  >  t)  time  to  occurrence  of  a  second  event  will 

exceed  a  specified  time. 

where 

T2  =  survival  time  up  to  occurrence 
of  2nd  event 


Two  versions 


There  are  two  possible  versions  for  such  a  plot. 


Stratified: 

T2c  =  time  from  1st  event  to  2nd 
event,  restricting  data  to  1st 
event  subjects 

Marginal: 

T2m  =  time  from  study  entry  to  2nd 
event,  ignoring  1st  event 


Stratified:  use  survival  time  from  time  of  first 
event  until  occurrence  of  second  event,  thus 
restricting  the  dataset  to  only  those  subjects 
who  experienced  a  first  event. 

Marginal:  use  survival  time  from  study  entry 
to  occurrence  of  second  event,  ignoring 
whether  a  first  event  occurred. 


Survival  to  a  kth  event  (k  >  2)  Similarly,  for  survival  to  the  kth  event,  the 

survival  curve  describes  the  probability  that  a 
Sk(t)  =  Pr(Tk  >  t)  subject’s  time  to  occurrence  of  the  kth  event 

will  exceed  a  specified  time. 

where 

Tk  =  survival  time  up  to  occurrence 
of  kth  event 


Two  versions  As  with  survival  to  the  second  event,  there  are 

two  possible  versions,  Stratified  or  Marginal, 
Stratified:  for  such  a  plot,  as  stated  on  the  left. 

Tkc  =  time  from  the  k  —  1st  to  kth 
event,  restricting  data  to 
subjects  with  k  —  1  events 
Marginal: 

Tkm  =  time  from  study  entry  to  kth 
event,  ignoring  previous 
events 
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Deriving  Sj(t):  Stratum  1 
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Deriving  S2C(t):  Stratum  2 
(Stratified  GT) 
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Deriving 

S2m(t):  Stratum  2  (Marginal) 
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Survival  Plots  for  Molly,  Holly  and 
Polly  Recurrent  Event  Data  (n  =3) 
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We  now  illustrate  such  survival  plots  for  recur¬ 
rent  event  data  by  returning  to  the  small  data¬ 
set  previously  described  for  three  subjects 
Molly  (M),  Holly  (H),  and  Polly  (P),  shown 
again  on  the  left. 


The  survival  plot  for  survival  to  the  first  event 
St(t)  is  derived  from  the  stratum  1  data  layout 
for  any  of  the  three  alternative  SC  analysis 
approaches.  Recall  that  mf  and  qf  denote  the 
number  of  failures  and  censored  observations 
at  time  t(f).  The  survival  probabilities  in  the  last 
column  use  the  KM  product  limit  formula. 

The  Stratified  survival  plot  for  survival  to  the 
second  event  is  derived  from  the  stratum  2  data 
layout  for  the  Gap  Time  approach.  We  denote 
this  survival  curve  as  S2c(t).  Notice  that  the 
survival  probabilities  here  are  identical  to 
those  in  the  previous  table;  however,  the  failure 
times  t(f)  in  each  table  are  different. 

The  Marginal  survival  plot  for  survival  to  the 
second  event  is  derived  from  the  stratum  2  data 
layout  for  the  Marginal  approach.  We  denote 
this  survival  curve  as  S2m(t).  Again,  the  last 
column  here  is  identical  to  those  in  the  previ¬ 
ous  two  tables,  but,  once  again,  the  failure 
times  t(f)  in  each  table  are  different. 


The  survival  plots  that  correspond  to  the  above 
three  data  layouts  are  shown  in  Figures  8. 1  to  8.3. 

Figure  8.1  shows  survival  probabilities  for  the 
first  event,  ignoring  later  events.  The  risk  set  at 
time  zero  contains  all  three  subjects.  The  plot 
drops  from  Si(t)  =  1  to  Si(t)  =  0.67  at  t  =  20, 
drops  again  to  Si (t)  =  0.33  at  t  =  30  and  falls  to 
Si(t)  =  0  at  t  =  100  when  the  latest  first  event 


Figure  8.1.  Si (t):  Survival  to  1st  Event 


occurs. 
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Figure  8.2.  S2c(t):  Survival  to  2nd  Event 
(Stratified  GT) 


Figure  8.2  shows  Stratified  GT  survival  prob¬ 
abilities  for  the  second  event  using  survival 
time  from  the  first  event  to  the  second 
event.  Because  all  three  subjects  had  a  first 
event,  the  risk  set  at  time  zero  once  again  con¬ 
tains  all  three  subjects.  Also,  the  survival  prob¬ 
abilities  of  1,  0.67,  0.33,  and  0  are  the  same  as 
in  Figure  8.1.  Nevertheless,  this  plot  differs 
from  the  previous  plot  because  the  survival 
probabilities  are  plotted  at  different  survival 
times  (t  =  5,  20,  40  in  Figure  8.2  instead  of  t  = 
20,  30,  100  in  Figure  8.1) 
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Figure  8.3  shows  Marginal  survival  probabil¬ 
ities  for  the  second  event  using  survival  time 
from  study  entry  to  the  second  event,  ignor¬ 
ing  the  first  event.  The  survival  probabilities 
of  1,  0.67,  0.33,  and  0  are  once  again  the  same 
as  in  Figures  8.1  and  8.2.  Nevertheless,  this  plot 
differs  from  the  previous  two  plots  because  the 
survival  probabilities  are  plotted  at  different 
survival  times  (t  =  50,  60,  105  in  Figure  8.3). 


Figure  8.3.  S2m(t):  Survival  to  2nd  Event 
(Marginal) 


XIII.  Summary 


We  have  described  four  approaches  for  analyz¬ 
ing  recurrent  event  survival  data. 


4  approaches  for  recurrent  event 
data 

Counting  process  (CP), 

Stratified  CP,  Gap  Time, 
Marginal 

The  4  approaches 


These  approaches  differ  in  how  the  risk  set  is 
determined  and  in  data  layout.  All  four 
approaches  involve  using  a  standard  computer 
program  that  fits  a  Cox  PFI  model,  with  the 
latter  three  approaches  requiring  a  stratified 
Cox  model,  stratified  by  the  different  events 
that  occur. 


•  Differ  in  how  risk  set  is 
determined 

•  Differ  in  data  layout 

•  All  involve  standard  Cox  model 
program 

•  Latter  three  approaches  use  a 
SC  model 


Identical  recurrent  events 

11 

CP  approach 


The  approach  to  analysis  typically  used  when 
recurrent  events  are  treated  as  identical  is 
called  the  CP  Approach. 
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Recurrent  events:  different  disease 
categories  or  event  order  important 

Stratified  Cox  (SC)  approaches 

CP  approach:  Start  and  Stop  times 

Standard  layout:  only  Stop  (sur¬ 
vival)  times  (no  recurrent  events) 


Stratified  CP:  same  Start  and  Stop 

Times  as  CP,  but 
uses  SC  model 


Gap  Time:  Start  and  Stop 

Times 

Start  =  0  always 
Stop  =  time  since 
previous 
event 
SC  model 

Marginal  approach: 

Standard  layout  (nonrecurrent 
event),  that  is,  without  (Start, 
Stop)  columns 

Each  failure  is  a  separate  process 

Recommend  using  robust  estima¬ 
tion  to  adjust  for  correlation  of 
observations  on  the  same  subject. 


Application  1 :  Bladder  Cancer 
study 
n  =  86 

64  months  of 
follow-up 


When  recurrent  events  involve  different  disease 
categories  and/or  the  order  of  events  is  consid¬ 
ered  important,  the  analysis  requires  choosing 
among  the  three  alternative  SC  approaches. 

The  data  layout  for  the  counting  process 
approach  requires  each  subject  to  have  a  line 
of  data  for  each  recurrent  event  and  lists  the 
start  time  and  stop  time  of  the  interval  of  fol¬ 
low-up.  This  contrasts  with  the  standard  layout 
for  data  with  no  recurrent  events,  which  lists 
only  the  stop  (survival)  time  on  a  single  line  of 
data  for  each  subject. 

The  Stratified  CP  approach  uses  the  exact 
same  (start,  stop)  data  layout  format  used  for 
the  CP  approach,  except  that  for  Stratified  CP, 
the  model  used  is  a  SC  PH  model  rather  than 
an  unstratified  PH  model. 

The  Gap  Time  approach  also  uses  a  (start, 
stop)  data  layout,  but  the  start  value  is  always 
0  and  the  stop  value  is  the  time  interval  length 
since  the  previous  event.  The  model  here  is  also 
a  SC  model. 


The  Marginal  approach  uses  the  standard 
(nonrecurrent  event)  data  layout  instead  of 
the  (start,  stop)  layout.  The  basic  idea  behind 
the  Marginal  approach  is  that  it  allows  each 
failure  to  be  considered  as  a  separate  process. 


For  each  of  the  SC  alternative  approaches,  as  for 
the  CP  approach,  it  is  recommended  to  use 
robust  estimation  to  adjust  the  variances  of 
the  estimated  regression  coefficients  for  the  cor¬ 
relation  of  observations  on  the  same  subject. 

We  considered  two  applications  of  the  different 
approaches  described  above.  First,  we  com¬ 
pared  results  from  using  all  four  methods  to 
analyze  data  from  a  study  of  bladder  cancer 
involving  86  patients,  each  followed  for  a  vari¬ 
able  time  up  to  64  months. 
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Repeated  event:  recurrence  of 
bladder  cancer 
tumor;  up  to 
4  events 


tx  =  1  if  thiotepa,  0  if  placebo 
num  =  initial  #  of  tumors 
size  =  initial  size  of  tumors 


CP  results:  no  strong  evidence  for  tx 
(HR  =  0.67,  P  =  .09, 

95%  Cl:  0.414,  1.069) 


Alternative  parametric  approach 

•  Weibull  PH  model 

•  Gamma  shared  frailty 
component 

•  Bladder  cancer  dataset 

•  Similar  HR  and  confidence 
interval  as  for  counting  process 
approach 

Application  2:  Clinical  trial 


n  =  43 

8  years  of  follow-up 

High  doses  of  antioxidants  and  zinc 

Age-related  macular  degeneration 

Exposure:  tx  =  1  if  treatment, 

0  if  placebo 
Covariates:  age,  sex 

Two  possible  events: 

1st  event:  visual  acuity  score  <50 
(i.e.,  poor  vision) 

2nd  event:  clinically  advanced 
severe  stage  of  macular 
degeneration 


The  repeated  event  analyzed  was  the  recurrence 
of  a  bladder  cancer  tumor  after  transurethral 
surgical  excision.  Each  recurrence  of  new 
tumors  was  treated  by  removal  at  each  examina¬ 
tion.  About  25%  of  the  86  subjects  experienced 
four  events. 

The  exposure  variable  of  interest  was  drug 
treatment  status  (tx,  0  =  placebo,  1  =  treatment 
with  thiotepa),  There  were  two  co variates: 
initial  number  of  tumors  (num)  and  initial 
size  of  tumors  (size). 

Results  for  the  CP  approach,  which  was  con¬ 
sidered  appropriate  for  these  data,  indicated 
that  there  was  no  strong  evidence  that  tx  is 
effective  after  controlling  for  num  and  size. 

An  alternative  approach  for  analyzing  recur¬ 
rent  event  data  was  also  described  using  a 
parametric  model  containing  a  frailty  compo¬ 
nent  (see  Chapter  7).  Specifically,  a  Weibull  PH 
model  with  a  gamma  distributed  frailty  was  fit 
using  the  bladder  cancer  dataset.  The  resulting 
estimated  HR  and  confidence  interval  were 
quite  similar  to  the  counting  process  results. 


The  second  application  considered  a  subset  of 
data  (n  =  43)  from  a  clinical  trial  to  evaluate 
the  effect  of  high  doses  of  antioxidants  and 
zinc  on  the  progression  of  age-related  macular 
degeneration  (AMD).  Patients  were  followed 
for  8  years. 

The  exposure  variable  of  interest  was  treat¬ 
ment  group  (tx).  Covariates  considered  were 
age  and  sex. 

Each  patient  could  possibly  experience  two 
events.  The  first  event  was  defined  as  the  sud¬ 
den  decrease  in  visual  acuity  score  below  50. 
The  second  event  was  considered  a  successive 
stage  of  the  first  event  and  defined  as  a  clini¬ 
cally  advanced  and  severe  stage  of  macular 
degeneration. 
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Focus  on  Stratified  CP  vs.  Marginal  Because  the  two  events  were  of  very  different 
(events  were  of  different  types)  types  and  because  survival  from  baseline  was  of 

primary  interest,  we  focused  on  the  results  for  the 

Stratified  CP  and  Marginal  approaches  only. 

An  interaction  SC  model  was  more  appropriate 
than  a  no-interaction  model  for  each  approach, 
thus  requiring  separate  results  for  the  two 
events  under  study. 

Conclusions  regarding  1st  event  The  results  for  the  first  event  indicated  no 

effect  of  the  treatment  on  reducing  visual  acu- 

•  No  treatment  effect  ity  score  below  50  (i.e.,  the  first  event)  from 

•  Same  for  Stratified  CP  and  either  Stratified  CP  or  Marginal  approaches 

Marginal  approaches  to  the  analysis. 

Conclusions  regarding  2nd  event  However,  there  was  evidence  of  a  clinically 

moderate  and  statistically  significant  effect  of 

•  Clinically  moderate  and  the  treatment  on  the  second  more  severe  event 

statistically  significant  of  macular  degeneration. 

treatment  effect 

Macular  degeneration  data:  prefer  The  choice  between  the  Stratified  CP  and 
Marginal  approach  (but  not  clear-  marginal  approaches  for  these  data  was  not 
cut)  clear-cut,  although  the  Marginal  approach 

was  perhaps  more  appropriate  because  the 
two  events  were  of  very  different  types. 

In  general:  carefully  consider  inter-  In  general,  however,  the  choice  among  all 
pretation  of  each  approach  four  approaches  requires  careful  consideration 

of  the  different  interpretations  that  can  be 
drawn  from  each  approach. 

Survival  plots:  one  ordered  event  at  Survival  plots  with  recurrent  events  are  derived 
a  time  Two  versions  for  survival  to  one  ordered  event  at  a  time.  For  plotting  sur- 
kth  event:  vival  to  a  kth  event  where  k  >  2,  one  can  use 

Stratified:  only  subjects  with  k  —  1  either  a  Stratified  or  Marginal  plot,  which 

events  typically  differ. 

Marginal:  ignores  previous  events 


Interaction  SC  model  v 
No-interaction  SC  model  x 
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Detailed 

Outline 


I.  Overview  (page  366) 

A.  Focus:  outcome  events  that  may  occur  more 
than  once  over  the  follow-up  time  for  a  given 
subject,  that  is,  “recurrent  events." 

B.  Counting  Process  (CP)  approach  uses  the  Cox 
PH  model. 

C.  Alternative  approaches  that  use  a  Stratified  Cox 
(SC)  PH  model  and  a  frailty  model. 

II.  Examples  of  Recurrent  Event  Data 

(pages  366-368) 

A.  1.  Multiple  relapses  from  remission:  leukemia 

patients. 

2.  Repeated  heart  attacks:  coronary  patients. 

3.  Recurrence  of  tumors:  bladder  cancer 
patients. 

4.  Deteriorating  episodes  of  visual  acuity: 
macular  degeneration  patients. 

B.  Objective  of  each  example:  to  assess  relation¬ 
ship  of  predictors  to  rate  of  occurrence,  allow¬ 
ing  for  multiple  events  per  subject. 

C.  Different  analysis  required  depending  on 
whether: 

1.  Recurrent  events  are  treated  as  identical 
(counting  process  approach),  or 

2.  Recurrent  events  involve  different  disease 
categories  and/or  the  order  of  events  is 
important  (stratified  Cox  approaches). 

III.  Counting  Process  Example  (pages  368-369) 

A.  Data  on  two  hypothetical  subjects  from  a  ran¬ 
domized  trial  that  compares  two  treatments  for 
bladder  cancer  tumors. 

B.  Data  set-up  for  Counting  Process  (CP) 

approach: 

1.  Each  subject  contributes  a  line  of  data  for 
each  time  interval  corresponding  to  each 
recurrent  event  and  any  additional  event- 
free  follow-up  interval. 

2.  Each  line  of  data  for  a  given  subject  lists  the 
start  time  and  stop  time  for  each  interval  of 
follow-up. 
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IV.  General  Data  Layout:  Counting  Process 
Approach  (pages  370-371) 

A.  i'j  time  intervals  for  subject  i. 

djj  event  status  (0  or  1)  for  subject  i  in  interval  j. 
tjjo  start  time  for  subject  i  in  interval  j. 
tjji  stop  time  for  subject  i  in  interval  j. 

Xijk  value  of  kth  predictor  for  subject  i  in  interval  j . 
i  =  1,  2,. . N;  j  =  1,  2,  . . .,  r,;  k  =  1,2,  . . p. 

B.  Layout  for  subject  i: 
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C.  Bladder  Cancer  Study  example: 

1 .  Data  layout  provided  for  the  first  26  subjects 
(86  subjects  total)  from  a  64-month  study  of 
recurrent  bladder  cancer  tumors. 

2.  The  exposure  variable:  drug  treatment  status 
(tx,  0  =  placebo,  1  =  treatment  with  thiotepa). 

3.  Covariates:  initial  number  of  tumors  (num) 
and  initial  size  of  tumors  (size). 

4.  Up  to  4  events  per  subject. 

V.  The  Counting  Process  Model  and  Method 

(pages  372-376) 

A.  The  model  typically  used  to  carry  out  the  Count¬ 
ing  Process  (CP)  approach  is  the  standard  Cox 
PH  model:  h(t,  X)  =  h0(t)  exp[£  PiXj]. 

B.  For  recurrent  event  survival  data,  the  (partial) 
likelihood  function  is  formed  differently  than 
for  nonrecurrent  event  survival  data: 

1 .  A  subject  who  continues  to  be  followed  after 
having  failed  at  t(f>  does  not  drop  out  of  the 
risk  set  after  t(fj  and  remains  in  the  risk  set 
until  his  or  her  last  interval  of  follow-up,  after 
which  the  subject  is  removed  from  the  risk  set. 

2.  Different  lines  of  data  contributed  by  the 
same  subject  are  treated  in  the  analysis  as  if 
they  were  independent  contributions  from 
different  subjects. 
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C.  For  the  bladder  cancer  data,  the  Cox  PH  Model 
for  CP  approach  is  given  by 

h(t,  X)  =  h0(t)exp[p  tx  +  Y!  num  +  Y2  size]. 

D.  The  overall  partial  likelihood  L  from  using 
the  CP  approach  will  be  automatically  deter¬ 
mined  by  the  computer  program  used  once 
the  data  layout  is  in  the  correct  CP  form  and 
the  program  code  used  involves  the  (start,  stop) 
formulation. 

VI.  Robust  Estimation  (pages  376-378) 

A.  In  the  CP  approach,  the  different  intervals  con¬ 
tributed  by  a  given  subject  represent  correlated 
observations  on  the  same  subject  that  must  be 
accounted  for  in  the  analysis. 

B.  A  widely  used  technique  for  adjusting  for  the 
correlation  among  outcomes  on  the  same  sub¬ 
ject  is  called  robust  estimation. 

C.  The  goal  of  robust  estimation  for  the  CP 
approach  is  to  obtain  variance  estimators  that 
adjust  for  correlation  within  subjects  when  pre¬ 
viously  no  such  correlation  was  assumed. 

D.  The  robust  estimator  of  the  variance  of  an 
estimated  regression  coefficient  allows  tests  of 
hypotheses  and  confidence  interval  estimation 
about  model  parameters  to  account  for  correla¬ 
tion  within  subjects. 

E.  The  general  form  of  the  robust  estimator  can 
be  most  conveniently  written  in  matrix  nota¬ 
tion;  this  formula  is  incorporated  into  the  com¬ 
puter  program  and  is  automatically  calculated 
by  the  program  with  appropriate  coding. 

VII.  Results  for  CP  Example  (pages  378-379) 

A.  Edited  output  is  provided  from  fitting  the  no¬ 
interaction  Cox  PH  model  involving  the  three 
predictors  tx,  num,  and  size. 

B .  A  likelihood  ratio  chunk  test  for  interaction  terms 
tx  x  num  and  tx  x  size  was  nonsignificant. 

C.  The  PH  assumption  was  assumed  satisfied  for 
all  three  variables. 

D.  The  robust  estimator  of  0.24 1 8  for  the  standard 
deviation  of  tx  was  similar  though  somewhat 
different  from  the  corresponding  nonrobust 
estimator  of  0.2001. 

E.  There  was  not  strong  evidence  that  tx  is  effec¬ 
tive  after  controlling  for  num  and  size  (HR  = 
0.67,  two-sided  P  =  .09,  95%  Cl:  0.414,  1.069). 
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F.  However,  for  a  one-sided  alternative,  the  p-values 
using  both  robust  and  nonrobust  standard  errors 
were  significant  at  the  .05  level. 

G.  The  95%  confidence  interval  using  the  robust 
variance  estimator  is  quite  wide. 

VIII.  Other  Approaches  Stratified  Cox 

(pages  379-385) 

A.  The  “strata”  variable  for  each  of  the  three  SC 
approaches  treats  the  time  interval  number  for 
each  event  occurring  on  a  given  subject  as  a 
stratified  variable. 

B.  Three  alternative  approaches  involving  SC 
models  need  to  be  considered  if  the  investigator 
wants  to  distinguish  the  order  in  which  recur¬ 
rent  events  occur. 

C.  These  approaches  all  differ  from  what  is  called 
competing  risk  survival  analysis  in  that  the 
latter  allows  each  subject  to  experience  only 
one  of  several  different  types  of  events  over 
follow-up. 

D.  Stratified  CP  approach: 

1.  Same  Start  and  Stop  Times  as  CP. 

2.  SC  model. 

E.  Gap  Time  approach: 

1.  Start  and  Stop  Times,  but  Start  =  0  always 
and  Stop  =  time  since  previous  event. 

2.  SC  model. 

F.  Marginal  approach: 

1.  Uses  standard  layout  (nonrecurrent  event); 
no  (Start,  Stop)  columns. 

2.  Treats  each  failure  is  a  separate  process. 

3 .  Each  subject  at  risk  for  all  failures  that  might 
occur,  so  that  #  actual  failures  <  #  possible 
failures. 

4.  SC  model. 

G.  Must  decide  between  two  types  of  SC  models: 

1.  No-interaction  SC  versus  interaction  SC. 

2.  Bladder  cancer  example: 

No-interaction  model:  hg(t,  X)  = 
h0g(t)exp[(3  tx  +  y  i  num  +  y2  size]  where  g  = 
1,2,  3,  4. 

Interaction  model:  hg(t,X)  =  h0g(t)exp[pgtx 
+  Yignum  +  Y2gsize].  where  g  =  1,  2,  3,  4. 

H.  Recommend  using  robust  estimation  to  adjust 
for  correlation  of  observations  on  the  same 
subject. 
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IX.  Bladder  Cancer  Study  Example  (Continued) 

(pages  385-389) 

A.  Results  from  using  all  four  methods  -  CP,  Stra¬ 
tified  CP,  Gap  Time,  and  Marginal  -  on  the 

bladder  cancer  data  were  compared. 

B.  The  hazard  ratio  for  the  effect  of  tx  based  on  a 
no-interaction  model  differed  somewhat  for 
each  of  the  four  approaches,  with  the  marginal 
model  being  most  different: 

M:  0.560  CP:  0.666  SCP:  0.716  GT:  0.763 

C.  The  nonrobust  and  robust  standard  errors  and 
P-values  differed  to  some  extent  for  each  of  the 
different  approaches. 

D.  Using  an  interaction  SC  model,  the  estimated 
(3s  and  corresponding  standard  errors  are  dif¬ 
ferent  over  the  four  strata  (i.e.,  four  events)  for 
each  model  separately. 

E.  The  estimated  P’s  and  corresponding  standard 
errors  for  the  three  alternative  SC  models  are 
identical,  as  expected  (always  for  first  events). 

F.  Which  of  the  four  recurrent  event  analysis 
approaches  is  best? 

1 .  Recommend  CP  approach  if  do  not  want  to 
distinguish  between  recurrent  events  on  the 
same  subject  and  desire  overall  conclusion 
about  the  effect  of  tx. 

2.  Recommend  one  of  the  three  SC  approaches 
if  want  to  distinguish  the  effect  of  tx  accord¬ 
ing  to  the  order  in  which  the  event  occurs. 

3.  The  choice  between  the  Stratified  CP  and 
Marginal  is  difficult,  but  prefer  Stratified 
CP  because  the  strata  do  not  clearly  repre¬ 
sent  different  event  types. 

G.  Overall,  regardless  of  the  approach  used,  there 
was  no  strong  evidence  that  tx  is  effective  after 
controlling  for  num  and  size. 

X.  A  Parametric  Approach  Using  Shared  Frailty 

(pages  389-391) 

A.  Alternative  approach  using  a  parametric  model 
containing  a  frailty  component  (see  Chapter  7). 

B.  Weibull  PH  model  with  a  gamma  distributed 
frailty  was  fit  using  the  bladder  cancer  dataset. 

C.  Estimated  HR  and  confidence  interval  were 
quite  similar  to  the  counting  process  results. 

D.  Estimated  frailty  component  was  significant 
(P  =  0.003). 
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XI.  A  Second  Example  (pages  391-395) 

A.  Clinical  trial  (n  =  43,  8-year  study)  on  effect  of 
using  high  doses  of  antioxidants  and  zinc  (i.e., 
tx  =  1  if  yes,  0  if  no)  to  prevent  age-related 
macular  degeneration. 

B.  Covariates:  age  and  sex. 

C.  Two  possible  events: 

1 .  First  event:  visual  acuity  score  <50  (i.e.,  poor 
vision). 

2.  Second  event:  clinically  advanced  stage  of 
macular  degeneration. 

D.  Focus  on  Stratified  CP  vs.  Marginal  because 
events  are  of  different  types. 

E.  Interaction  SC  model  significant  when  com¬ 
pared  to  no-interaction  SC  model. 

F.  Conclusions  regarding  1st  event: 

1.  No  treatment  effect  (HR  =  0.946,  P  =  0.85). 

2.  Same  for  Stratified  CP  and  Marginal 
approaches. 

G.  Conclusions  regarding  2nd  event. 

1.  Stratified  CP:  HR  =  0.385  =  1/2.60,  two- 
sided  P  =  0.03. 

2.  Marginal:  HR  =  0.423  =  1/2.36,  two-sided 
P  =  0.06). 

3.  Overall,  clinically  moderate  and  statistically 
significant  treatment  effect. 

H.  Marginal  approach  preferred  because  1st  and 
2nd  events  are  different  types. 

XII.  Survival  Curves  with  Recurrent  Events 

(pages  395-398) 

A.  Survival  plots  with  recurrent  events  only  make 
sense  when  the  focus  is  on  one  ordered  event  at 
a  time. 

B.  For  survival  from  a  1st  event,  the  survival  curve 
is  given  by  Si(t)  =  Pr  (T!  >  t)  where  Ti  =  sur¬ 
vival  time  up  to  occurrence  of  the  1st  event 
(ignores  later  recurrent  events). 

C.  For  survival  from  the  kth  event,  the  survival 
curve  is  given  by  S^f  t)  =  Pr  (T^  >  t)  where  T|{  = 
survival  time  up  to  occurrence  of  the  kth  event). 
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D.  Two  versions  for  Sk(t): 

i.  Skc(t)  Stratified:  Tkc  =  time  from  the  (k- 1  )st 
to  kth  event,  restricting  data  to  subjects 
with  k-1  events. 

h.  Skmft)  Marginal:  Tkm  =  time  from  study 
entry  to  kth  event,  ignoring  previous  events. 

E.  Illustration  of  survival  plots  for  recurrent  event 
data  using  a  small  dataset  involving  three  sub¬ 
jects  Molly  (M),  Holly  (H),  and  Polly  (P). 

XIII.  Summary  (pages  398-401) 

A.  Four  approaches  for  analyzing  recurrent  event 
survival  data:  the  counting  process  (CP), 
Stratified  CP,  Gap  Time,  and  Marginal 

approaches. 

B.  Data  layouts  differ  for  each  approach. 

C.  CP  approach  uses  Cox  PH  model;  other 
approaches  use  Cox  SC  model. 

D.  Choice  of  approach  depends  in  general  on  care¬ 
fully  considering  the  interpretation  of  each 
approach. 

E.  Should  use  robust  estimation  to  adjust  for 
correlation  of  observations  on  the  same  subject. 


Answer  questions  1  to  15  as  true  or  false  (circle  T  or  F). 

T  F  1 .  A  recurrent  event  is  an  event  (i.e.,  failure)  that  can 
occur  more  than  once  over  the  follow-up  on  a 
given  subject. 

T  F  2.  The  Counting  Process  (CP)  approach  is  appro¬ 
priate  if  a  given  subject  can  experience  more  than 
one  different  type  of  event  over  follow-up. 

T  F  3.  In  the  data  layout  for  the  CP  approach,  a  subject 
who  has  additional  follow-up  time  after  having 
failed  at  time  t(f)  does  not  drop  out  of  the  risk  set 
after  t(f). 

T  F  4.  The  CP  approach  requires  the  use  of  a  stratified 
Cox  (SC)  PH  model. 

T  F  5.  Using  the  CP  approach,  if  exactly  two  subjects  fail 
at  month  t(q  =10,  but  both  these  subjects  have 
later  recurrent  events,  then  the  number  in  the  risk 
set  at  the  next  ordered  failure  time  does  not 
decrease  because  of  these  two  failures. 
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T  F  6. 


T  F  7. 


T  F  8. 


T  F  9. 


T  F  10. 


T  F  11. 


T  F  12. 


T  F  13. 


The  goal  of  robust  estimation  for  the  CP 

approach  is  to  adjust  estimated  regression  coeffi¬ 
cients  to  account  for  the  correlation  of  observa¬ 
tions  within  subjects  when  previously  no  such 
correlation  was  assumed. 

Robust  estimation  is  recommended  for  the  CP 
approach  but  not  for  the  alternative  SC 
approaches  for  analyzing  recurrent  event  survival 
data. 

The  p-value  obtained  from  using  a  robust  stan¬ 
dard  error  will  always  be  larger  than  the 
corresponding  p-value  from  using  a  nonrobust 
standard  error. 

The  Marginal  approach  uses  the  exact  same 
(start,  stop)  data  layout  format  used  for  the  CP 
approach,  except  that  for  the  Marginal  approach, 
the  model  used  is  a  stratified  Cox  PH  model  vari¬ 
able  rather  than  a  standard  (unstratified)  PH 
model. 

Suppose  the  maximum  number  of  failures  occur¬ 
ring  for  a  given  subject  is  five  in  a  dataset  to  be 
analyzed  using  the  Marginal  approach.  Then  a 
subject  who  failed  only  twice  will  contribute  five 
lines  of  data  corresponding  to  his  or  her  two  fail¬ 
ures  and  the  three  additional  failures  that  could 
have  possibly  occurred  for  this  subject. 

Suppose  the  maximum  number  of  failures  occur¬ 
ring  for  a  given  subject  is  five  in  a  dataset  to  be 
analyzed  using  the  Stratified  CP  approach.  Then 
an  interaction  SC  model  used  to  carry  out  this 
analysis  will  have  the  following  general  model 
form:  hg(t,  X)  =  h0g(t)  exp[plgX!  +  p2gX2  +•••  + 
ppgXp],  g=  1,2, 3, 4, 5. 

Suppose  a  no-interaction  SC  model  using  the 
Stratified  CP  approach  is  found  (using  a  likeli¬ 
hood  ratio  test)  not  statistically  different  from  a 
corresponding  interaction  SC  model.  Then  if  the 
no-interaction  model  is  used,  it  will  not  be  possi¬ 
ble  to  separate  out  the  effects  of  predictors  within 
each  stratum  representing  the  recurring  events  on 
a  given  subject. 

In  choosing  between  the  Stratified  CP  and  the 
Marginal  approaches,  the  Marginal  approach 
would  be  preferred  provided  the  different  strata 
clearly  represent  different  event  types. 
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T  F  14.  When  using  an  interaction  SC  model  to  analyze 
recurrent  event  data,  the  estimated  regression 
coefficients  and  corresponding  standard  errors 
for  the  first  stratum  always  will  be  identical  for 

the  Stratified  CP,  Gap  Time,  and  Marginal 

approaches. 

T  F  15.  The  choice  among  the  CP,  Stratified  CP,  Gap 
Time,  and  Marginal  approaches  depends  upon 
whether  a  no-interaction  SC  or  an  interaction  SC 
model  is  more  appropriate  for  one’s  data. 

16.  Suppose  that  Allie  (A),  Sally  (S),  and  Callie  (C)  are  the 
only  three  subjects  in  the  dataset  shown  below.  All 
three  subjects  have  two  recurrent  events  that  occur 
at  different  times. 


ID 

Status 

Stratum 

Start 

Stop 

tx 

A 

1 

1 

0 

70 

1 

A 

1 

2 

70 

90 

1 

S 

1 

1 

0 

20 

0 

S 

1 

2 

20 

30 

0 

C 

1 

1 

0 

10 

1 

c 

1 

2 

10 

40 

1 

Fill  in  the  following  data  layout  describing  survival  (in 
weeks)  to  the  first  event  (stratum  1).  Recall  that  mf 
and  qf  denote  the  number  of  failures  and  censored 
observations  at  time  t(f).  The  survival  probabilities  in 
the  last  column  use  the  KM  product  limit  formula. 

t(f>  nf  mf  qf  R(t(f))  Si(t(f)) 

0  3  0  0  [A,  S,  C]  1.00 

10 


17.  Plot  the  survival  curve  that  corresponds  to  the  data 
layout  obtained  for  Question  16. 

l.o 
.8  - 
.6  - 

.4  - 
.2  - 


20  40 


60 


80  100 
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18.  Fill  in  the  following  data  layout  describing  survival  (in 
weeks)  from  the  first  to  second  event  using  the  Gap 
Time  approach: 

t(f>  nf  mf  qf  R(t{f))  S2(t(f)) 

0  3  0  0  {A,  S,  C]  1.00 

10 

19.  Plot  the  survival  curve  that  corresponds  to  the  data 
layout  obtained  for  Question  18. 

1.0 
.8 
.6 
.4 
.2 

20  40  60  80  100 

20.  Fill  in  the  following  data  layout  describing  survival  (in 
weeks)  to  the  second  event  using  the  Marginal 

approach: 

t(f>  nf  mf  qf  R(t(f))  S2(t(f)) 

0  3  0  0  {A,  S,  Cj  1.00 

30 

21.  Plot  the  survival  curve  that  corresponds  to  the  data 
layout  obtained  for  Question  20. 

1.0 
.8 
.6 
.4 
.2 

20  40  60  80  100 

22.  To  what  extent  do  the  three  plots  obtained  in  Ques¬ 
tions  17,  19,  and  21  differ?  Explain  briefly. 
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Test  1 .  Suppose  that  Bonnie  (B)  and  Lonnie  (L)  are  th  only  two 

subjects  in  the  dataset  shown  below,  wher  both  sub¬ 
jects  have  two  recurrent  events  that  occur  ar  different 
times. 


ID 

Status 

Stratum 

Start 

Stop 

B 

1 

1 

0 

12 

B 

1 

2 

12 

16 

L 

1 

1 

0 

20 

L 

1 

2 

20 

23 

a.  Fill  in  the  empty  cells  in  the  following  data  layout 
describing  survival  time  (say,  in  weeks)  to  the  first 
event  (stratum  1): 


kf) 

nf 

rrif 

Of 

R(t(f)) 

0 

2 

0 

0 

[B,  L] 

12 

20 

b.  Why  will  the  layout  given  in  part  a  be  the  same 
regardless  of  whether  rhe  analysis  approach  is  the 
Counting  Process  (CP),  Stratified  CP,  Gap  Time,  or 
Marginal  approaches? 

c.  Fill  in  the  empty  cells  in  the  following  data  layout 
describing  survival  time  (say,  in  weeks)  from  the 
first  to  the  second  event  (stratum  2)  using  the 
Stratified  CP  approach: 


t(£) 

nf 

rrif 

Of 

0 

0 

0 

0 

- 

16 

23 

d.  Fill  in  the  empty  cells  in  the  following  data  layout 
describing  survival  time  (say,  in  weeks)  from  the 
first  to  the  second  event  (stratum  2)  using  the  Gap 
Time  approach: 


t(f) 

nf 

rrif 

qi 

R(t<f>) 

0 

2 

0 

0 

[B,  L] 

3 

4 
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e.  Fill  in  the  empty  cells  in  the  following  data  layout 
describiing  survival  time  (say,  in  weeks)  from  the 
first  to  the  second  event  (stratum  2)  using  the 
Marginal  approach: 


t(f> 

nf 

nif 

0 

2 

0 

0 

[B,  L] 

16 

23 

f.  For  the  Stratified  CP  approach  described  in  part  c, 
determine  which  of  the  following  choices  is  correct. 
Circle  the  number  corresponding  to  the  one  and 
only  one  correct  choice. 

i.  Lonnie  is  in  the  risk  set  when  Bonnie  gets  her 
second  event. 

ii.  Bonnie  is  in  the  risk  set  when  Lonnie  gets  her 
second  event. 

iii.  Neither  is  in  the  risk  set  for  the  other’s  second 
event. 

g.  For  the  Gap  Time  approach  described  in  part  d, 
determine  which  of  the  following  choices  is  correct. 
Circle  the  number  corresponding  to  the  one  and 
only  one  correct  choice. 

i.  Lonnie  is  in  the  risk  set  when  Bonnie  gets  her 
second  event. 

ii.  Bonnie  is  in  the  risk  set  when  Lonnie  gets  her 
second  event. 

ii.  Neither  is  in  the  risk  set  for  the  other’s  second 
event. 

h.  For  the  Marginal  approach  described  in  part  e, 
determine  which  of  the  following  choices  is  correct. 
Circle  the  number  corresponding  to  the  one  and 
only  one  correct  choice. 

i.  Lonnie  is  in  the  risk  set  when  Bonnie  gets  her 
second  event. 

ii.  Bonnie  is  in  the  risk  set  when  Lonnie  gets  her 
second  event. 

iii.  Neither  is  in  the  risk  set  for  the  other’s  second 
event. 
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2.  The  dataset  shown  below  in  the  counting  process  lay¬ 
out  comes  from  a  clinical  trial  involving  36  heart  attack 
patients  between  40  and  50  years  of  age  with  implanted 
defibrillators  who  were  randomized  to  one  of  two  treat¬ 
ment  groups  (tx,  =  1  if  treatment  A,  =  0  if  treatment  B) 
to  reduce  their  risk  for  future  heart  attacks  over  a  4- 
month  period.  The  event  of  interest  was  experiencing  a 
“high  energy  shock”  from  the  defibrillator.  The  out¬ 
come  is  time  (in  days)  until  an  event  occurs.  The  covari¬ 
ate  of  interest  was  Smoking  History  ( 1  =  ever  smoked, 
0  =  never  smoked).  Questions  about  the  analysis  of  this 
dataset  follow. 


Col  1  =  id,  Col  2  =  event,  Col  3  =  start,  Col  4  =  stop, 
Col  5  =  tx,  Col  6  =  smoking 


01 

1 

0 

39 

0 

0 

12 

1 

0 

39 

0 

1 

01 

1 

39 

66 

0 

0 

12 

1 

39 

80 

0 

1 

01 

1 

66 

97 

0 

0 

12 

0 

80 

107 

0 

1 

02 

1 

0 

34 

0 

1 

13 

1 

0 

36 

0 

1 

02 

1 

34 

65 

0 

1 

13 

1 

36 

64 

0 

1 

02 

1 

65 

100 

0 

1 

13 

1 

64 

95 

0 

1 

03 

1 

0 

36 

0 

0 

14 

1 

0 

46 

0 

1 

03 

1 

36 

67 

0 

0 

14 

1 

46 

77 

0 

1 

03 

1 

67 

96 

0 

0 

14 

0 

77 

111 

0 

1 

04 

1 

0 

40 

0 

0 

15 

1 

0 

61 

0 

1 

04 

1 

40 

80 

0 

0 

15 

1 

61 

79 

0 

1 

04 

0 

80 

111 

0 

0 

15 

0 

79 

111 

0 

1 

05 

1 

0 

45 

0 

0 

16 

1 

0 

57 

0 

1 

05 

1 

45 

68 

0 

0 

16 

0 

57 

79 

0 

1 

05 

68 

0 

0 

16 

79 

0 

1 

06 

1 

0 

33 

0 

1 

17 

1 

0 

37 

0 

1 

06 

1 

33 

66 

0 

1 

17 

1 

37 

76 

0 

1 

06 

1 

66 

96 

0 

1 

17 

0 

76 

113 

0 

1 

07 

1 

0 

34 

0 

1 

18 

1 

0 

58 

0 

1 

07 

1 

34 

67 

0 

1 

18 

1 

58 

67 

0 

1 

07 

1 

67 

93 

0 

1 

18 

0 

67 

109 

0 

1 

08 

1 

0 

39 

0 

1 

19 

1 

0 

58 

1 

1 

08 

1 

39 

72 

0 

1 

19 

1 

58 

63 

1 

1 

08 

1 

72 

102 

0 

1 

19 

1 

63 

106 

1 

1 

09 

1 

0 

39 

0 

1 

20 

1 

0 

45 

1 

0 

09 

1 

39 

79 

0 

1 

20 

1 

45 

72 

1 

0 

09 

0 

79 

109 

0 

1 

20 

1 

72 

106 

1 

0 

10 

1 

0 

36 

0 

0 

21 

1 

0 

48 

1 

0 

10 

1 

36 

65 

0 

0 

21 

1 

48 

81 

1 

0 

10 

1 

65 

96 

0 

0 

21 

1 

81 

112 

1 

0 

11 

1 

0 

39 

0 

0 

22 

1 

0 

38 

1 

1 

11 

1 

39 

78 

0 

0 

22 

1 

38 

64 

1 

1 

11 

1 

78 

108 

0 

0 

22 

1 

64 

97 

1 

1 
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23 

1 

0 

51 

1 

1 

30 

1 

0 

57 

1 

0 

23 

1 

51 

69 

1 

1 

30 

1 

57 

78 

1 

0 

23 

0 

69 

98 

1 

1 

30 

1 

78 

99 

1 

0 

24 

1 

0 

43 

1 

1 

31 

1 

0 

44 

1 

1 

24 

1 

43 

67 

1 

1 

31 

1 

44 

74 

1 

1 

24 

0 

67 

111 

1 

1 

31 

1 

74 

96 

1 

1 

25 

1 

0 

46 

1 

0 

32 

1 

0 

38 

1 

1 

25 

1 

46 

66 

1 

0 

32 

1 

38 

71 

1 

1 

25 

1 

66 

110 

1 

0 

32 

1 

71 

105 

1 

1 

26 

1 

0 

33 

1 

1 

33 

1 

0 

38 

1 

1 

26 

1 

33 

68 

1 

1 

33 

1 

38 

64 

1 

1 

26 

1 

68 

96 

1 

1 

33 

1 

64 

97 

1 

1 

27 

1 

0 

51 

1 

1 

34 

1 

0 

38 

1 

1 

27 

1 

51 

97 

1 

1 

34 

1 

38 

63 

1 

1 

27 

0 

97 

115 

1 

1 

34 

1 

63 

99 

1 

1 

28 

1 

0 

37 

1 

0 

35 

1 

0 

49 

1 

1 

28 

1 

37 

79 

1 

0 

35 

1 

49 

70 

1 

1 

28 

1 

79 

93 

1 

0 

35 

0 

70 

107 

1 

1 

29 

1 

0 

41 

1 

1 

36 

1 

0 

34 

1 

1 

29 

1 

41 

73 

1 

1 

36 

1 

34 

81 

1 

1 

29 

0 

73 

111 

1 

1 

36 

1 

81 

97 

1 

1 

Table  T.l  below  provides  the  results  for  the  treatment 
variable  (tx)  from  no-interaction  models  over  all  four 
recurrent  event  analysis  approaches.  Each  model  was 
fit  using  either  a  Cox  PH  model  (CP  approach)  or  a 
Stratified  Cox  (SC)  PH  model  (Stratified  CP,  Gap 
Time,  Marginal  approaches)  that  controlled  for  the 
covariate  smoking. 


Table  T.l.  Comparison  of  Results  for  the  Treatment  Variable  (tx) 
Obtained  from  No-Interaction  Models3  Across  Four  Methods 
(Defibrillator  Study) 


Model 

CP 

Stratified 

CP 

Gap  Time 

Marginal 

Parameter 

0.0839 

0.0046 

-0.0018 

-0.0043 

estimateb 

Robust  standard 

0.1036 

0.2548 

0.1775 

0.2579 

error 

Chi-square 

0.6555 

0.0003 

0.0001 

0.0003 

p-value 

0.4182 

0.9856 

0.9918 

0.9866 

Hazard  ratio 

1.087 

1.005 

0.998 

0.996 

95%  confidence 

(0.888,  1.332)  (0.610,  1.655) 

(0.705,  1.413) 

(0.601,  1.651) 

interval 

a  No-interaction  SC  model  fitted  with  PROC  PHREG  for  the  Stratified  CP, 
Gap  Time  and  Marginal  methods;  no-interaction  standard  Cox  PH  model 
fitted  for  CP  approach. 
b  Estimated  coefficient  of  tx  variable. 
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2.  a.  State  the  hazard  function  formula  for  the  no¬ 
interaction  model  used  to  fit  the  CP  approach. 

b.  Based  on  the  CP  approach,  what  do  you  conclude 
about  the  effect  of  treatment  (tx)?  Explain  briefly 
using  the  results  in  Table  T.  1 . 

c.  State  the  hazard  function  formulas  for  the  no-inter¬ 
action  and  interaction  SC  models  corresponding  to 
the  use  of  the  Marginal  approach  for  fitting  these 
data. 

d.  Table  T.  1  gives  results  for  “no-interaction”  SC  mod¬ 
els  because  likelihood  ratio  (LR)  tests  comparing  a 
"no-interaction”  with  an  "interaction”  SC  model 
were  not  significant.  Describe  the  (LR)  test  used 
for  the  marginal  model  (full  and  reduced  models, 
null  hypothesis,  test  statistic,  distribution  of  test 
statistic  under  the  null). 

e.  How  can  you  criticize  the  use  of  a  no-interaction  SC 
model  for  any  of  the  SC  approaches,  despite  the 
finding  that  the  above  likelihood  ratio  test  was  not 
significant? 

f.  Based  on  the  study  description  given  earlier,  why 
does  it  make  sense  to  recommend  the  CP  approach 
over  the  other  alternative  approaches? 

g.  Under  what  circumstances/assumptions  would  you 
recommend  using  the  Marginal  approach  instead 
of  the  CP  approach? 

Table  T.2  below  provides  ordered  failure  times  and 
corresponding  risk  set  information  that  result  for  the  36 
subjects  in  the  above  Defibrillator  Study  dataset  using  the 
Counting  Process  (CP)  data  layout  format. 


Table  T.2.  Ordered  Failure  Times  and  Risk  Set  Information  for 
Defibrillator  Study  (CP) 


Ordered 
failure 
times  t(f) 

#  in 
risk 
set  nf 

# 

failed 

mf 

#  censored 
in  [t(f), 
hf+n) 

Subject  ID  #s  for 
outcomes  in  [t(f), 
t(f-t-i)) 

0 

36 

0 

0 

— 

33 

36 

2 

0 

6,  26 

34 

36 

3 

0 

2,  7,  36 

36 

36 

3 

0 

3,  10,  13 

37 

36 

2 

0 

17,  28 

38 

36 

4 

0 

22,  32,  33,  34 

39 

36 

5 

0 

1,  8,  9,  11,  12 

40 

36 

1 

0 

4 

41 

36 

1 

0 

29 

43 

36 

1 

0 

24 

44 

36 

1 

0 

31 

( Continued  on  next  page) 
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Table  T.2.  ( Continued ) 


Ordered 
failure 
times  t(f) 

#  in 
risk 
set  nf 

# 

failed 

mf 

#  censored 
in  [t(f), 
t(f+n) 

Subject  ID  #s  for 
outcomes  in  [t(Q, 
kf+n) 

45 

36 

2 

0 

5,  20 

46 

36 

2 

0 

14,  25 

48 

36 

1 

0 

21 

49 

36 

1 

0 

35 

51 

36 

2 

0 

23,  27 

57 

36 

2 

0 

16,  30 

58 

36 

2 

0 

18,  19 

61 

36 

1 

0 

15 

63 

36 

2 

0 

19,  34 

64 

36 

3 

0 

13,  22,  33 

65 

36 

2 

0 

2,  10 

66 

36 

3 

0 

1,  6,  25 

67 

36 

4 

0 

3,  7,  18,  24 

68 

36 

2 

0 

5,  26 

69 

35 

1 

0 

23 

70 

35 

1 

0 

35 

71 

35 

1 

0 

32 

72 

35 

2 

0 

8,  20 

73 

35 

1 

0 

29 

74 

35 

1 

0 

31 

76 

35 

1 

0 

17 

77 

35 

1 

0 

14 

78 

35 

2 

0 

11,  30 

79 

35 

3 

1 

9,  15,  16,  28 

80 

34 

2 

0 

4,  12 

81 

34 

2 

0 

21,  36 

93 

34 

2 

0 

7,  28 

95 

32 

1 

0 

13 

96 

31 

5 

0 

3,  6,  10,  26,  31 

97 

26 

5 

0 

1,  22,  27,  33,  36 

98 

22 

0 

1 

23 

99 

21 

2 

0 

30,  34 

100 

19 

1 

0 

2 

102 

18 

1 

0 

8 

105 

17 

1 

0 

32 

106 

16 

2 

0 

19,  20 

107 

14 

1 

1 

12,  35 

108 

12 

1 

0 

11 

109 

11 

0 

2 

9,  18 

110 

9 

1 

0 

25 

111 

8 

0 

5 

4,  14,  15,  24,  29 

112 

3 

1 

0 

21 

113 

2 

0 

1 

17 

115 

1 

0 

1 

27 
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h.  In  Table  T.2,  why  does  the  number  in  the  risk  set 
(nf)  remain  unchanged  through  failure  time  (i.e., 
day)  68,  even  though  50  events  occur  up  to  that 
time? 

i.  Why  does  the  number  in  the  risk  set  change  from  3 1 
to  26  when  going  from  time  96  to  97? 

j .  Why  is  the  number  of  failures  (mf)  equal  to  3  and  the 
number  of  censored  subjects  equal  to  1  in  the  inter¬ 
val  between  failure  times  79  and  80? 

k.  What  5  subjects  were  censored  in  the  interval 
between  failure  times  111  and  112? 

l.  Describe  the  event  history  for  subject  #5,  including 
his  or  her  effect  on  changes  in  the  risk  set. 

Based  on  the  CP  data  layout  of  Table  T.2,  the  following 

table  (T.3)  of  survival  probabilities  has  been  calculated. 


Table  T.3.  Survival  Probabilities  for  Defibrillator  Study  Data  Based 
on  CP  Layout 


t(f) 

nf 

mf 

qt 

S(W  =  S(t(f_ 

H 

V 

3 

H 

IV 

3^ 

0 

36 

0 

0 

1.0 

33 

36 

2 

0 

1  x  34/36  =  .94 

34 

36 

3 

0 

.94  x  33/36  = 

.87 

36 

36 

3 

0 

.87  x  33/36  = 

.79 

37 

36 

2 

0 

.79  x  34/36  = 

.75 

38 

36 

4 

0 

.75  x  32/36  = 

.67 

39 

36 

5 

0 

.67  x  31/36  = 

.57 

40 

36 

1 

0 

.57  x  35/36  = 

.56 

41 

36 

1 

0 

.56  x  35/36  = 

.54 

43 

36 

1 

0 

.54  x  35/36  = 

.53 

44 

36 

1 

0 

.53  x  35/36  = 

.51 

45 

36 

2 

0 

.51  x  34/36  = 

.48 

46 

36 

2 

0 

.48  x  34/36  = 

.46 

48 

36 

1 

0 

.46  x  35/36  = 

.44 

49 

36 

1 

0 

.44  x  35/36  = 

.43 

51 

36 

2 

0 

.43  x  34/36  = 

.41 

57 

36 

2 

0 

.41  x  34/36  = 

.39 

58 

36 

2 

0 

.39  x  34/36  = 

.36 

61 

36 

1 

0 

.36  x  35/36  = 

.35 

63 

36 

2 

0 

.35  x  34/36  = 

.33 

64 

36 

3 

0 

.33  x  33/36  = 

.31 

65 

36 

2 

0 

.31  x  34/36  = 

.29 

66 

36 

3 

0 

.29  x  33/36  = 

.27 

67 

36 

4 

0 

.27  x  32/36  = 

.24 

68 

36 

2 

0 

.24  x  34/36  = 

.22 

69 

35 

1 

0 

.22  x  34/35  = 

.22 

70 

35 

1 

0 

.22  x  34/35  = 

.21 

71 

35 

1 

0 

.21  x  34/35  = 

.20 

72 

35 

2 

0 

.20  x  33/35  = 

.19 

73 

35 

1 

0 

.19  x  34/35  = 

.19 

74 

35 

1 

0 

.19  x  34/35  = 

.18 

76 

35 

1 

0 

.18  x  34/35  = 

.18 

77 

35 

1 

0 

.18  x  34/35  = 

.17 

78 

35 

2 

0 

.17  x  33/35  = 

.16 
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Table  T.3.  ( Continued ) 


t(f) 

nf 

rrif 

Of 

S(t(f))  =  S(t(f_ 

H 

V 

3 

H 

IV 

3^ 

79 

35 

3 

1 

.16  x  31/35  = 

.14 

80 

34 

2 

0 

.14  x  32/34  = 

.13 

81 

34 

2 

0 

.13  x  32/34  = 

.13 

95 

32 

1 

0 

.13  x  31/32  = 

.12 

96 

31 

5 

0 

.12  x  26/31  = 

.10 

97 

26 

5 

0 

.10  x  21/26  = 

.08 

98 

22 

0 

1 

.08  x  22/22  = 

.08 

99 

21 

2 

0 

.08  x  19/21  = 

.07 

100 

19 

1 

0 

.07  x  18/19  = 

.07 

102 

18 

1 

0 

.07  x  17/18  = 

.06 

105 

17 

1 

0 

.06  x  16/17  = 

.06 

106 

16 

2 

0 

.06  x  14/16  = 

.05 

107 

14 

1 

1 

.05  x  13/14  = 

.05 

108 

12 

1 

0 

.05  x  21/26  = 

.05 

109 

11 

0 

2 

.05  x  11/11  = 

.05 

110 

9 

1 

0 

.05  x  8/9  = 

.04 

111 

8 

0 

5 

.04  x  8/8  = 

.04 

112 

3 

1 

0 

.04  x  2/3  = 

.03 

113 

2 

0 

1 

.03  x  2/2  = 

.03 

115 

1 

0 

1 

.03  x  1/1  = 

.03 

Suppose  the  survival  probabilities  shown  in  Table  T.3 
are  plotted  on  the  y-axis  versus  corresponding  ordered 
failure  times  on  the  x-axis. 

m.  What  is  being  plotted  by  such  a  curve?  (Circle  one 

or  more  choices.) 

i.  Pr(Tj  >  t)  where  T,  =  time  to  first  event  from 
study  entry. 

ii.  Pr(T  >  t)  where  T  =  time  from  any  event  to  the 
next  recurrent  event. 

iii.  Pr(T  >  t)  where  T  =  time  to  any  event  from 
study  entry. 

iv.  Pr(not  failing  prior  to  time  t). 

v.  None  of  the  above. 

n.  Can  you  criticize  the  use  of  the  product  limit  for¬ 
mula  for  S(t(f))  in  Table  T.3?  Explain  briefly. 
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o.  Use  Table  T.2  to  complete  the  data  layouts  for  plot¬ 
ting  the  following  survival  curves. 

i.  Si(t)  =  Pr(T!  >  t)  where  Tj  =  time  to  first  event 
from  study  entry 


t(f) 

nf 

mt 

qf  s(t(f))  = 

S(t(f —  i ))  x  Pr(Tj  >  t|Ti  >  t) 

0 

36 

0 

0 

1.00 

33 

36 

2 

0 

0.94 

34 

34 

3 

0 

0.86 

36 

31 

3 

0 

0.78 

37 

28 

2 

0 

0.72 

38 

26 

4 

0 

0.61 

39 

22 

5 

0 

0.47 

40 

17 

1 

0 

0.44 

41 

16 

1 

0 

0.42 

43 

15 

1 

0 

0.39 

44 

14 

1 

0 

0.36 

45 

13 

2 

0 

0.31 

46 

11 

2 

0 

0.25 

48 

9 

1 

0 

0.22 

49 

8 

1 

0 

0.19 

51 

- 

- 

- 

- 

57 

- 

- 

- 

- 

58 

- 

- 

- 

- 

61 

- 

- 

- 

- 

ii. 

Gap  Time  S2c(t)  =  Pr(T2c  >  t)  where  T2c  = 
time  to  second  event  from  first  event. 

ten 

nf 

m} 

qf  s(t{f))  = 

S(t(f —  i ))  x  Pr(Tj  >  t|Ti  >  t) 

0 

36 

0 

0 

1.00 

5 

36 

1 

0 

0.97 

9 

35 

1 

0 

0.94 

18 

34 

2 

0 

0.89 

20 

32 

1 

0 

0.86 

21 

31 

2 

l 

0.81 

23 

28 

1 

0 

0.78 

24 

27 

1 

0 

0.75 

25 

26 

1 

0 

0.72 

26 

25 

2 

0 

0.66 

27 

23 

2 

0 

0.60 

28 

21 

1 

0 

0.58 

29 

20 

1 

0 

0.55 

30 

19 

1 

0 

0.52 
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( Continued ) 


t(f) 

nf 

mf 

qf  s(t(fj)  =  s(t(f_ 

-d)  x  PrCTj  >  t|Ti  >  t) 

31 

18 

3 

0 

0.43 

32 

15 

1 

0 

0.40 

33 

14 

5 

0 

0.26 

35 

9 

1 

0 

0.23 

39 

8 

2 

0 

0.17 

40 

- 

- 

- 

- 

41 

- 

- 

- 

- 

42 

- 

- 

- 

- 

46 

- 

- 

- 

- 

47 

- 

- 

- 

- 

iii. 

Marginal  S2m(t)  =  Pr(T2m  >  t)  where  T2m  = 

time  to  second  event  from  study  entry. 

t(p 

nf 

mf 

Qf  S(t(f))  =  S(t(f_ 

d)  x  PrCTj  >  t|Ti  >  t) 

0 

36 

0 

0 

1.00 

63 

36 

2 

0 

0.94 

64 

34 

3 

0 

0.86 

65 

31 

2 

0 

0.81 

66 

29 

3 

0 

0.72 

67 

26 

4 

0 

0.61 

68 

22 

2 

0 

0.56 

69 

20 

1 

0 

0.53 

70 

19 

1 

0 

0.50 

71 

18 

1 

0 

0.47 

72 

17 

2 

0 

0.42 

73 

15 

1 

0 

0.39 

74 

14 

1 

0 

0.36 

76 

13 

1 

0 

0.33 

77 

12 

1 

0 

0.31 

78 

11 

2 

0 

0.25 

79 

- 

- 

- 

- 

80 

- 

- 

- 

- 

81 

- 

- 

- 

- 

97 

- 

- 

- 

- 

p.  The  survival  curves  corresponding  to  each  of  the 
data  layouts  (a,  b,  c)  described  in  Question  14  will 
be  different.  Why? 
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Answers  to 

Practice 

Exercises 


1.  T 

2.  F:  The  Marginal  approach  is  appropriate  if  events  are 
of  different  types. 

3.  T 

4.  F:  The  Marginal,  Stratified  CP,  and  Gap  Time 

approaches  all  require  a  SC  model,  whereas  the  CP 
approach  requires  a  standard  PH  model. 

5.  T 

6.  F:  Robust  estimation  adjusts  the  standard  errors  of 
regression  coefficients. 

7.  F:  Robust  estimation  is  recommended  for  all  four 
approaches,  not  just  the  CP  approach. 

8.  F:  The  P-value  from  robust  estimation  may  be  either 
larger  or  smaller  than  the  corresponding  P-value  from 
nonrobust  estimation. 

9.  F:  Replace  the  word  Marginal  with  Stratified  CP  or 
Gap  Time.  The  Marginal  approach  does  not  use 
(Start,  Stop)  columns  in  its  layout. 

10.  T 

11.  T 

12.  T 

13.  T 

14.  T 

15.  F:  The  choice  among  the  CP,  Stratified  CP,  Gap 
Time,  and  Marginal  approaches  depends  on  carefully 
considering  the  interpretation  of  each  approach. 


t(f) 

nf 

nif 

Of 

R(t(f)} 

Si(t(f)) 

0 

3 

0 

0 

{A,  S,  CJ 

1.00 

10 

3 

1 

0 

{A,  S,  C] 

0.67 

20 

2 

1 

0 

{A,  S] 

0.33 

70 

1 

1 

0 

{A} 

0.00 

16. 
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O  OO  'O  \t  (N 

+-> 

r-’ 

1 

18.  t(f)  nf 

20  40  60  80  100 

mf  qf  R(t(f)]  S2(t(f))  Gap  Time 

0  3 

0 

0  [A,  S,  C]  1.00 

10  3 

1 

0  [A,  S,  C]  0.67 

20  2 

1 

0  {A,  C]  0.33 

30  1 

1 

0  [C] 

0.00 

19.  S2c(t)  Gap  Time 

1.0  - 

— 

1 

.8  - 

L 

.6  - 

1 

.4  - 

L_. 

.2  - 

“I - - 1 - 1 - 1 - 1 — 

20  40  60  80  100 


t(f) 

nf 

mf 

Of 

R(t(f)] 

S2(t(f))  Marginal 

0 

3 

0 

0 

[A,  S,  C] 

1.00 

30 

3 

1 

0 

[A,  S,  C] 

0.67 

40 

2 

1 

0 

[A,  C} 

0.33 

90 

1 

1 

0 

[A] 

0.00 

2 1 .  S2m(t)  Marginal 

t.O  -| - 

.8  - 
.6  - 
.4  - 
0.2  - 

- 1 - 1 - 1 - 1 - - 1 - 

20  40  60  80  100 

22.  All  three  plots  differ  because  the  risk  sets  for  each  plot 
are  defined  differently  inasmuch  as  the  failure  times 
are  different  for  each  plot. 
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Introduction 


This  chapter  considers  survival  data  in  which  each  subject 
can  experience  only  one  of  several  different  types  of  events 
over  follow-up.  This  situation  contrasts  with  the  topic  of 
the  preceding  chapter  in  which  subjects  could  experience 
more  than  one  event  of  a  given  type.  When  only  one  of 
several  different  types  of  event  can  occur,  we  refer  to  the 
probabilities  of  these  events  as  “competing  risks,”  which 
explains  the  title  of  this  chapter. 

Modeling  competing  risks  survival  data  can  be  carried  out 
using  a  Cox  model,  a  parametric  survival  model,  or  models 
that  use  the  cumulative  incidence  (rather  than  survival). 
In  this  chapter,  we  mainly  consider  the  Cox  model  because 
of  its  wide  popularity  and  also  because  of  the  availability  of 
computer  programs  that  use  the  Cox  model  for  analysis 
of  competing  risks. 

The  typical  (“cause-specific”)  approach  for  analyzing  com¬ 
peting  risks  data  is  to  perform  a  survival  analysis  for  each 
event  type  separately,  where  the  other  (competing)  event 
types  are  treated  as  censored  categories.  There  are  two 
primary  drawbacks  to  the  above  method.  One  problem  is 
that  the  above  method  requires  the  assumption  that  com¬ 
peting  risks  are  independent.  The  second  problem  is  that 
the  generalized  Kaplan-Meier  (KM)-based  product-limit 
survival  curve  obtained  from  fitting  separate  Cox  models 
for  each  event  type  has  questionable  interpretation  when 
there  are  competing  risks. 

Unfortunately,  if  the  independence  assumption  is  incorrect, 
there  is  no  direct  methodology  available  for  analyzing  com¬ 
peting  risks  simultaneously.  The  only  “indirect”  method  for 
addressing  this  problem  involves  carrying  out  a  “sensitivity 
analysis”  that  treats  subjects  with  events  from  competing 
risks  as  all  being  event-free  or  as  all  experiencing  the  event 
of  interest.  An  example  of  this  “sensitivity”  approach  is 
provided. 
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The  primary  alternative  summary  curve  to  the  KM-based 
survival  curve  is  the  “cumulative  incidence  curve  (CIC),” 
which  estimates  the  “marginal  probability”  of  an  event 
(both  terms  are  defined  in  this  chapter).  This  CIC  is  not 
estimated  using  a  product-limit  formulation,  and  its 
computation  is  not  included  in  mainstream  statistical 
packages.  Moreover,  the  independence  of  competing  risks 
is  still  required  when  a  proportional  hazard  model  is  used 
to  obtain  hazard  ratio  estimates  for  individual  competing 
risks  as  an  intermediate  step  in  the  computation  of  a  CIC. 
Nevertheless,  the  CIC  has  a  meaningful  interpretation  in 
terms  of  treatment  utility  regardless  of  whether  competing 
risks  are  independent.  A  variation  of  the  CIC,  called  the 
“conditional  probability  curve  (CPC),”  provides  a  risk 
probability  conditional  on  an  individual  not  experiencing 
any  of  the  other  competing  risks  by  time  t. 

An  equivalent  approach  to  the  cause-specific  method  for 
analyzing  competing  risks  is  called  the  Lunn-McNeil  (LM) 
approach.  The  LM  approach  allows  only  one  model  to 
be  fit  rather  than  separate  models  for  each  event  type 
and,  moreover,  allows  flexibility  to  perform  statistical 
inferences  about  simpler  versions  of  the  LM  model.  This 
approach  has  added  appeal  in  that  competing  events  are 
not  considered  as  simply  being  censored.  Nevertheless,  as 
with  the  cause-specific  approach,  the  LM  method  assumes 
the  independence  of  competing  risks. 
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Abbreviated 

Outline 


The  outline  below  gives  the  user  a  preview  of  the  material 
covered  by  the  presentation.  A  detailed  outline  for  review 
purposes  follows  the  presentation. 

I.  Overview  (page  430) 

II.  Examples  of  Competing  Risks  Data 

(pages  430-432) 

III.  Byar  Data  (pages  433-434) 

IV.  Method  1:  Separate  Models  for  Different 
Event  Types  (pages  434-437) 

V.  The  Independence  Assumption  (pages  437-443) 

VI.  Cumulative  Incidence  Curves  (CIC) 

(pages  444-453) 

VII.  Conditional  Probability  Curves  (CPC) 

(pages  453-455) 

VIII.  Method  2 — The  Lunn-McNeil  (LM)  Approach 

(pages  455-461) 

IX.  Method  2a:  Alternative  Lunn-McNeil  (LMait) 
Approach  (pages  461-464) 

X.  Method  1  (Separate  Models)  versus 

Method  2  (LM  Approach)  (pages  465-468) 

XI.  Summary  (pages  468-473) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1 .  State  or  recognize  examples  of  competing  risks 
survival  data. 

2.  Given  competing  risks  data,  outline  the  steps  needed 
to  analyze  such  data  using  separate  Cox  models. 

3.  Given  computer  output  from  the  analysis  of 
competing  risk  data,  carry  out  an  analysis  to  assess 
the  effects  of  explanatory  variables  on  one  or  more  of 
the  competing  risks. 

4.  State  or  describe  the  independence  assumption 
typically  required  in  the  analysis  of  competing 
risks  data. 

5.  Describe  how  to  carry  out  and/or  interpret  a 
“sensitivity  analysis”  to  assess  the  independence 
assumption  about  competing  risks. 

6.  State  why  a  survival  function  obtained  from 
competing  risk  data  using  the  Cox  model  has  a 
questionable  interpretation. 

7.  State  or  describe  the  “cumulative  incidence”  approach 
for  analyzing  competing  risks  data. 

8.  Given  competing  risk  data,  describe  how  to  calculate  a 
CIC  and/or  a  CPC  curve. 

9.  Given  competing  risks  data,  outline  the  steps  needed 
to  analyze  such  data  using  the  Lunn-McNeil  method. 

10.  Given  computer  output  from  fitting  either  a  LM  or 
LMait  model,  carry  out  an  analysis  to  assess  the  effect 
of  explanatory  variables  on  one  or  more  of  the 
competing  risks. 
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Presentation 


I.  Overview 


In  this  chapter,  we  consider  survival  data  in 
which  each  subject  can  experience  only  one 
of  different  types  of  events  over  follow-up. 
The  probabilities  of  these  events  are  typically 
referred  to  as  competing  risks.  We  describe 
how  to  use  the  Cox  PH  model  to  analyze  such 
data,  the  drawbacks  to  this  approach,  and  some 
approaches  for  addressing  these  drawbacks. 


II.  Examples  of  Competing 
Risks  Data 


1 .  Dying  from  either  lung  cancer 
or  stroke 

2.  Advanced  cancer  patients 
either  dying  from  surgery  or 
getting  hospital  infection 

3.  Soldiers  dying  in  accident  or  in 
combat 

4.  Limb  sarcoma  patients 
developing  local  recurrence, 
lung  metastasis,  or  other 
metastasis  over  follow-up 


Each  example  above  allows  only 
one  event  out  of  several  possible 
events  to  occur  per  subject 

If  event  not  death,  then  recurrent 
events  are  possible 

Competing  risks  +  recurrent  events 
beyond  scope  of  this  chapter 


Competing  risks  occur  when  there  are  at  least 
two  possible  ways  that  a  person  can  fail,  but 
only  one  such  failure  type  can  actually  occur. 
For  example, 

1 .  A  person  can  die  from  lung  cancer  or  from  a 
stroke,  but  not  from  both  (although  he  can 
have  both  lung  cancer  and  atherosclerosis 
before  he  dies); 

2.  Patients  with  advanced-stage  cancer  may 
die  after  surgery  before  their  hospital  stay 
is  long  enough  for  them  to  get  a  hospital 
infection; 

3.  Soldiers  in  war  may  die  during  combat  or 
may  die  by  (e.g.,  traffic)  accident; 

4 .  In  a  clinical  trial,  patients  with  nonmetastatic 
limb  sarcoma  undergoing  chemotherapy  and 
surgery  might  develop  a  local  recurrence, 
lung  metastasis,  or  other  metastasis  after 
follow-up. 

For  each  of  the  above  examples,  the  possible 
events  of  interest  differ,  but  only  one  such 
event  can  occur  per  subject.  Note,  however,  if 
at  least  one  of  the  possible  event  types  does  not 
involve  death,  it  is  also  possible  that  such  events 
can  recur  over  follow-up.  Thus,  although  the 
analysis  of  recurrent  events  that  also  involves 
competing  risks  may  be  required,  this  more 
complex  topic  is  beyond  the  scope  of  this  chapter 
(see  Tai  et  al.,  2001). 
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Objective:  assess 

Xu  X2t. . Xp  =>  Failure  rate 

(survival  probability) 

for  any  one  event  allowing  for 
competing  risks  from  other  possi¬ 
ble  events 


A  logical  objective  for  competing  risks  data  is 
to  assess  the  relationship  of  relevant  predictors 
to  the  failure  rate  or  corresponding  survival 
probability  of  any  one  of  the  possible  events 
allowing  for  the  competing  risks  of  the  other 
ways  to  fail. 


Another  objective  We  might  also  want  to  compare  the  failure 

Compare  hazard  rate  for  event  A  rates  (e.g.,  using  a  hazard  ratio)  for  two  or 
with  hazard  rate  for  event  B  more  possible  events,  controlling  for  relevant 

predictors. 


Lung  Cancer  vs.  Stroke  ( 1 ) 

HRlc(E  vs.  not  E)  =  1? 

(allowing  for  competing  risk  from 
stroke) 


In  the  lung  cancer  versus  stroke  example 
above,  we  might  ask  whether  the  lung  cancer 
death  rate  in  “exposed"  persons  is  different 
from  the  lung  cancer  rate  in  “unexposed”  per¬ 
sons,  allowing  for  the  possibility  that  subjects 
could  have  died  from  a  stroke  instead. 


HR(LC  vs.  Stroke)  =  1? 
(controlling  for  predictors) 


We  might  also  want  to  know  if  the  lung  cancer 
death  rate  differs  from  the  stroke  death  rate 
controlling  for  predictors  of  interest. 


Surgery  Death  vs.  Hospital 
Infection  (2) 


HRHospinf(E  vs.  not  E)  =  1  ?  (allow¬ 
ing  for  competing  risk  from  surgery) 

Note:  death  from  surgery  reduces 
number  of  hospital  infections  to  be 
treated 


In  the  second  example,  the  competing  risks  are 
death  from  surgeiy  versus  development  of  a 
hospital  infection.  For  infection  control  inves¬ 
tigators,  the  hospital  infection  event  is  of  pri¬ 
mary  interest.  Nevertheless,  the  occurrence  of 
death  from  surgery  reduces  the  burden  of  hos¬ 
pital  infection  control  required.  Thus,  the  esti¬ 
mation  of  hospital  infection  rates  are 
complicated  by  the  competing  risk  of  death 
from  surgery. 


Accidental  Death  vs.  Combat 
Death  (3) 

HRcombat  (E  vs.  not  E)  (allowing 
competing  risk  of  accidental  death) 

Suppose  entire  company  dies  at 
accident  time  t  before  entering  combat 

ScombatW  =  P(Tcombat  >  t)  =  1 
where  TCOmbat  =  time  to  combat  death 


The  third  example  involves  competing  risks  of 
death  from  either  combat  or  accident  in  a  com¬ 
pany  of  soldiers.  Here,  primary  interest  con¬ 
cerns  the  hazard  ratio  for  combat  death 
comparing  two  exposure  groups.  Suppose  the 
entire  company  dies  at  time  t  in  a  helicopter 
accident  on  their  way  to  a  combat  area. 
Because  no  one  died  in  combat  by  time  t,  the 
survival  probability  of  not  dying  in  combat  is 
one,  even  though  no  combat  took  place. 
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However, 

Tc+a  =  combat  or  accidental  death 

a- 

“event  free”  Sc+A(t)  =  P(TC+A  >  t)  =  0 
Moreover, 

Skm(Tcombat  >  t) 

is  undefined  because  the  risk  set  is 
empty  at  time  t 

Competing  Risks  Data  Survival 
Curve  Interpretation? 


Limb  sarcoma  patients  (4) 
Competing  risks 

1  =  local  recurrence,  2  =  lung  meta¬ 
stasis,  or  3  =  other  metastasis 

HRC(E  vs.  not  E),  c  =  1,  2,  3  (allow¬ 
ing  for  competing  risk  from  other 
two  failure  types) 

HR(Lung  Metastasis  vs.  Local 
Recurrence)?  Controlling  for 
Predictors 


No  failure  types  involve  death 

Recurrent  events  possible 

But  can  use  classical  competing 
risk  methods  if  focus  on  only  first 
failure 


However,  if  we  define  the  outcome  of  interest 
as  death  from  either  combat  or  accident,  the 
"event  free”  survival  probability  is  zero  after 
the  accident  occurred  (at  time  t). 

Moreover,  the  KM  survival  probability  for 
combat  death  at  time  t  is  undefined  because 
no  one  was  at  risk  for  a  combat  death  at  time  t. 


This  example  points  out  that  when  there  are 
competing  risks,  the  interpretation  of  a  sur¬ 
vival  curve  may  be  difficult  or  questionable 
(more  on  this  issue  later). 

In  the  fourth  example  involving  limb  sarcoma 
patients,  the  competing  risks  are  the  three  fail¬ 
ure  types  shown  at  the  left. 

In  this  study,  the  investigators  wanted  hazard 
ratios  for  each  failure  type,  allowing  for  com¬ 
peting  risks  from  the  other  two  failure  types. 


It  was  also  of  interest  to  compare  the  failure 
rates  for  lung  metastasis  versus  local  recur¬ 
rence  (or  any  other  two  of  the  three  failure 
types),  controlling  for  relevant  predictors. 

Because  none  of  the  failure  types  involves 
death,  recurrent  events  are  possible  for  any  of 
the  three  failure  types.  If,  however,  the  infor¬ 
mation  on  only  the  first  failure  is  targeted,  the 
classical  competing  risk  methodology  described 
in  this  chapter  can  be  applied. 
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III.  Byar  Data 


•  Randomized  clinical  trial 

•  Compare  treatments  for  Stage 
III  and  IV  prostate  cancer 

•  Rx  status:  placebo  or  one  of 
3  dose  levels  of  DES 


We  now  introduce  an  example  of  competing 
risks  survival  analysis  of  data  from  a  rando¬ 
mized  clinical  trial  (Byar  and  Green,  1980) 
comparing  treatments  for  prostate  cancer.  We 
henceforth  refer  to  this  as  the  Byar  data. 
Patients  with  Stages  III  (local  extension 
beyond  the  prostate  gland)  and  IV  (distant 
metastases,  elevated  acid  phosphatase,  or 
both)  prostate  cancer  were  randomized  to 
receive  either  a  placebo  or  one  of  three 
dose  levels  of  the  active  treatment  diethylstil- 
bestrol  (DES). 


Competing  risks:  deaths  from 

Cancer  (main  focus) 

CVD 

Other 

Covariate  information  collected 
Some  predictors  grouped 


In  this  study,  patients  could  die  from  prostate 
cancer,  cardiovascular  disease,  or  other 
causes.  Covariate  information  was  also  col¬ 
lected  to  account  for  the  possible  influence  of 
predictors  on  survival.  These  data  have  been 
analyzed  extensively  (Byar  and  Corle,  1977, 
Kay,  1986,  and  Lunn  and  McNeil,  1995). 
Some  grouping  of  the  predictors  was  consid¬ 
ered  to  be  clinically  meaningful. 


Predictors 

Value 

Category 

Treatment  (Rx) 

0 

Placebo,  0.2  mg 
DES 

1 

1.0,5  mg  DES 

Age  at  diagnosis 

0 

<74  years 

Diagnosis  (Age) 

1 

75-79  years 

2 

>80  years 

Standardized3 

0 

>100 

weight  (Wt) 

1 

80-99 

2 

>80 

Performance 

0 

Normal 

status  (PF) 

1 

Limitation  of 
activity 

History  of 

0 

No 

CVD  (Hx) 

1 

Yes 

Hemoglobin  (Hg) 

0 

>  12  g/100  ml 

1 

9.0-1 1.9  g/100  ml 

2 

<9  g/100  ml 

Size  of  the 

0 

<30  cm2 

primary 
lesion  (SZ) 

1 

>30  cm2 

Gleeson 

0 

<10 

Score+(SG) 

1 

>10 

3  weight  (kg)  -  height  (cm)  +  200 
+  index  of  tumor  invasiveness/ 
aggressiveness 


Key  risk  factors  related  to  the  primary  outcome 
of  interest  (cancer  deaths)  and  the  appropriate 
grouping  is  shown  at  the  left. 


Primary  interest  was  to  assess  the  effect  of 
treatment  (Rx)  adjusted  for  relevant  risk  fac¬ 
tors  in  the  presence  of  the  competing  risks. 
Notice  from  the  table  that  the  Rx  variable  is 
grouped  into  a  binary  variable  by  coding  sub¬ 
jects  receiving  the  placebo  or  0.2  mg  of  DES  as 
0  and  coding  subjects  receiving  1.0  or  5.0  mg  of 
DES  as  1. 
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Independence  assumption  (dis¬ 
cussed  later) 

Next 

Analysis  of  competing  risks 
survival  data 

Assume  independent  competing 
risks 


From  a  clinical  perspective,  these  three 
competing  risks  can  be  considered  to  be  inde¬ 
pendent  (e.g.,  failure  from  heart  disease  and / 
or  other  causes  of  death  is  unrelated  to  risk  of 
failure  from  prostate  cancer).  We  discuss  this 
"independence  assumption”  in  more  detail  in  a 
subsequent  section  of  this  chapter. 

We  now  describe  the  approach  typically  used 
to  analyze  competing  risks  data.  This  approach 
assumes  that  competing  risks  are  indepen¬ 
dent.  We  illustrate  this  approach  using  the 
Byar  data. 


IV.  Method  1 :  Separate 
Models  for  Different 
Event  Types 

•  Use  Cox  (PH)  model 

•  Estimate  separate  hazards  or 
HRs  for  each  failure  type 

•  Other  (competing)  failure  types 
are  treated  as  censored 

•  Persons  lost  to  follow-up  or 
withdrawal  are  also  censored 


The  typical  approach  for  analyzing  competing 
risks  data  uses  the  Cox  (PH)  model  to  sepa¬ 
rately  estimate  hazards  and  corresponding 
hazard  ratios  for  each  failure  type,  treating 
the  other  (competing)  failure  types  as  censored 
in  addition  to  those  who  are  censored  from  loss 
to  follow-up  or  withdrawal.  We  refer  to  this 
approach  as  Method  1  because  we  later 
describe  an  alternative  approach  (Method  2) 
that  requires  only  a  single  model  to  be  fit. 


If  only  one  failure  type  of  interest 


a- 

Estimate  only  one  hazard  or  HR 


If  only  one  failure  type  is  of  primary  interest, 
then  the  analysis  might  be  restricted  to  esti¬ 
mating  hazards  or  hazard  ratios  for  that  type 
only  (but  still  treating  the  competing  failure 
types  as  censored). 


Cause-specific  hazard  function 

hc(t)  =  lim  P(t  <  Tc  <  t  +  AtjTc  >  t) /At 

where  Tc  =  time-to-failure  from 
event  c 

c  =  1,  2,  . . .,  C  (#  of  event  types) 


To  describe  this  method  mathematically,  we 
define  the  cause-specific  hazard  function 

shown  at  the  left.  The  random  variable  Tc 
denotes  the  time-to-failure  from  event  type  c. 
Thus,  hc(t)  gives  the  instantaneous  failure  rate 
at  time  t  for  event  type  c,  given  not  failing  from 
event  c  by  time  t. 


Cox  PH  cause-specific  model 

(event-type  c): 


hc(t.X) 


h0c(t)  exp 


E  PieXi 


i=l 


c  =  1 . C 


Using  a  Cox  PH  model  that  considers  predic¬ 
tors  X  =  (Xi,  X2,. . .,  Xp),  the  cause-specific 
hazard  model  for  event  type  c  has  the  form 
shown  at  the  left.  Note  that  pic,  the  regression 
coefficient  for  the  ith  predictor,  is  subscripted 
by  c  to  indicate  that  the  effects  of  the  predictors 
may  be  different  for  different  event  types. 


pic  allows  effect  of  X;  to  differ  by 
event-type 
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Byar  Data  Example 


Competing  Risks:  Cancer,  CVD,  Other 

Cause-specific  model:  Cancer 
No-interaction  model: 

head,  X)  =  hoca(t)exp[p1CaRx  +  p2c»Age 
+  P3CaWt  +  IhcaPF  +  PsCaHx 
+  P6CaHG  +  P7CaSZ  +  PgCaSG] 

HRCa(RX  =  1  vs.  RX  =  0)  =  exp[p1Ca] 
CVD  and  Other  deaths  are  censored 


Cause-specific  model:  CVD 

hcvi>(  I  ,X)  —  h0cvD(t)exp[PicvDltx  +  fhcvoAge 
-4-  P:-;cvnWt  -i-  P4cvdPF  +  PscvdHx 

+  P6cvdHG  +  PtcvdS/  +  (l8CVI»SG  | 

HRcVD(RX  =  1  vs.  RX  =  0)  =  exp[plcVD] 
Cancer  and  Other  are  censored 

Cause-specific  model:  Other 

HoTh(L.X)  =  hooTH(t)exp[PioTHRx  +  iGoTH'Vge 
+  P30THWt  +  P40THPF  +  PsOThHx 

+  P60THttG  +  p70TI[S7.  +  pHOTllSCi  | 

Cancer  and  CVD  are  censored 


We  illustrate  the  above  model  using  the  Byar 
data  involving  the  three  competing  risks  and 
the  eight  predictors. 

A  no-interaction  cause-specific  model  for 
Cancer  death  (Ca)  is  shown  at  the  left.  From 
this  model,  the  hazard  ratio  for  the  effect  of  Rx 
controlling  for  the  other  variables  is  exp[p1Ca]. 

Because  Cancer  is  the  event-type  of  interest, 
the  two  competing  event-types,  CVD  and  Other, 
need  to  be  treated  as  censored  in  addition  to 
usual  censored  observations  (i.e.,  for  persons 
who  are  either  lost  to  follow-up  or  withdraw 
from  the  study). 

Similarly,  if  CVD  is  the  event-type  of  interest, 
the  cause-specific  no-interaction  hazard  model 
and  the  hazard  ratio  formula  for  the  effect  of 
treatment  is  shown  at  the  left,  and  the  event 
types  Cancer  and  Other  would  be  treated  as 
censored. 

And  finally,  if  Other  is  the  event- type  of  interest, 
the  cause-specific  no-interaction  hazard  model 
and  the  hazard  ratio  formula  for  the  effect  of 
treatment  is  shown  at  the  left,  and  the  event 
types  Cancer  and  CVD  would  be  treated  as 
censored. 


Table  9.1.  Edited  Output  for  Cancer 
with  CVD  and  Other  Censored 


Var 

DF 

Coef 

Std. 

Err. 

P  >  lZl 

Haz. 

Ratio 

Rx 

1 

-0.550 

0.170 

0.001 

0.577 

Age 

1 

0.005 

0.142 

0.970 

1.005 

Wt 

1 

0.187 

0.138 

0.173 

1.206 

PF 

1 

0.253 

0.262 

0.334 

1.288 

Hx 

1 

-0.094 

0.179 

0.599 

0.910 

HG 

1 

0.467 

0.177 

0.008 

1.596 

SZ 

1 

1.154 

0.203 

0.000 

3.170 

SG 

1 

1.343 

0.202 

0.000 

3.830 

Log  likelihood  =  —  771.174 


Edited  output  for  each  of  the  above  three 
cause-specific  models  is  now  presented. 


First,  we  show  the  results  for  the  event  type 
Cancer,  treating  CVD  and  Other  as  censored 
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HRCa(RX  =  1  vs.  RX  =  0) 

=  exp(— 0.550)  =  0.577 

Wald  ChiSq  =  (-.550/.170)2 
=  10.345  (P  =  0.001) 

Signif.  below  .01  level 


From  this  output,  the  adjusted  HR  for  the  effect 
of  Rxis  0.577  (=1/1.73). 

The  P-value  for  a  two-tailed  Wald  test  is  0.001; 
thus  Rx  has  a  significant  positive  effect  on  sur¬ 
vival  for  Cancer  death  with  competing  risks 
from  CVD  and  Other  deaths. 


95%  Cl  for  exp[p1Ca]:  Also,  the  95%  confidence  interval  for  this  HR  is 

exp[— 0.550  ±  1.96(0.170)]  (0-413,  0.807)  =  (1/2.43,  1/1.24). 

=  (0.413,  0.807) 


Table  9.2.  Edited  Output  for  CVD  with 
Cancer  and  Other  Censored 


Var 

DF 

Coef 

Std. 

Err. 

P  >  lZl 

Haz. 

Ratio 

Rx 

1 

0.354 

0.174 

0.042 

1.425 

Age 

1 

0.337 

0.134 

0.012 

1.401 

Wt 

1 

0.041 

0.150 

0.783 

1.042 

PF 

1 

0.475 

0.270 

0.079 

1.608 

Hx 

1 

1.141 

0.187 

0.000 

3.131 

HG 

1 

0.018 

0.202 

0.929 

1.018 

SZ 

1 

-0.222 

0.364 

0.542 

0.801 

SG 

1 

-0.023 

0.186 

0.900 

0.977 

Log  likelihood  = 

-763.001 

////cvd(PX  =  1 

vs.  RX  = 

0) 

=  exp(0.354)  =  1.425 

Wald  ChiSq  = 

(.354/.174)2 

=  4.220  (P  = 

=  0.042) 

Signif.  at  .05  level 

95%  Cl  for  expfpicvi)]: 

exp. [0.354  ±  1.96(0.174)] 

=  (1.013,  2.004) 

Table  9.3.  Edited  Output  for  Other  with 

Cancer  and  CVD  Censored 

Std. 

Haz. 

Var  DF  Coef 

Err. 

P  >  lZ| 

Ratio 

Rx  1  -0.578 

0.279 

0.038 

0.561 

Age  1  0.770 

0.204 

0.000 

2.159 

Wt  1  0.532 

0.227 

0.019 

1.702 

PF  1  0.541 

0.422 

0.200 

1.718 

Hx  1  0.023 

0.285 

0.935 

1.023 

HG  1  0.357 

0.296 

0.228 

1.428 

SZ  1  0.715 

0.423 

0.091 

2.045 

SG  1  -0.454 

0.298 

0.127 

0.635 

Log  likelihood  =  —297.741 


We  next  provide  edited  output  when  the  event- 
type  is  CVD,  treating  Cancer  and  Other  as 
censored. 


Here,  the  adjusted  HR  for  the  effect  of  Rx  is  1 .425. 

The  P-value  for  a  two-tailed  Wald  test  is  0.042; 
thus,  Rx  has  a  significant  (P  <  .05)  but  negative 
effect  on  survival  for  CVD  death  with  compet¬ 
ing  risks  from  Cancer  and  Other  deaths. 

The  95%  confidence  interval  for  this  HR  is 
(1.013,  2.004). 


Last,  we  provide  edited  output  when  the  event- 
type  is  Other,  treating  Cancer  and  CVD  as 
censored. 
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HROTh(RX  =  1  vs.  RX  =  0) 
=  exp(— 0.580)=  0.560 


Here,  the  adjusted  HR  for  the  effect  of  Rx  is 
0.561  (=  1/  1.78). 


Wald  ChiSq  =  (-.578/.279)2 
=4.29  (P=0.038) 

Signif.  at  .05  level 


The  P-value  for  a  two-tailed  Wald  test  is  .038; 
thus,  Rx  has  a  significant  (P  <  .05)  protective 
effect  on  survival  for  Other  deaths  with  com¬ 
peting  risks  from  Cancer  and  CVD  deaths. 


95%  Cl  for  explpioTn):  The  95%  confidence  interval  for  this  HR  is 

exp[— 0.578-1 — 1  .96(0.279)]  (0.325,0.969),  which  is  somewhat  imprecise. 

=(0.325,  0.969) 


Not  assessed  in  the  above  analysis: 
PH  assumption 

Interaction  of  Rx  with  control  vari¬ 
ables 


We  have  thus  completed  a  competing  risk 
analysis  af  the  Byar  data  assuming  that  a  no¬ 
interaction  Cox  PH  model  is  appropriate.  We 
haven’t  actually  checked  the  PH  assumption 
for  any  of  the  variables  in  the  model  nor  have 
we  assessed  whether  there  is  significant  inter¬ 
action  between  Rx  and  the  other  variables 
being  controlled.  Typically,  these  situations 
should  be  explored  to  ensure  a  more  appropri¬ 
ate  analysis. 


V.  The  Independence 
Assumption 

Censoring:  a  major  concern  in 
survival  analysis 

Right-censoring  vs.  left-censoring 

I 

•  more  often 

•  our  focus 


At  the  beginning  of  this  text  in  Chapter  1,  we 
introduced  the  concept  of  censoring  as  a  major 
concern  for  the  analysis  of  survival  data.  We 
distinguished  between  right-  and  left-censoring 
and  indicated  our  focus  in  the  text  would  be  on 
right-censoring,  which  occurs  more  often. 


Typical  assumption: 

Censoring  is  Independent 

•  Required  for  all  approaches/ 
models  described  to  this  point 

•  Relevant  for  competing  risks 


We  also  introduced  in  Chapter  1  an  important 
assumption  about  censoring  that  is  typically 
assumed  for  all  approaches/models  for  analyz¬ 
ing  survival  data  described  up  to  this  point, 
including  data  with  competing  risks.  This 
assumption  is  often  stated  as  follows:  censor¬ 
ing  is  independent. 


Two  other  (different)  assumptions 
Random  censoring 
Non-informative  censoring 


In  addition,  we  distinguished  (in  Chapter  1) 
“independent”  censoring  from  two  other 
assumptions:  “random”  censoring  and  "non- 
informative”  censoring,  but  emphasized  the 
importance  of  the  independence  assumption. 
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Independent  censoring: 

Chapter  1  context:  no  competing  risks 

h(t  |  G,  Ce)  =  h(t  |  G,  NCe)  where 

G  denotes  any  subgroup  of  subjects 
at  risk  at  time  t 

h(t  |  G,  Ce)  denotes  hazard  for  cen¬ 
sored  subjects  in  subgroup  G 
h(t  |  G,  NCe)  denotes  hazard  for  non- 
censored  subjects  in  subgroup  G 

Bias  possible: 

S(t)  may  over-estimate  S(t) 
if 

large  proportion  of  subjects  censored 
at  time  t  actually  fail  after  time  t 

Independent  censoring  with 
competing  risks 

11 

Different  types  of  censoring: 

•  failure  from  competing  risks 

•  lost-to-follow-up 

•  withdrawal 

•  end  of  study 


EXAMPLE  (Byar  data) 


3  competing  risks: 

Cancer,  CVD,  or  Other  deaths 
Independent  censoring? 

Suppose  censoring  is  independent 
and  Harry  censored  at  time  t 

J) 

hca(t  |  G,  Ce)  =  hCa(t  |  G,  NCe) 


Suppose  censoring  is  not 
independent  and  Harry  died  from 
CVD  or  Other  Cause  at  time  t 

J) 

hCa(t  |  G,  Ce)  ^  hCa(t  |  G,  NCe) 


In  Chapter  1 ,  we  defined  independent  censoring 
in  a  context  that  assumed  the  absence  of 
competing  risks  as  follows: 

Within  any  subgroup  of  interest,  subjects  who  are 
censored  at  time  t  should  be  representative  of  all 
subjects  in  that  subgroup  who  remained  at  risk  at 
time  t  with  respect  to  their  survival  experience. 


Non-independent  censoring  unfortunately  can 
lead  to  biased  results  in  a  survival  analysis. 
A  bias  can  result  if  people  who  get  censored  are 
more  likely  to  fail  than  those  not  censored.  Thus, 
the  estimated  survival  probability  at  any  time 
t  may  over-estimate  the  true  survival  probability 
at  time  t  if  a  large  proportion  of  those  with 
unknown  status  (i.e.,  censored)  actually  failed. 

When  the  survival  analysis  problem  involves 
competing  risks,  the  requirement  of  indepen¬ 
dent  censoring  has  the  additional  complication 
that  there  are  different  types  of  censoring  that  are 
possible.  That  is,  when  focusing  on  the  cause- 
specific  hazard  for  event-type  A,  say,  competing 
risks  other  than  A  are  also  considered  as  censored 
in  addition  to  standard  censorship  from  lost- 
to-follow-up,  withdrawal  or  ending  of  the  study. 

For  example,  in  the  Byar  data  set,  there  were 
three  competing  risks  of  interest,  Cancer,  CVD, 
or  Other  deaths.  What,  then,  must  we  assume  if 
censoring  in  this  study  were  independent? 

Suppose  censoring  is  independent  and  we 

focus  on  cause-specific  deaths  for  Cancer, 
then  any  subject  (e.g.,  Harry)  in  the  risk  set  at 
time  t  with  a  given  set  (G)  of  covariates  who  is 
censored  at  time  t  is  presumed  to  have  the 
same  failure  rate  as  any  noncensored  subject 
in  the  risk  set  with  the  same  set  of  covariates 
regardless  of  whether  the  reason  for  censoring 
is  either  a  CVD  or  Other  death,  withdrawal 
from  study,  or  lost-to-follow-up. 

On  the  other  hand,  suppose  censoring  were 
not  independent,  then  if  Harry  was  censored 
at  time  t  because  he  died  from  CVD  or  Other 
causes  at  time  t,  Harry’s  (unknown)  failure  rate 
at  time  t  for  dying  of  Cancer  would  differ  from 
the  Cancer  failure  rate  for  noncensored  sub¬ 
jects  at  time  t  (who  didn’t  previously  die  of 
Cancer,  CVD,  or  Other  causes  prior  to  time  t). 
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Important  assumption  for 
competing  risks 

Censoring  is  independent 

regardless  of  different  types  of  cen¬ 
soring  possible 

Synonym:  Competing  risks  are 
independent 


Questions  about  independence 
assumption 

1 .  How  can  we  determine  whether 
this  assumption  is  satisfied? 

2.  How  can  we  proceed  with  the 
analysis  to  consider  the 
possibility  that  the  assumption 
is  not  satisfied? 

Answer  to  1 : 

We  can  never  explicitly  prove  the 
assumption  is  satisfied  for  given 
data. 

For  example,  Byar  data:  Cancer 
death 

Then  can’t  determine  would  have 
died  from  Cancer  if  hadn’t  died 
from  CVD. 

CVD  death 

11 

Cancer  death  unobservable 
In  general 

Failure  from  competing  risk  A 

11- 

Failure  from  competing  risk  B 
unobservable 


Answer  to  2: 

Alternative  strategies  available 
but  no  strategy  is  always  best 


The  important  message  at  this  point  when 
analyzing  competing  risks  survival  data  is  that 
it  is  typically  assumed  that  censoring  is  inde¬ 
pendent  regardless  of  the  different  ways  that 
censoring  can  occur,  including  failure  from 
competing  risks  other  than  the  cause-specific 
event-type  of  interest.  A  synonymous  expres¬ 
sion  is  to  say  that  competing  risks  are  inde¬ 
pendent,  which  we  henceforth  adopt  in  our 
remaining  discussion  of  this  topic. 

So,  if  we  typically  require  that  competing  risks 
are  independent,  (1)  how  can  we  determine 
whether  this  assumption  is  satisfied  and  (2)  how 
can  we  proceed  with  the  analysis  to  consider  the 
possibility  that  the  assumption  is  not  satisfied? 


Unfortunately,  the  answer  to  the  first  question  is 
that  we  can  never  explicitly  prove  that  compet¬ 
ing  risks  are  or  are  not  independent  for  a  given 
dataset.  For  example,  in  the  Byar  dataset,  we 
cannot  determine  for  certain  whether  a  subject 
who  died  from,  say,  CVD  at  time  t  would  have 
died  from  Cancer  if  he  hadn’t  died  from  CVD. 


In  other  words,  dying  from  Cancer  at  time  t  is  an 
unobservable  outcome  for  a  subject  who  died 
from  CVD  at  or  before  time  t.  More  generally, 
failure  from  a  competing  risk  at  time  t  is  unob¬ 
servable  for  a  subject  who  has  already  failed 
from  a  different  competing  risk  up  to  time  t. 

Because  we  can  never  fully  determine  whether 
competing  risks  are  independent,  how  can  we 
proceed  with  the  analysis  of  competing  risks 
survival  data?  The  answer  is  that  there  are  sev¬ 
eral  alternative  strategies,  but  no  one  strategy 
that  is  always  best. 
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Strategy  1  One  strategy  is  to  decide  on  clinical/biological/ 

Decide  assumption  satisfied  on  other  grounds  without  any  data  analysis  that  the 
clinical/biological/other  grounds  independence  assumption  is  satisfied  and  then 

carry  out  the  analysis  assuming  independence. 

For  example,  suppose  the  two  competing  risks 
are  Cancer  deaths  and  CVD  deaths.  Then  you 
may  decide  that  the  assumption  of  indepen¬ 
dent  competing  risks  is  reasonable  if  at  any 
time  t,  subjects  who  were  censored  because  of 
CVD  death  were  no  more  or  less  likely  to  have 
died  from  Cancer. 


EXAMPLE  OF  STRATEGY 
1  —CANCER  VS.  CVD 


Decide  independence  if  subjects  who 
were  censored  because  of  CVD  death 
were  no  more  or  less  likely  to  have 
died  from  Cancer. 


Strategy  2 

Include  common  risk  factors  for 
competing  risks  in  survival  model 


EXAMPLE  OF  STRATEGY 
2— CANCER  VS.  CVD 


Include  age  smoking  in  model  to 
remove  the  common  effects  of  these 
variables  on  competing  risks. 


A  second  strategy  is  to  measure  those  variables 
that  are  common  risk  factors  for  competing 
risks  being  considered  and  then  include  those 
variables  in  the  survival  model.  For  example, 
with  Cancer  and  CVD,  perhaps  including  age 
and  smoking  status  in  the  survival  model  might 
remove  common  effects  on  competing  risks. 


Criticism  of  Strategies  1  and  2 

Assumptions  cannot  be  verified 
with  observed  data 


A  criticism  of  each  of  the  above  strategies  is 
that  they  both  rely  on  assumptions  that  cannot 
be  verified  with  the  observed  data. 


Strategy  3 

Use  a  sensitivity  analysis 

•  Considers  “worst-case” 

violations  of  the  independence 
assumption 


Another  strategy  (3)  that  can  be  used  is  a  sensi¬ 
tivity  analysis.  As  with  Strategies  1  and  2,  a  sen¬ 
sitivity  analysis  cannot  explicitly  demonstrate 
whether  the  independence  assumption  is  satis¬ 
fied.  However,  this  strategy  allows  the  estima¬ 
tion  of  parameters  by  considering  “worst-case” 
violations  of  the  independence  assumption. 


Sensitivity  analysis 

•  Determines  extreme  ranges  for 
estimated  parameters  of  one’s 
model 


Thus,  using  a  sensitivity  analysis,  the  investigator 
can  determine  extreme  ranges  for  the  estimated 
parameters  in  one’s  model  under  violation  of  the 
independence  assumption. 


If  “worst-case”  not  meaningfully 
different  from  independence 
then 

at  most  a  small  bias  when 
assuming  independence 


If  such  "worst-case”  results  do  not  meaningfully 
differ  from  results  obtained  under  the  indepen¬ 
dence  assumption,  then  the  investigator  may 
conclude  that  at  most  a  small  bias  can  result 
from  an  analysis  that  assumes  independence. 
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If  “worst-case”  meaningfully 
different  from  independence 
then 

only  extreme  of  bias  but  not 
actual  bias  is  determined 


If,  on  the  other  hand,  the  sensitivity  analysis 
provides  results  that  meaningfully  differ  from 
results  obtained  under  the  independence 
assumption,  the  investigator  learns  only  the 
extremes  to  which  the  results  could  be  biased 
without  adjusting  for  the  actual  bias. 


EXAMPLE  BYAR  DATA 


Cause-specific  focus:  Cancer 

Censored:  CVD  deaths.  Other  deaths, 
usual  censoring 

Worst-case  situations: 

1.  CVD  or  Other  deaths  are  assumed  to 
die  of  cancer  instead 

2.  CVD  or  Other  deaths  assumed  to 
survive  as  long  as  the  largest  survival 
time  observed  in  the  study 


We  now  illustrate  how  a  sensitivity  analysis  can 
be  carried  out  using  the  Byar  data,  where  we 
focus  on  the  cause-specific  survival  for  Cancer 
deaths,  treating  CVD  and  Other  deaths  as  cen¬ 
sored  in  addition  to  usual  censoring. 

The  following  two  worst-case  situations  are 
considered.  (1)  All  subjects  that  are  censored 
because  of  CVD  or  Other  deaths  are  assumed 
to  die  of  cancer  instead.  (2)  All  subjects  that  are 
censored  because  of  CVD  or  Other  deaths  sur¬ 
vive  as  long  as  the  largest  survival  time 
observed  in  the  study. 


Table  9.4.  Edited  Output  for  Cancer 
Worst-Case  (1) 


var 

DF 

Coef 

Std.Err. 

p  >  M 

Haz. 

Ratio 

Rx 

1 

-0.185 

0.110 

0.092 

0.831 

Age 

1 

0.286 

0.087 

0.001 

1.332 

Wt 

1 

0.198 

0.093 

0.032 

1.219 

PF 

1 

0.402 

0.170 

0.018 

1.495 

Hx 

1 

0.437 

0.112 

0.000 

1.548 

HG 

1 

0.292 

0.120 

0.015 

1.339 

SZ 

1 

0.672 

0.159 

0.000 

1.958 

SG 

1 

0.399 

0.115 

0.001 

1.491 

Log  likelihood  =  —1892.091 


Table  9.4  and  Table  9.5  give  edited  output  for 
the  above  two  scenarios  followed  by  a  repeat  of 
the  output  previously  shown  (Table  9.1)  under 
the  independence  assumption. 

To  carry  out  worst-case  scenario  (1),  the  Status 
variable  (indicating  whether  a  subject  failed 
or  was  censored)  was  changed  in  the  dataset 
from  0  to  1  for  each  subject  that  had  a  CVD  or 

Other  death. 


Table  9.5.  Edited  Output  for  Cancer 
Worst-Case  (2) 


Var 

DF 

Coef 

Std.Err. 

P  >  M 

Haz. 

Ratio 

Rx 

1 

-0.411 

0.169 

0.015 

0.663 

Age 

1 

-0.118 

0.139 

0.394 

0.888 

Wt 

1 

0.086 

0.138 

0.532 

1.090 

PF 

1 

0.125 

0.254 

0.622 

1.133 

Hx 

1 

-0.266 

0.179 

0.138 

0.767 

HG 

1 

0.314 

0.169 

0.063 

1.369 

SZ 

1 

0.825 

0.197 

0.000 

2.282 

SG 

1 

1.293 

0.201 

0.000 

3.644 

Log  likelihood  =  —839.631 


For  worst-case  scenario  (2),  the  longest  survival 
time  observed  in  the  study  was  76  weeks.  Thus, 
the  survival  time  for  each  subject  that  had  a 
CVD  or  Other  death  was  changed  in  the  dataset 
from  the  actual  time  of  death  to  76  weeks. 

To  evaluate  the  results  of  the  sensitivity  analy¬ 
sis,  we  need  to  compare  the  output  in  Table  9.1, 
which  assumes  that  competing  risks  are  inde¬ 
pendent,  with  output  for  worst-case  situations 
provided  in  Table  9.4  and  Table  9.5.  We  focus 
this  comparison  on  the  estimated  coefficient  of 
the  Rx  variable. 
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Table  9.1  (Repeated).  Edited  Output  for 
Cancer  with  CVD  and  Other  censored 
(Assumes  Competing  Risks  Independent) 


Var 

DF 

Coef  Std.Err.  P>|z| 

Haz. 

Ratio 

Rx 

1 

-0.550  0.170 

0.001 

0.577 

Age 

1 

0.005  0.142 

0.970 

1.005 

Wt 

1 

0.187  0.138 

0.173 

1.206 

PF 

1 

0.253  0.262 

0.334 

1.288 

Hx 

1 

-0.094  0.179 

0.599 

0.910 

HG 

1 

0.467  0.177 

0.008 

1.596 

SZ 

1 

1.154  0.203 

0.000 

3.170 

SG 

1 

1.343  0.202 

0.000 

3.830 

Log  likelihood  =  —771.174 

Var 

DF 

Coef  Std.  Err. 

p  >  Izl  Haz.  Ratio 

Worst-Case  (1): 

Rx 

1 

-0.185  0.110 

0.092  0.831 

Worst-Case  (2): 

Rx 

1 

-0.411  0.169 

0.015  0.663 

Independent  competing  risks: 

Rx 

1 

-0.550  0.171 

0.001  0.577 

WC(1) 

WC(2) 

Independent 

HRs 

0.831 

0.663 

0.577 

P-values 

0.092 

0.015 

0.001 

(N.S.) 

(<•05) 

(<  <-01) 

Independence 

Nonindependence 

X 

_[ 

] 

.577  .663  .831 


The  first  line  of  output  corresponding  to  the  Rx 
variable  is  shown  at  the  left  for  both  worst-case 
scenarios  together  with  the  output  obtained 
from  assuming  independent  competing  risks. 


These  results  for  the  RX  variable  show  consid¬ 
erable  differences  among  all  three  scenarios. 
In  particular,  the  three  estimated  hazard  ratios 
are  0.831  (=  1/1.20),  0.663  (=1/1.51),  and  .577 
(=1/1.73).  Also,  the  P-values  for  the  signifi¬ 
cance  of  the  effect  of  Rx  (0.092,  0.015,  .001) 
lead  to  different  conclusions  about  the  effect 
of  Rx. 

Note  that  the  HR  obtained  from  assuming 
independence  does  not  lie  between  the  HRs 
from  the  two  worst-case  scenarios.  This  should 
not  be  surprising  because  both  worst-case 
scenarios  assume  non-independence. 


If 

competing  risks  not  independent 
then 

conclusions  about  the  effect  of  Rx 
could  be  very  different 


These  results  suggest  that  if  the  competing 
risks  were  not  independent,  then  the  conclu¬ 
sions  about  the  effect  of  Rx  could  be  somewhat 
different. 
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But, 

•  Have  not  demonstrated 
whether  independence 
assumption  satisfied 

•  Have  not  obtained  correct 
results  under  violation  of 
independence  assumption 


However,  these  results  do  not  demonstrate 
whether  the  independence  assumption  is  satis¬ 
fied,  nor  do  they  provide  estimates  of  the 
unbiased  hazard  ratios  and  corresponding 
Wald  tests  under  violation  of  the  independent 
assumption. 


Worst-case  (1) 

More  departure  from  indepen¬ 
dence 

More  realistic 
More  emphasis 
than 

Worst-case  (2) 


Worst-case  (1)  gives  more  departure  from  inde¬ 
pendence  than  worst-case  (2).  It  can  also  be 
argued  that  worst-case  (1)  is  more  realistic 
and  thus  should  be  emphasized  more  than 
worst-case  (2),  because  subjects  who  were  cen¬ 
sored  because  of  CVD  or  Other  deaths  would 
not  be  expected  to  survive  the  entire  study  if 
they  hadn’t  died. 


Sensitivity  analysis:  approaches  can 
vary  for  example, 

•  Randomly  select  subset  of  50% 
(or  25%)  of  subjects  censored 
with  CVD  or  Other  deaths 

•  Assume  everyone  in  subset  dies 

of  Cancer 


The  previous  observation  suggests  that  the 
investigator  can  vary  the  approach  used  to 
either  carry  out  or  interpret  such  a  sensitivity 
analysis.  For  example,  an  alternative  approach 
would  be  to  modify  worst-case  (1)  by  randomly 
selecting  a  subset  of  50%  (or  25%)  of  subjects 
censored  with  CVD  or  Other  deaths  and  then 
assuming  that  everyone  in  this  subset  dies  of 
Cancer  instead. 


Main  point: 

Sensitivity  analysis  is  one  of  sev¬ 
eral  strategies  to  address  concern 
about  independence  assumption 

Evaluates  how  badly  biased  the 
results  can  get  if  independence 
not  satisfied 


In  any  case,  the  main  point  here  is  that  a  sensi¬ 
tivity  analysis  of  the  type  we  have  illustrated  is 
one  of  several  strategies  that  can  be  used  to 
address  concern  about  the  independence 
assumption.  Such  a  sensitivity  analysis  allows 
the  investigator  to  evaluate  how  badly  biased 
the  results  could  get  if  the  independence 
assumption  is  not  satisfied. 


Nevertheless 

•  No  method  to  directly  assess 
independence  assumption 

•  Typical  analysis  assumes 
independence  assumption  is 
satisfied 


Nevertheless,  as  mentioned  earlier,  there  is  no 
method  currently  available  that  can  directly 
assess  the  independence  assumption  nor  guar¬ 
antee  correct  estimates  when  the  independence 
assumption  is  violated.  Consequently,  the  typi¬ 
cal  survival  analysis  assumes  that  the  indepen¬ 
dence  assumption  is  satisfied  when  there  are 
competing  risks,  even  if  this  is  not  the  case. 
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VI.  Cumulative  Incidence 
Curves  (CIC) 

Survival  curves  S(t): 

provide  summary  information  over 

time  of  survival  experience 

KM:  empirical  approach  for  esti¬ 
mating  survival  curves 

Adjusted  survival  curves:  general¬ 
ized  version  of  KM  using  a  regres¬ 
sion  model  to  adjust  for  covariates 

Up  to  now:  One  event-type  of 

interest  (no  competing 
risks) 

Competing  risks:  KM  may  not  be  as 
informative  as  when  only  one  risk 


Hypothetical  Study 


•  n  =  100  subjects 

•  All  subjects  with  prostate  cancer 


Survt  (months) 

#  Died 

Cause 

3 

99 

CVD 

5 

1 

Cancer 

Study  goal:  cause-specific  cancer 
survival 

Censored:  CVD  deaths 


Table  9.6.  Hypothetical  Survival  Data 


f 

tf 

nf 

mf 

qr 

Sca(tf)  ~  KM 

0 

0 

100 

0 

0 

1 

1 

3 

100 

0 

99 

1 

2 

5 

1 

1 

- 

0 

We  have  previously  discussed  (Chapter  I  and 
beyond)  the  use  of  survival  curves  to  provide 
summary  information  over  time  of  the  survival 
experience  of  (suh)  groups  of  interest.  The 
Kaplan-Meier  (KM)  approach  (Chapter  2),  also 
called  the  product-limit  approach,  is  a  widely 
used  empirical  method  for  estimating  survival 
culves.  A  generalized  version  of  KM  can  be  used 
with  a  regression  (e.g.,  Cox)  model  to  estimate 
adjusted  survival  curves  (Chapter  3)  that  account 
for  covariates.  Up  to  now,  such  survival  curves 
have  been  described  only  for  the  situation  when 
there  is  only  one  event-  type  of  interest. 


When  competing  risks  are  being  considered, 
the  KM  survival  curve  may  not  be  as  informa¬ 
tive  as  with  only  one  risk. 

Consider  the  following  hypothetical  scenario: 
a  5-month  follow-up  of  100  subjects  with  (say, 
prostate)  cancer.  Suppose  that  at  3  months 
from  start  of  follow-up,  99  of  the  100  subjects 
die  from  CVD.  And  at  5  months,  the  1  remain¬ 
ing  subject  dies  from  prostate  cancer. 


The  goal  of  the  study  is  to  determine  the  cause- 
specific  survival  experience  for  cancer  mortality, 
where  a  CVD  death  is  considered  as  censored. 

Table  9.6  summarizes  the  survival  experience  in 
this  hypothetical  study.  The  first  five  columns 
of  this  table  show  the  ordered  failure-time 
interval  number  (f),  the  time  of  failure  (tf),  the 
number  in  the  risk  set  (nfi,  the  number  who  fail 
(mf),  and  the  number  who  are  censored  at  each 
failure  time  (qfi,  assuming  that  a  subject  who 
died  of  CVD  at  a  given  time  is  censored  at  that 
time.  The  last  column  shows  the  KM  survival 
probabilities  SCa(tf)  for  cause-specific  cancer  at 
each  failure  time. 
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Risk  set  at  tf  =  5:  1  subject 
Pr(T  >  5  |  T  >  5)  =  (1  -  l)/2  =  0 


KMCa:  SCa  (t  =  5) 

=  S(t  =  4)  x  Pr(T  >  5|T  >  5) 
=  1  x  0 

0 


KMca  =  o  =►  Riskca  (T  <  5) 

=1-0=1 


Nevertheless, 


1  cancer  death 
100  initial  subjects 


0.01  (small) 


Question: 

How  many  of  the  99  CVD  deaths 
would  have  died  of  cancer  at  t  =  5 
if  they  hadn’t  died  of  CVD  at  t  =  3  ? 

Cannot  answer:  unobservable 


Table  9.7.  Hypothetical  Survival  Data 
Sensitivity  Analysis  A  (99  CVD  Deaths  of 
Cancer  at  t  =  5) 


f 

tf 

nf 

nif 

Of 

SCa(tf)^KM 

0 

0 

100 

0 

0 

1 

1 

3 

100 

0 

0 

1 

2 

5 

100 

100 

0 

0 

From  this  table,  we  see  that  there  is  only  one 
subject  in  the  risk  set  at  5  months,  and  that  this 
subject  fails  at  month  5.  The  conditional  prob¬ 
ability  of  surviving  past  5  months  given  sur¬ 
vival  up  to  5  months  is  (1  —  1)/1  =0,  so  that 
the  KM  survival  probability  at  5  months  is  0. 


Thus,  use  of  the  KMCa  curve  in  the  presence 
of  competing  risks  (for  CVD),  suggests  that 
the  5-month  risk  for  cancer  death  is  1 ;  that  is, 
1  —  SCa  (t  =  5).  Nevertheless,  because  99  patients 
died  of  CVD  instead  of  cancer,  the  proportion  of 
the  initial  100  subjects  who  died  of  cancer  is  .01, 
a  very  small  “risk”  in  contrast  to  the  KM-based 
“risk”  of  1 . 

A  natural  question  at  this  point  is,  how  many  of 
the  99  patients  who  died  of  CVD  at  3  months 
would  have  died  of  cancer  by  5  months  instead 
if  they  hadn’t  died  of  CVD? 

Unfortunately,  we  cannot  ever  answer  this 
question  because  those  dying  of  CVD  cannot 
be  observed  further  once  they  have  died. 

But  we  can  consider  a  sensitivity-type  of  analysis 
to  see  what  might  happen  under  certain  alterna¬ 
tive  scenarios.  Suppose,  for  example,  that  all 
99  subjects  who  died  of  CVD  at  3  months 
would  have  died  of  cancer  at  5  months  if  they 
hadn’t  died  of  CVD.  Also  assume  as  before  that 
the  100th  subject  survived  up  to  5  months  but 
then  immediately  died.  The  survival  experience 
for  this  situation  is  shown  in  Table  9.7.  Notice 
that  the  KM  survival  probability  at  month  5  is  0, 
which  is  the  same  value  as  obtained  in  the 
original  dataset. 


446  9.  Competing  Risks  Survival  Analysis 


KM  method  assumes  non- 
informative  (i.e.,  independent)  cen¬ 
soring 

D 

Pr(T  >  5|censored  at  month  3) 
Pr(T  >  5|survived  to  month  5)  =  0 

a- 

99  CVDs  deaths  would  have 
been  cancer  deaths  at  month  5 


Table  9.8.  Hypothetical  Survival  Data 
Sensitivity  Analysis  B  (99  CVD  Deaths  of 
survive  past  t  =  5) 


f 

tf 

nf 

mf 

Of 

Sca(tf)^KM 

0 

0 

100 

0 

0 

1 

1 

3 

100 

0 

0 

1 

2 

5 

100 

1 

99 

0.99 

The  reason  why  Tables  9.6  and  9.7  give  the 
same  5-month  survival  probability  (=0)  is  that 
the  KM  method  assumes  independent  censor¬ 
ing.  For  the  original  data  (Table  9.6),  indepen¬ 
dent  censoring  requires  that  those  who  were 
censored  at  month  3  were  as  likely  to  have  died 
from  cancer  at  month  5  as  those  who  were  in  the 
risk  set  at  month  5.  Because  the  one  person  in 
the  risk  set  at  month  5  actually  died  from  cancer, 
then  the  KM  method  assumes  that  all  99  CVD 
deaths  being  viewed  as  censored  would  have 
been  cancer  deaths  at  month  5,  which  is  what 
is  represented  in  Table  9.7. 

Now  let’s  consider  a  different  version  (B)  of  a 
sensitivity  analysis.  Suppose  that  all  99  sub¬ 
jects  who  died  of  CVD  at  3  months  would  not 
have  died  of  cancer  at  5  months  if  they  hadn’t 
died  of  CVD.  Also  assume  as  before  that  the 
100th  subject  survived  up  to  5  months  but 
then  immediately  died.  The  survival  experience 
for  this  situation  is  shown  in  Table  9.8. 


Table  9.8:  SCa(t  =  5)  =  0.99 
different  from 
Table  9.6:  SCa(t  =  5)  =  0 


The  KM  survival  probability  at  month  5  is  0.99 
(i.e.,  close  to  1),  which  is  very  different  from  the 
value  of  0  obtained  in  the  original  dataset 
(Table  9.6). 


Focus  on  1  -  S(t)  =  Risk:  If  we  then  focus  on  1  -  S(t)  instead  of  S(t), 

RiskCa(T  <  5)  =  1  —  0.99  =  0.01  sensitivity  analysis  B  suggests  that  the  5-month 

risk  for  cancer  death  is  0.01  (i.e.,  1  —  0.99). 


Table  9.6:  RiskCa(T  <  5)  =  1 
derived  from  the  data 

Table  9.8:  RiskCa(T  <  5)  =  0.01 
derived  from  sensitivity  analysis 

but  also  derived  directly  from 
data  as  a  marginal  probability 


We  thus  see  that  the  KM-based  risk  of  1  com¬ 
puted  from  the  actual  data  (Table  9.6)  is  quite 
different  from  the  KM-based  risk  of  .01  com¬ 
puted  in  Table  9.8,  where  the  latter  derives 
from  a  sensitivity  analysis  that  does  not  use 
the  actual  data.  Note,  however,  that  a  “risk” 
of  .01  for  cancer  death  can  be  derived  directly 
from  the  actual  data  by  treating  the  CVD 
deaths  as  cancer  survivors.  That  is,  .01  is  the 
proportion  of  all  subjects  who  actually  devel¬ 
oped  cancer  regardless  of  whether  they  died 
from  CVD.  This  proportion  is  an  example  of 
what  is  called  a  marginal  probability. 
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Which  is  more  informative, 

RiskCa(T<  5)  =  1  or  0.01? 

Answer:  both  informative 

“Risk”  of  .01  considers  treatment 
utility 

for  example,  proportion  of  cancer 
patients  needing  treatment 


“Risk”  of  1  considers  etiology, 
providing  competing  risks  are 
independent 

for  example,  cancer  survival  is 
unlikely  after  5  months 


Main  point 

KM  survival  curve  may  not  be  very 

informative 

•  Requires  independence 
assumption  about  competing 
risks 

•  Independence  assumption 
cannot  be  verified 

Alternative  to  KM:  Cumulative 
Incidence  Curve  (CIC)  uses 
marginal  probabilities 


Only  one  risk:  CIC  =  1  KM 

CIC  with  competing  risk 

•  Derived  from  cause-specific 
hazard  function 

•  Estimates  marginal 
probability  when  competing 
risks  are  present 

•  Does  not  require  independence 
assumption 


So  which  of  these  two  “risk”  estimates  (1  vs.  01) 
is  more  informative?  Actually,  they  are  both 
informative  in  different  ways. 


The  "risk”  of  .01  is  informative  from  the  stand¬ 
point  of  treatment  utility  for  cancer  because  in 
these  data,  the  proportion  of  cancer  patients 
needing  treatment  is  quite  small  when  allowing 
for  competing  risks. 

On  the  other  hand,  the  “risk”  of  1,  correspon¬ 
ding  to  the  survival  probability  of  0,  is  informa¬ 
tive  from  an  etiologic  standpoint  providing 
competing  risks  are  independent;  for  example, 
cancer  patients  who  don’t  die  of  CVD  would  be 
expected  to  die  from  their  cancer  by  5  months; 
that  is,  cancer  survival  is  unlikely  after  5  months. 

The  main  point  of  the  above  illustration  is 
that  when  there  are  competing  risks,  the  KM 
survival  curve  may  not  be  very  informative 
because  it  is  based  on  an  independence  assum¬ 
ption  about  competing  risks  that  cannot  be 
verified. 


This  has  led  to  alternative  approaches  to  KM 
for  competing  risk  data.  One  such  alternative, 
called  the  Cumulative  Incidence  Curve  (CIC), 
involves  the  use  of  marginal  probabilities 
as  introduced  above.  (Kalbfleisch  and  Prentice, 
1980). 

In  the  simplest  case,  if  there  is  only  one  risk, 
the  CIC  is  (1  -  KM).  With  competing  risks, 
however,  the  CIC  is  derived  from  a  cause- 
specific  hazard  function,  provides  estimates 
of  the  “marginal  probability”  of  an  event  in 
the  presence  of  competing  events,  and  does 
not  require  the  assumption  that  competing 
risks  are  independent. 


448  9.  Competing  Risks  Survival  Analysis 


Marginal  probabilities: 

•  useful  to  assess  treatment  utility 
in  cost-effectiveness  analyses 

•  example:  0.01  =  5 -month 
marginal  probability  for 
Cancer  (Table  9.6) 


Such  marginal  probabilities  are  relevant  to 
clinicians  in  cost-effectiveness  analyses  in  which 
risk  probabilities  are  used  to  assess  treatment 
utility.  For  example,  the  0.01  (5-month)  marginal 
probability  for  cancer  derived  from  the  hypothet¬ 
ical  data  in  Table  9.6  illustrates  small  treatment 
utility  for  cancer. 


Steps  to  construct  CIC: 

1 .  Estimate  hazard  at  ordered 
failure  times  tf  for  event-type 
(c)  of  interest: 

hc(tf)  =  mcf/nf 

where 

mcf  =  #  of  events  for  event-type 
c  at  time  tf 

nf  =  #  of  subjects  at  risk  at  time  tj 


How  does  one  construct  a  CIC?  We  first  esti¬ 
mate  the  hazard  at  ordered  time  points  tf  when 
the  event  of  interest  occurs.  This  hazard  esti¬ 
mate  is  simply  the  number  of  events  that  occur 
at  tf  divided  by  the  number  at  risk  at  tf  (analo¬ 
gous  to  the  KM  estimate).  We  can  write  this  as 
hc(tf)  =  mcf/nf  where  the  mcf  denotes  the  num¬ 
ber  of  events  for  risk  c  at  time  tf  and  nf  is  the 
number  of  subjects  at  that  time.  Thus,  at  any 
particular  time,  mcf/nf  is  the  estimated  propor¬ 
tion  of  subjects  failing  from  risk  c. 


2.  Estimate 

S(tf_i)  =  overall  survival 
probability  of 
surviving  previous 
time  (tf_i) 

overall  =>■  subject  survives  all 
other  competing 
events 

3.  Compute  estimated  incidence 
of  failing  from  event-type  c  at 
time  tf: 

Ic  (tf)  =  S(tf_f)  x  hc  (tf ) 


To  be  able  to  fail  at  time  tg  the  subject  needs  to  be 
“around  to  fail”,  i.e.,  he  must  have  survived  the 
previous  time  when  a  failure  occurred.  The  prob¬ 
ability  of  surviving  the  previous  time  tf_f  is 
denoted  S(tf_f),  where  S(t)  denotes  the  overall 
survival  curve  rather  than  the  cause-specific  sur¬ 
vival  curve  Sc(t).  We  must  consider  “overall”  sur¬ 
vival  here,  since  the  subject  must  have  survived 
all  other  competing  events. 

The  probability  (i.e.,  incidence)  of  failing  from 
event-type  c  at  time  tf  is  then  simply  the  pro¬ 
bability  of  surviving  the  previous  time  period 
multiplied  by  hc(tf). 


4.  The  cumulative  incidence  (CICc)  for  event- 

type  c  at  time  tf  is  then  the  cumulative  sum  up 
to  time  tf  (i.e.,  from  f=l  to  f=f)  of  these  inci¬ 
dence  values  over  all  event-type  c  failure  times. 


CICc(tf)  =  ^ic(tf)  =  J2  S(tp-i)hc(tp) 


f'=l 


f'=l 


CIC  =  1  —  KM  no  competing  risks 
but 

CICc(tf)  /  1  -  KMC  competing  risks 
since 

f 

1  -KMc  =  ^Sc(tf  ,)hc(tf) 

f'=i 

(censor  method) 


Although,  as  previously  mentioned,  the  CIC  is 
equal  to  1  —  KM  when  there  are  no  competing 
risks,  formula  4  for  CICc(tf)  differs  from  1  -  KMC 
when  there  are  competing  risks.  In  particular, 
the  CIC  formula  (4)  uses  the  overall  survival 
function  S(t)  that  counts  events  from  competing 
risks  in  addition  to  the  event-type  of  interest  as 
failures.  In  contrast,  the  formula  for  1  —  KMC 
uses  the  event-type-specific  survival  function 
Sc(t),  which  treats  failures  from  competing  risks 
as  censored  observations;  this  formula  has  been 
called  the  censor  method  (Arriagada  et  al.,1992). 
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Example  of  CIC  calculation 

•  n  =  24  subjects 

•  all  subjects  receive  treatment 
XRT  for  head  and  neck 
cancer 

Survival  time  in  (months) 


Died  of  disease:  0.7,  3,  4.9,  6,  6,  6.9, 
10,  10.8,  17.1,  20.3 
Died  of  other  causes:  1.5,  2.8,  3.8, 
4.7,  7,  10,  10,  11.2 
Censored:  3.2,  7.6,  10,  11,  15,  24.4 


Table  9.9.  CIC  calculation  Hypothetical 
data 


tf 

nf 

mf 

4,(tf) 

SOn) 

Utf) 

Cicwt,) 

0 

24 

0 

0 

- 

0 

0 

0.7 

24 

1 

0.042 

1.000 

0.042 

0.042 

1.5 

23 

0 

0 

0.958 

0 

0.042 

2.8 

22 

0 

0 

0.916 

0 

0.042 

3.0 

21 

1 

0.048 

0.875 

0.042 

0.083 

3.2 

20 

0 

0 

0.833 

0 

0.083 

3.8 

19 

0 

0 

0.833 

0 

0.083 

4.7 

18 

0 

0 

0.789 

0 

0.083 

4.9 

17 

1 

0.059 

0.745 

0.044 

0.127 

6 

16 

2 

0.125 

0.702 

0.088 

0.215 

6.9 

14 

1 

0.071 

0.614 

0.044 

0.259 

7.0 

13 

0 

0 

0.570 

0 

0.259 

7.6 

12 

0 

0 

0.526 

0 

0.259 

10 

11 

1 

0.091 

0.526 

0.048 

0.307 

10.8 

7 

1 

0.143 

0.383 

0.055 

0.361 

11.0 

6 

0 

0 

0.328 

0 

0.361 

11.2 

5 

0 

0 

0.328 

0 

0.361 

15 

4 

0 

0 

0.262 

0 

0.361 

17.1 

3 

1 

0.333 

0.262 

0.087 

0.449 

20.3 

2 

1 

0.5 

0.175 

0.087 

0.536 

24.4 

1 

0 

0 

0.087 

0 

0.536 

We  illustrate  the  calculation  of  a  CIC  through 
an  example. 

Consider  another  hypothetical  study  involving 
24  individuals  receiving  radiotherapy  (XRT) 
for  the  treatment  of  head  and  neck  cancer. 
Patients  may  either  die  of  the  disease  (cancer), 
other  causes  or  still  be  alive  at  the  time  of 
analysis. 

The  data  are  shown  on  the  left. 

The  calculations  required  for  the  cumulative 
incidence  curve  CICca  for  the  event-type 
“death  from  cancer  (i.e.,  ca)”  are  shown  in 

Table  9.9. 

From  the  table,  we  can  see  that  the  highest  CIC 
probability  of  0.536  is  reached  when  t=  20.3 
weeks  when  the  last  observed  event  occurred. 
Thus,  the  cumulative  risk  (i.e.,  marginal  prob¬ 
ability)  for  a  Cancer  death  by  week  20  is  about 
53.6%  when  allowing  for  the  presence  of  com¬ 
peting  risks  for  CVD  and  Other  Deaths. 


Because  the  CIC  curve  describes  “cumulative 
incidence”,  a  plot  of  the  curve  starts  at  0  when 
t=0  and  is  a  nondecreasing  function  up  until 
the  latest  time  of  individual  follow-up  (t=24.4). 
We  show  on  the  left  the  graphs  of  both  the 
CICca  and  1  —  KMca.  Notice  that  1  -  KMca 
overestimates  the  probability  of  failure  for  the 
event-type  “death  from  cancer  (ca).” 
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CIC  Summary 

•  Gives  marginal  probability. 

•  Does  not  use  product  limit 
formulation 

•  Not  included  in  mainstream 
commercially  avilable 
statistical  packages  (e.g.,  SAS, 
STATA,  SPSS) 

Independence  of  competing  risks 
not  required  for  CIC  approach. 

Nevertheless,  CIC  requires 

h(t)  =  hci(t)  +  hc2(t)  +  . . .  +  hck(t) 
where 

h(t)  =  overall  hazard 

hc(t)  =  hazard  for  event  type  c 

Note:  Satisfied  if 

•  Mutually  exclusive  event  types 

•  Nonrecurrent  events 

Comparing  CIC’s  for  two  or  more 
groups: 

•  Statistical  test  available  (Gray, 
1989) 

•  Analogous  to  log-rank  test 

•  No  independence  assumption 

•  Does  not  adjust  for  covariates 

Cumulative  incidence:  Death  due  to  cancer 


Thus,  as  the  example  illustrates,  the  “marginal 
probability”  estimated  by  the  CIC  does  not  use 
a  product-limit  (i.e.,  KM)  formulation.  More¬ 
over,  the  computation  of  a  CIC  is  currently  not 
included  in  mainstream  commercially  available 
statistical  packages. 


As  mentioned  earlier,  the  assumption  of  inde¬ 
pendent  competing  risks  is  not  required  for  the 
calculation  of  the  CIC,  in  contrast  to  the  KM 
survival  curve,  which  requires  this  assumption. 

Nevertheless,  the  CIC  does  require  that  the 
overall  hazard  is  the  sum  of  the  individual 
hazards  for  all  the  risk  types  (Kalbfleisch  and 
Prentice,  1980).  The  latter  assumption  will 
be  satisfied,  however,  whenever  competing 
risks  are  mutually  exclusive  and  events  are 
nonrecurrent,  i.e.,  one  and  only  one  event  can 
occur  at  any  one  time  and  only  once  over  time. 


Gray  (1988)  developed  a  test  to  compare  two  or 
more  CICs.  This  test  is  analogous  to  the  log- 
rank  test.  The  independence  assumption  is  not 
required.  However,  this  test  does  not  adjust  for 
covariates. 


The  plot  shown  on  the  left  gives  the  CICs  for  the 
two  treatments  for  the  Byar  data  that  we  origi¬ 
nally  introduced  in  Section  III  of  this  chapter. 
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Gray’s  test  results:  y_2=  6.6,  df  =  1  Using  Gray’s  test  to  compare  the  two  CICs 
P-value:  0.01  shown  in  the  plot,  we  find  the  two  curves  to 

be  significantly  different  (P=0.01). 


PH  model  used  to  obtain  CIC 

If 

Independence  of  competing  risks 
required 

(but  CIC  meaningful  for  treatment 
utility) 


So  far,  we  have  described  the  CIC  without 
considering  (Cox  PH)  models  that  account  for 
covariates.  However,  when  a  PH  model  is 
used  to  obtain  hazard  ratio  estimates  for  indi¬ 
vidual  competing  risks  as  an  intermediate  step 
in  the  computation  of  a  CIC,  the  indepen¬ 
dence  of  competing  risks  is  required.  In  any 
case,  the  CIC  has  a  meaningful  interpretation 
in  terms  of  treatment  utility  regardless  of 
whether  competing  risks  are  independent. 


Modeling  CIC  with  covariates  using 
PH  model:  Fine  and  Gray  (1999) 

(CIC  also  called  subdistribution 

function) 

Software  available  (Accord,  1997) 


Fine  and  Gray  ( 1 999)  provide  methodology  for 
modeling  the  CIC  with  covariates  using  a  pro¬ 
portional  hazards  assumption.  They  refer  to 
the  CIC  curves  as  subdistribution  functions. 
Software  is  available  that  allows  for  such 
models  to  be  fitted  (Accord,  1997). 


Fine  and  Gray  model  analogous  to 
Cox  PH  model: 

•  use  a  hazard  function  defined 
from  a  CIC 

•  effects  of  predictors  (e.g.,  HRs) 
have  similar  interpretation 


The  CIC  models  developed  by  Fine  and  Gray 
are  analogous  to  the  Cox  PH  model  but,  for  any 
failure  type,  they  model  a  hazard  function 
(also  called  a  sub-distribution  hazard)  derived 
from  a  CIC.  The  results  from  fitting  these 
models  have  a  similar  interpretation  regarding 
the  effects  of  predictors  in  the  model  as  can  be 
derived  from  the  (standard)  Cox  PH  model 
approach  for  competing  risks  data. 


Sub-distribution  hazard  function 
(for  event-type  c): 


hc, cic  (t)  =  limA^o 


Pr(t  <  Tc  <  t  +  At|B) 
At 


Where  B  =  (Tc  >  t  or  Tc.  <  t,  c'  *  c 
A  A 


either 

^survival  >  t  from  event-typej^. 


failure  <  t  from  a  different  event-type  (c') 


The  (sub-distribution)  hazard  function  for 
event-type  c  used  in  the  Fine  and  Gray  method 
is  defined  by  hc  CIC(t)  on  the  left.  This  function, 
based  on  the  CIC,  gives  the  hazard  rate  for 
a  cause-specific  event  at  time  t  based  on  the 
risk  set  that  remains  at  time  t  after  accounting 
for  all  previously  occurring  event-types  (i.e., 
including  competing  risks). 

In  the  above  hazard  formula,  the  expression 
defined  by  B  to  the  right  of  the  conditioning 
symbol  (|)  in  the  formula  conditions  on  a  sub¬ 
ject’s  surviving  from  event -type  c  past  time  t  or 
on  a  subject’s  failing  from  a  different  event-type 
(e.g.,  c')  at  or  before  time  t;  this  accounts  for  the 
occurrence  of  all  event -types  prior  to  time  t. 
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Equivalent  formula  for  hc  CIC(t): 

fr'CICl 


_  dCICc(t)/dt 
'U  1  -  CICc(t) 


analogous  to 

Hazard  when  no  competing  risks: 

f(t) 


h(t)  = 


S(t) 


where  f(t)  =  probability  density  function 

Pr[f  <  T  <  t  +  At] 


=  lim 

Ar-*  0 


At 


The  analogy: 


Competing  Risks 

No  Competing  Risks 

hc,cic(t) 

h(t) 

dCICc(t)/dt 

f(t)  =  dF(t)/dt 

where 

F(t)  =  Pr(T  <  t) 

l-CICc(t) 

S(t)  =  1  -  F(t) 

Note:  1  —  CICc(t)  is  not  strictly  a  survival 
curve  since  the  formula  for  CICc(t)  uses 
overall  survival  S(t)  in  its  calculation 
rather  than  survival  from  event-type  c, 
i.e.,  Sc(t) 


Fine  and  Gray’s  CIC  model: 

p 

he, cic  (t)  =  h0c,cic(exp)[^yiXi] 
i=l 


PH  assumption  satisfied  in  above 
model: 

p 

HRe,cic(X*,X)  =  expf^^X*  -  Xj) 
i=l 

analogous  to 

HRCoxPh(X*,X)  =  exp[£  pjpq  -  Xi) 
i=l 

Can  use  "extended”  Fine  and 
Gray  model  to  account  for  time- 
dependent  variables. 


An  equivalent  mathematical  expression  for 
hc,cic(t)  is  shown  on  the  left,  where  dCICc(t)/dt 
is  the  derivative  of  the  CIC  function  for  event- 
type  c  at  time  t. 

This  expression  for  hc  CIC(t)  is  analogous  to  a 
similar  expression  shown  on  the  left  for  the 
hazard  function  h(t)  when  there  are  no  com¬ 
peting  risks  (see  Chapter  7,  Section  II).  In  the 
latter  case,  h(t)  is  equal  to  the  probability 
density  function  f(t)  divided  by  the  survival 
function  S(t)  shown  on  the  left. 


To  clarify  the  analogy  further,  the  term  in 
dCICc(t)/dt  in  the  numerator  of  the  formula 
for  hc  CIC(t)  corresponds  to  the  density  function 
f(t)  in  the  numerator  of  h(t).  Note  that  f(t) 
is  equal  to  dF(t)/dt,  where  F(t)  is  cumulative 
distribution  function  F(t);  in  other  words,  the 
derivative  of  CICc(t)  in  hc  CIC(t)  corresponds  to 
the  derivative  of  F(t),  which  is  f(t)  in  the  expres¬ 
sion  for  h(t). 

Also,  when  there  are  no  competing  risks,  S(t) 
equals  1  -  F(t),  which  is  analogous  to  the 
denominator  1  -  CICc(t)  of  hc  Cic(t)-  Strictly 
speaking,  however,  1  -  CICc(t)  is  not  directly 
equivalent  to  a  survival  curve  for  event-type  c, 
since  the  formula  CICc(t)  treats  the  occurrence 
of  events  from  other  event-types  as  failures 
rather  than  as  censored  observations. 

The  CIC  model  developed  by  Fine  and  Gray  is 
shown  on  the  left.  This  model  is  analogous  to 
a  Cox  model,  except  that  the  model  considers 
the  subdistribution  hazard  function  hc  CIC(t) 
instead  of  the  hazard  function  h(t). 

The  model  shown  here  satisfies  the  PH  assump¬ 
tion  for  the  subpopulation  hazard  being  mod¬ 
eled,  i.e.,  the  general  HR  formula  is  essentially 
the  same  as  for  the  Cox  model,  except  that  the 
P’s  in  the  Cox  PH  model  are  now  replaced  by  y’s 
in  the  Fine  and  Gray  model. 


The  Fine  and  Gray  model  can  be  extended  to 
allow  for  variables  not  satisfying  the  PH, 
including  time-dependent  variables. 
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Table  9.10.  Edited  Output  for  Cancer 
with  CVD  and  Other  censored  Byar  data 
(Fine  and  Gray  CIC  approach) 


Var 

DF 

Coef 

Std.Err. 

p>M 

Haz. 

Ratio 

Rx 

1 

-0.414 

0.171 

0.008 

0.661 

Age 

1 

-0.112 

0.145 

0.221 

0.894 

Wt 

1 

0.088 

0.146 

0.274 

1.092 

PF 

1 

0.126 

0.260 

0.313 

1.135 

Hx 

1 

-0.256 

0.182 

0.080 

0.774 

HG 

1 

0.321 

0.191 

0.046 

1.379 

SZ 

1 

0.841 

0.207 

0.001 

2.318 

SG 

1 

1.299 

0.198 

0.001 

3.665 

—2  LOG  L  =  1662.766546 


For  the  Byar  data,  the  fitted  Fine  and  Gray  CIC 
model  that  focuses  on  cancer  deaths  as  the 
event  type  of  interest  is  shown  in  Table  9.10 
below  which  we  have  repeated  in  Table  9.1, 
which  uses  the  standard  competing  risks  Cox 
PH  model  approach. 


Table  9.1.  (Repeated).  Edited  Output  for 
Cancer  with  CVD  and  Other  Censored 
(Standard  Cox  PH  approach) 


Var 

DF 

Coef 

Std.Err. 

P  >  M 

Haz. 

Ratio 

Rx 

1 

-0.550 

0.170 

0.001 

0.577 

Age 

1 

0.005 

0.142 

0.970 

1.005 

Wt 

1 

0.187 

0.138 

0.173 

1.206 

PF 

1 

0.253 

0.262 

0.334 

1.288 

Hx 

1 

-0.094 

0.179 

0.599 

0.910 

HG 

1 

0.467 

0.177 

0.008 

1.596 

SZ 

1 

1.154 

0.203 

0.000 

3.170 

SG 

1 

1.343 

0.202 

0.000 

3.830 

Log  likelihood  =  —771.174 


Although  corresponding  coefficient  estimates 
and  standard  errors  are  different  in  the  two 
outputs,  both  outputs  are  reasonably  similar. 


Fine  and  Gray  CIC 

(Table  9.10) 

Standard  Cox  PH 

(Table  9.1) 

Prx:  -0.414 

-0.550 

HRrx  :  0.661  =  (,  (,3) 

0.577  = 

P-value  :  0.008 

0.001 

For  example,  the  estimated  coefficient  of  Rx  is 
-0.414  in  Table  9.10  vs.  -0.550  in  Table  9.1. 
The  corresponding  hazard  ratio  estimates  (e*3) 
are  0.661  (=1/1.513)  and  0.577  (=1/1.733), 
respectively,  so  that  the  strength  of  the  associ¬ 
ation  is  slightly  weaker  using  the  Fine  and  Gray 
approach  for  these  data,  although  both  hazard 
ratios  are  highly  significant. 


VII.  Conditional 

Probability  Curves 
(CPC) 


A  third  measure  of  failure  risk:  CPC 
(Other  measures:  1-KM  and  CIC) 

CPCc  =  Pr(Tc<  1 1  T>t) 

where  Tc  =  time  until  event  c  occurs 
T  =  time  until  any  compet¬ 
ing  risk  event  occurs 


Another  approach  to  competing  risks  is  called 
the  cumulative  conditional  probability  or 
CPC.  CPCs  provide  a  third  summary  measure, 
in  addition  to  ( 1  -  KM)  and  CIC,  of  the  risk  of 
failure  of  an  event  in  the  presence  of  competing 
risks.  Put  simply,  the  CPCc  is  the  probability  of 
experiencing  an  event  c  by  time  t,  given  that  an 
individual  has  not  experienced  any  of  the  other 
competing  risks  by  time  t. 
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CPCc  =  CICc/(l-CICc0 

where  CICc'  =  CIC  from  risks  other 
than  c 


For  event-type  c,  the  CPC  is  defined  by  CICc 
divided  by  (1  -  CICc0,  where  CICc'  is  the  cumu¬ 
lative  incidence  of  failure  from  risks  other  than 
risk  c  (i.e.,  all  other  risks  considered  together). 


Graphs  of  CPC’s  obtained  from  CIC’s 

Tests  to  compare  CPC’s: 

Pepe  and  Mori  (1993)  —2  curves 
Lunn  (1998)  — g  curves 

Example  of  CPC  calculation 

•  n  =  24  subjects 

•  all  subjects  receive  treatment 
XRT  for  head  and  neck  cancer 

•  CICca  previously  calculated 
(Table  9.9) 


Graphs  of  CPC  curves  can  be  obtained  from 
CIC  curves  and  have  been  studied  by  Pepe- 
Mori  (1993)  and  Lunn  (1998).  Pepe-Mori  pro¬ 
vide  a  test  to  compare  two  CPC  curves.  Lunn 
(1998)  extended  this  test  to  g-groups  and  allow 
for  strata. 

We  illustrate  the  calculation  of  a  CPC  using  the 
previously  described  hypothetical  study  data 
involving  24  individuals  receiving  radiotherapy 
(XRT)  for  the  treatment  of  head  and  neck  can¬ 
cer  (ca).  For  these  data,  the  calculations  for  the 
CICca  were  given  in  previous  Table  9.9,  below 
which  was  shown  the  graph  of  the  CICca  curve. 


Table  9.11  CPC  calculation — 
Hypothetical  data 


tf 

nf 

CKWtf) 

ClCoaftt) 

CPCc 

=  cicc/(i-cicco 

0 

24 

0 

0 

0 

0.7 

24 

0.042 

0 

0.042 

1.5 

23 

0.042 

0.042 

0.043 

2.8 

22 

0.042 

0.083 

0.045 

3.0 

21 

0.083 

0.083 

0.091 

3.2 

20 

0.083 

0.083 

0.091 

3.8 

19 

0.083 

0.127 

0.095 

4.7 

18 

0.083 

0.171 

0.101 

4.9 

17 

0.127 

0.171 

0.153 

6 

16 

0.215 

0.171 

0.259 

6.9 

14 

0.259 

0.171 

0.312 

7.0 

13 

0.259 

0.215 

0.330 

7.6 

12 

0.259 

0.215 

0.330 

10 

11 

0.307 

0.311 

0.445 

10.8 

7 

0.361 

0.311 

0.524 

11.0 

6 

0.361 

0.311 

0.524 

11.2 

5 

0.361 

0.376 

0.579 

15 

4 

0.361 

0.376 

0.579 

17.1 

3 

0.449 

0.376 

0.719 

20.3 

2 

0.536 

0.376 

0.860 

24.4 

1 

0.536 

0.376 

0.860 

Table  9.11  on  the  left  gives  the  calculation  of 
the  CPC  for  the  event-type  "death  from  can¬ 
cer”.  A  graph  of  the  CPCca  curve  is  shown 
below  the  table. 


From  the  CPCca  curve,  we  can  see,  for  example, 
that  at  10  months,  the  probability  of  dying  from 
cancer  given  that  no  other  type  of  event  has 
occurred  is  0.455.  Similarly,  at  20.3  months, 
the  probability  of  dying  of  cancer  given  that  no 
other  type  of  event  has  occurred  is  0.860. 
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Example:  Byar  Data 


Test  for  equality:  p-value  =  .01  (Pepe-Mori) 


Returning  to  the  Byar  dataset  originally 
introduced  in  Section  III  of  this  chapter,  the 
plot  shown  here  gives  the  CPC  curves  compar¬ 
ing  the  two  DES  treatments.  These  curves  give 
the  probability  of  an  event  (death)  from  pros¬ 
tate  cancer  at  any  particular  time  given  that  the 
patient  is  alive  at  this  time  to  experience  the 
event. 

(Note:  the  Fine  and  Gray  approach  has  not 
been  extended  to  model  the  CPCs  in  a  regres¬ 
sion  framework.) 

The  Pepe-Mori  test  shows  a  significant  differ¬ 
ence  between  these  two  CPC  curves. 


VIII.  Method  2— The 
Lunn-McNeil  (LM) 
Approach 


Method  I:  separate  estimates  for 
each  failure  type,  treating  the  com¬ 
peting  failure  types  as  censored 


We  have  previously  (Section  IV)  described  an 
approach  (called  Method  1)  for  analyzing  com¬ 
peting  risks  data  that  uses  the  Cox  (PH)  model 
to  separatelyestimate  hazards  and  correspon¬ 
ding  hazard  ratios  for  each  failure  type, 
treating  the  other(competing)  failure  types  as 
censored  in  additionPto  those  not  failing  from 
any  event-type. 


Method  2:  LM  Approach 

•  Uses  a  single  Cox  (PH)  model 

•  Gives  identical  results  as 
obtained  from  Method  1 

•  Allows  flexibility  to  perform 
statistical  inferences  not 
available  from  Method  1 


We  now  describe  Method  2,  called  the  Lunn- 
McNeil  (LM)  approach,  that  allows  only  one 
Cox  PH  model  to  be  fit  rather  than  separate 
models  for  each  event-type  (i,e..  Method  I 
above).  This  approach,  depending  on  the  vari¬ 
ables  put  in  the  model,  can  give  identical 
results  to  those  obtained  from  separate  models. 
Moreover,  the  LM  approach  allows  flexibility 
to  perform  statistical  inferences  about  various 
features  of  the  competing  risk  models  that  can 
not  be  conveniently  assessed  using  Method  1 . 
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Table  9.12.  Augmented  Data  for  ith  Sub¬ 
ject  at  Time  tj  Using  LM  Approach 


Subj  Stime  Status 

Di 

d2 

d3 

..Dc 

Xi  . 

■Xp 

i  t|  ei 

1 

0 

0  .. 

0 

Xii  . 

..Xip 

i  ti  e2 

0 

1 

0  .. 

0 

Xii  . 

..Xip 

i  tj  e3 

0 

0 

1 .. 

0 

Xu. 

..Xip 

i  ti  ec 

0 

0 

0  .. 

1 

Xii  • 

..Xip 

Di,  D2,  D3,. . .,  Dc:  indicators  for  event-types 


Definition 

Dc  equals  1  for  event-type  c  and 
0  otherwise,  c  =  1,2,  . . .,  C 

for  example, 


Event-type  1:  Dj  =  1,  D2  =  0, 

D3  =  0 _ Dc  =  0 

Event-type  2:  Dj  =  0,  D2  =  1, 

D3  =  0 _ Dc  =  0 

Event-type  3:  Dj  =  0,  D2  =  0, 
D3=  1,  ...,  Dc  =  0 


Table  9.13.  Augmented  Data  for  Sub¬ 
jects  1,  14,  16,  and  503  from  Byar  Data 
Using  LM  Approach 


Subj 

Stime 

Status 

CA 

CVD 

OTH 

Rx 

Age 

Wt 

1 

72 

0 

1 

0 

0 

0 

1 

2 

1 

72 

0 

0 

1 

0 

0 

1 

2 

1 

72 

0 

0 

0 

1 

0 

1 

2 

14 

49 

1 

1 

0 

0 

0 

0 

0 

14 

49 

0 

0 

1 

0 

0 

0 

0 

14 

49 

0 

0 

0 

1 

0 

0 

0 

16 

3 

0 

1 

0 

0 

1 

2 

1 

16 

3 

1 

0 

1 

0 

1 

2 

1 

16 

3 

0 

0 

0 

1 

1 

2 

1 

503 

41 

0 

1 

0 

0 

0 

1 

0 

503 

41 

0 

0 

1 

0 

0 

1 

0 

503 

41 

1 

0 

0 

1 

0 

1 

0 

To  carry  out  the  LM  approach,  the  data  layout 
must  be  augmented.  If  there  are  C  competing 
risks,  the  original  data  must  be  duplicated 
C  times,  one  row  for  each  failure  type  as 
shown  in  Table  9.12  for  the  ith  subject  with 
survival  time  L  in  the  table.  Also,  C  dummy 
variables  D1(  D2,  D3,. . .,  Dc  are  created  as 
shown  in  the  table.  The  value  of  the  status 
variable  ec,  with  c  going  from  1  to  C,  equals  1 
if  event  type  c  occurs  at  time  c,  and  equals  0  if 
otherwise.  The  Xs  in  the  table  denote  the  pre¬ 
dictors  of  interest  and,  as  shown  in  the  table, 
are  identical  in  each  row  of  the  table. 

The  dummy  variables  D1;  D2,  D3,. .  .,DC  are 
indicators  that  distinguish  the  C  competing 
risks  (i.e.,  event-types). 

Thus,  the  dummy  variable  Dc  equals  1  for 
event-type  c  and  0  otherwise. 

For  example,  for  event  type  l,the  DsareD!  =  1, 
D2  =  0,  D3  =  0,  . . .,  Dc  =  0;  for  event-type  2,  the 
Ds  are  D]  =  0,  D2  =  1,  D3  =  0,  . . .,  Dc  =  0;  and 
for  event-type  3,  the  Ds  are  D!  =  0,  D2  =  0,  D3  = 
1,...,DC  =  0. 


Table  9.13  shows  observations  for  subject  #s  1, 
14,  16,  and  503  from  the  Byar  dataset.  The  CA, 
CVD,  and  OTH  columns  denote  the  C  =  3 
dummy  variables  D1(  D2,  and  D3,  respectively. 
The  last  three  columns,  labeled  Rx,  Age,  and 
Wt  give  values  for  three  of  the  eight  predictors. 

In  this  table,  there  are  three  lines  of  data  for  each 
subject,  which  correspond  to  the  three  compet¬ 
ing  risks,  Cancer  death,  CVD  death,  and  Other 
death,  respectively.  The  survival  time  (Stime) 
for  subject  1  is  72,  for  subject  14  is  49,  for  subject 
16  is  3,  and  for  subject  503  is  41. 
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Subject  1:  Censored 
Subject  14:  died  of  Cancer 
Subject  16:  died  of  CVD 
Subject  503:  died  from  OTH 

Rx  Age 

Subject  10  1 

Subject  16  1  2 


From  the  Status  and  Event  (i.e.,  CA,  CVD, 
OTH)  columns,  we  can  see  that  subject  1  was 
censored,  subject  14  died  of  Cancer,  subject  16 
died  of  CVD,  and  subject  503  died  from  Other 
causes. 

Wt 

2  For  subject  1,  the  values  for  the  predictors  Rx, 
1  Age,  and  Wt,  were  0,  1,  and  2,  respectively. 
These  values  appear  identically  in  the  last 
three  columns  of  the  three  rows  for  this  sub¬ 
ject.  Similarly,  for  subject  16,  the  predictor 
values  for  Rx,  Age,  and  Wt,  are  1,  2,  and  1, 
respectively. 


General  Stratified  Cox  LM  Model 

g=  1,2 . C 

h*(t,X)  =  h*g(t) 

x  exp  [PjX,  +  P2X2  +  •  •  •  +  PpXp 
+  621D2X,  +  822D2X2  +  •  •  •  +  82pD2Xp 
+  831 D3X,  +  832D3X2  +  •  •  •  +  83pD3Xp 


+§ciDcXi  +  8c2DcX2  +  •  ■  ■  +  ScpDcXp] 


1st  row:  predictors  X1(  X2,. . .,  Xp 
2nd  row:  product  terms 
D2Xi,  D2X2,...,  D2Xp 

Cth  row:  product  terms 
DcXj,  DcX2„  . .,  DcXp 


To  use  the  LM  approach  with  augmented  data 
to  obtain  identical  results  from  fitting  separate 
models  (Method  1),  an  interaction  version  of  a 
stratified  Cox  PH  model  is  required.  A  general 
form  for  this  model  based  on  the  notation  used 
for  the  column  heading  variables  in  Table  9.12 
is  shown  at  the  left. 

Recall  that  the  X1(  X2,. . .,  Xp  denote  the  p  pre¬ 
dictors  of  interest.  D2,  D3,. . .,  Dc  are  C  -  1 
dummy  variables  that  distinguish  the  C  event- 
types.  Note  that  event-type  1  (g  =  1)  is  the 
referent  group,  so  variable  Dj  is  omitted  from 
the  model.  Thus,  the  first  row  in  the  exponen¬ 
tial  formula  contains  the  Xs,  the  second  row 
contains  product  terms  involving  D2  with  each 
of  the  Xs,  and  so  on,  with  the  last  (Cth)  row 
containing  product  terms  of  Dc  with  each  of 
the  Xs.  The  strata  (g  =  1, . . .,  C)  are  the  C  event- 
types. 


LM  Hazard  Model  for 
Event-Type  1 


hj(t,x)  =  hs1(t) 


x  exp 


PjXj  +  p2x2 


PpXp 


(D2  =  D3  =  ■••  =  Dc  =  0) 


For  event-type  1  (g  =  1),  the  above  stratified 
Cox  model  simplifies  to  the  expression  shown 
at  the  left.  Note  that  because  g  =  1 ,  the  values 
of  the  dummy  variables  D2,  D3,. . .,  Dc  are  all  0. 
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No  product  terms  in  model: 

HRg=1(Xj  =  1  vs.  Xj  =  0)  =  exptpj] 

Product  terms  XjX:  in  model: 

HRg=1(X!  =  1  vs.  Xt  =  0) 

=  exp[Pj  +  HPjXj] 


Thus,  for  g  =  1,  if  X]  is  a  (0,1)  variable,  the 
other  Xs  are  covariates,  and  there  are  no  prod¬ 
uct  terms  XjXj  in  the  model,  the  formula  for 
the  HR  for  the  effect  of  X]  adjusted  for  the 
covariates  is  exp[pj.  The  more  general  expo¬ 
nential  formula  described  in  Chapter  3  would 
need  to  be  used  instead  to  obtain  adjusted  HRs 
if  there  are  interaction  terms  in  the  model  of 
the  form  XjX  | . 


LM  Hazard  Model  for 
Event-Type  g  (>  1) 

h*(t,X)  =h*g(t) 

x  exp  [PjXj  +  P2X2  +  •  •  •  +  PpXp 
+SgiXi  +  8g2X2  +  •  •  •  +  SgpXp] 

=  hog(t)  exP[(Pi  +  §gi)Xi  +  (P2  +  Sg2)X2 
H - 1"  (Pp  +  8gp)xp] 


For  any  g  greater  than  1,  the  general  hazard 
model  simplifies  to  a  hazard  function  formula 
that  contains  only  those  product  terms  involv¬ 
ing  the  subscript  g,  because  Dg  =  1  and  Dg-  = 
0  for  g'  not  equal  to  g. 

With  a  little  algebra  we  can  combine  coeffi¬ 
cients  of  the  same  predictor  to  rewrite  this 
hazard  model  as  shown  here. 


No  product  terms  XjX,  in  the 
model  and  g  >  1 : 


HRg(Xi  =  1  vs.  X,  =  0) 

=  exp[(pj  +  8gl)] 

Product  terms  XjX :  in  the  model 
and  g  >  1 : 


Thus,  for  g  >  1 ,  if  Xi  is  a  (0, 1)  variable,  the  other 
Xs  are  covariates,  and  there  are  no  product 
terms  XjXj  in  the  model,  the  formula  for  the 
HR  for  the  effect  of  X  |  adjusted  for  the  covari¬ 
ates  is  exp[p!  +  8gi].  This  HR  expression  would 
again  need  to  be  modified  if  the  model  contains 
product  terms  of  the  form  XjX] . 


HRgfXi  =  1  vs.  X,  =  0) 

=  exp[(pj  +  Sgi) 

+  KPi  +  SgjXj)] 

We  now  illustrate  the  above  general  LM  model 
formation  using  the  Byar  data. 

Recall  that  using  Method  1 ,  the  separate  models 
approach,  the  Cox  hazard  formula  used  to  fit  a 
separate  model  for  Cancer  deaths,  treating  CVD 
and  Other  deaths  as  censored  is  repeated  here. 

Also  shown  is  the  formula  for  the  hazard  ratio 
for  the  effect  of  the  Rx  variable,  adjusted  for 
other  variables  in  the  model. 


EXAMPLE  OF  LM  MODEL  FOR 
BYAR  DATA 


Separate  models  approach  (Method  1): 
Cause-specific  model:  Cancer 
CVD  and  Other  deaths  censored 

No-interaction  model 
hCa(t,X)  =  h0ca(t)exp[PicaRx 

+  P2CaAge  +  P3CaWt  + 

P4CaPF  +  PsCaHx  +  P6CaHG 
+  P7CaSZ  +  PgCaSG] 

HRCa(RX  =  1  vs.  RX  =  0)  =  exp[p  1Cal 
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LM  SC  Model  for  Byar  Data 
g=  1.2,3 

hg(t,  X)  =  h0g(t) 

x  exp[P[RX  +  p2Age  -f - b  P8SG 

+  <52iD2Rx  +  <522D2Age  -(-•••-(-  <528D2SG 
+531D3RX  +  <532D3Age  +  •  •  •  +  C538D3SG] 
1st  row:  predictors 

Rx,  Age,  Wt,  PF,. . SG 
2nd  row:  products 

D2Rx,  D2Age,  . . D2SG 
3rd  row:  products 

D3Rx,  D2Age,  . . D3SG 
D2  =  CVD  and  D3  =  OTH  are  (0,1) 
dummy  variables  that  distinguish 
the  3  event-types 


Using  the  general  LM  data  layout  given  in 
Table  9.11,  the  stratified  Cox  LM  model  for 
the  Byar  data  that  incorporates  C  =  3  event- 
types  is  shown  at  the  left.  The  strata,  denoted 
by  g  =  1,  2,  3,  identify  the  three  event-types  as 
Cancer,  CVD,  and  Other,  respectively. 

Notice  that  in  the  exponential  part  of  the  model, 
there  are  3  rows  of  terms  that  correspond  to  the 
3  event-types  of  interest.  The  first  row  contains 
p  =  8  predictors  Rx,  Age,  Wt,  PF,  HX,  HG,  SZ, 
SG.  The  second  row  contains  product  terms  of 
the  dummy  variable  D2  (the  CVD  indicator) 
with  each  of  the  8  predictors.  Similarly,  the 
third  row  contains  product  terms  of  D3  (the 
Other  indicator)  with  each  of  the  predictors. 


FIRCa(Rx  =  1  vs.  Rx  =  0)  =  expfPJ 
HRcvdIRx  =  1  vs.  Rx  =  0) 

=  exp[(p!  +  S21)] 

HRqthIRx  =  1  vs.  Rx  =  0) 

=  exp[(pj  +  S31)] 


From  the  above  model,  it  follows  that  the  haz¬ 
ard  ratio  formulas  for  the  effects  of  Rx  corres¬ 
ponding  to  each  event-type  are  as  shown  at  the 
left.  Notice  that  for  CVD  and  Other  deaths, 
the  coefficient  8gi  of  the  product  term  DgRx, 
g  =  2,  3,  is  added  to  the  coefficient  pj  of  Rx  in 
the  exponential  term. 


Table  9.14.  Edited  Output  for  LM  Model 
(Interaction  SC)-Byar  Data 


Std.  Haz. 

Var  DF  Coef  Err.  p  >  |z|  Ratio 


Rx 

1 

-0.550 

0.170 

0.001 

0.577 

Age 

1 

0.005 

0.142 

0.970 

1.005 

Wt 

1 

0.187 

0.138 

0.173 

1.206 

PF 

1 

0.253 

0.262 

0.334 

1.288 

Hx 

1 

-0.094 

0.179 

0.599 

0.910 

HG 

1 

0.467 

0.177 

0.008 

1.596 

SZ 

1 

1.154 

0.203 

0.000 

3.170 

SG 

1 

1.343 

0.202 

0.000 

3.830 

RxCVD 

1 

0.905 

0.244 

0.000 

2.471 

AgeCVD 

1 

0.332 

0.196 

0.089 

1.394 

WtCVD 

1 

-0.146 

0.203 

0.472 

0.864 

PFCVD 

1 

0.222 

0.377 

0.556 

1.248 

HxCVD 

1 

1.236 

0.259 

0.000 

3.441 

HGCVD 

1 

-0.449 

0.268 

0.094 

0.638 

SZCVD 

1 

-1.375 

0.417 

0.001 

0.253 

SGCVD 

1 

-1.366 

0.275 

0.000 

0.255 

RxOth 

1 

-0.028 

0.327 

0.932 

0.972 

AgeOth 

1 

0.764 

0.248 

0.002 

2.147 

WtOth 

1 

0.344 

0.265 

0.194 

1.411 

PFOth 

1 

0.288 

0.497 

0.562 

1.334 

HxOth 

1 

0.117 

0.337 

0.727 

1.125 

HGOth 

1 

-0.111 

0.345 

0.748 

0.895 

SZOth 

1 

-0.439 

0.470 

0.350 

0.645 

SGOth 

1 

-1.797 

0.360 

0.000 

0.166 

log  likelihood  =  -1831.92 


Table  9.14  shows  edited  output  obtained  from 
fitting  the  above  LM  model. 

The  first  eight  rows  of  output  in  this  table  are 
identical  to  the  corresponding  eight  rows  of  out¬ 
put  in  the  previously  shown  Table  9.1  obtained 
from  Method  1,  which  fits  a  separate  model  for 
Cancer  deaths  only.  This  equivalence  results 
because  the  first  eight  rows  of  the  LM  output 
correspond  to  the  reduced  version  of  the  LM 
model  when  D2  =  D3  =  0,  which  identifies 
Cancer  as  the  event  of  interest. 

However,  the  remaining  1 6  rows  of  LM  output 
are  not  identical  to  the  corresponding  8  rows 
of  Table  9.2  (for  CVD)  and  8  rows  of  Table  9.3 
(for  Other).  Note  that  the  remaining  16  coeffi¬ 
cients  in  the  LM  output  identify  the  5gj  co¬ 
efficients  in  the  LM  model  rather  than  the 
sum  (pi  +  5gj)  required  for  computing  the  HR 
when  g  =  2  and  3. 
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HRCa{ Rx  =  1  vs.  Rx  =  0) 

=  exp[— 0.550]  =  0.577 
=  (1/1.733) 

Wald  ChiSq  =  (-  .550/.171)2 
=  10.345(P  =  0.001) 


95%  Cl  for  exp[p1Ca]: 
exp[-0.550  ±  1.96(0.171)] 
=  (0.413,  0.807) 


From  Table  9.14,  the  adjusted  HR  for  the  effect 
of  Rx  when  the  event-type  is  Cancer  can  be 
read  directly  off  the  output  as  0.577.  Also,  the 
Wald  statistic  for  testing  H0:  Pi  =  0  is  highly 
significant  (P  =  .001).  The  corresponding  95% 
confidence  interval  for  this  HR  has  the  limits 
(0.413,  0.807). 


LM  results  for  Cancer  identical  to  These  results  are  identical  to  those  obtained  for 
Method  1  results  for  Cancer  the  adjusted  HR,  the  Wald  test,  and  interval 

estimate  obtained  in  Table  9.1  using  Method  1 
to  assess  the  effect  of  Rx  on  survival  for  cancer 
death. 


HRcvd{ Rx  =  1  vs.  Rx  =  0) 

=  exp^Pj  +<5n) 

=  exp(— 0.550  +  0.905) 
=  exp(0.355)  =  1.426 

77Roj//(Rx  =  1  vs.  Rx  =  0) 
=  exp^  +<521) 

=  exp(— 0.550  -  0.028) 
=  exp(— 0.578)  =  0.561 


Using  Table  9.14  to  obtain  adjusted  HR  for  the 
Rx  effect  when  the  event-type  is  CVD  or  Other, 
we  must  exponentiate  the  sum  (px  +  8, i )  for 
g  =  2  and  3,  respectively. 


LM  results  for  CVD  and  Other 
identical  to  Method  1  results  for 

CVD  and  Other 


These  results  are  shown  at  the  left,  and  they 
are  identical  to  those  obtained  in  Tables  9.2 
and  9.3  using  Method  1 . 


Wald  test  statistics  for  CVD  and 

Other 


WaldcvD 


\\  ,l'l;  :  •  j  = 


Pi  +  <hl 

SEP1+^21 

Pi  +  <hl 

SEPl+^21 


Note,  however,  that  using  the  LM  model  to 
obtain  Wald  test  statistics  and  95%  confidence 
intervals  for  the  HRs  for  CVD  and  Other,  the 
mathematical  formulas  (shown  at  left  for  the 
Wald  tests)  require  obtaining  standard  errors 
of  the  sums  (P  ,  +  <V)  for  g  =  2  and  3,  whereas 
the  output  in  Table  9.14  gives  only  individual 
standard  errors  of  p1;  §n  and  S2i. 


Computer  packages  provide  for  SAS  ans  STATA  provide  special  syntax  to  spec- 
computation  of  the  above  formulas  ify  the  computer  code  for  such  computations: 

SAS’s  PHREG  allows  a  “Contrast”  statement; 
SAS:  Contrast  statement  STATA  allows  a  “lincom”  command. 


STATA:  lincom  command 
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Alternative  LM  formulation 
(LMait  model) 

Output  identical  to  Method  1 

(Tables  9.1,  9.2,  9.3) 


Nevertheless,  there  is  an  alternative  version  of 
the  LM  model  that  avoids  the  need  for  special 
syntax.  This  alternative  formulation,  which  we 
call  the  LMait  model,  results  in  output  that  is 
identical  to  the  output  from  the  separate  models 
(Method  1)  approach  for  analyzing  competing 
risk  data  as  given  in  Tables  9.1  through  9.3. 


IX.  Method  2a: 
Alternative 
Lunn-McNeil  (LMa|t) 
Approach 

•  Uses  same  data  layout  as 
Table  9.12 

•  Column  headings: 

°  Dummy  variables 
D],  D2„  . .,  Dc 
°  Predictor  variables 

Xj,  x2„ . .,  xp 

•  Above  variables  are 
transformed  into  product 
terms 


The  data  layout  required  to  fit  the  LMalt  model 
is  the  same  as  shown  earlier  in  Table  9.12. 
However,  the  variables  listed  in  the  columns 
of  this  table,  namely,  the  dummy  variables  Dt, 
D2,. . .,  Dc  and  the  predictor  variables  Xj,  X2,. . ., 
Xp,  serve  as  basic  variables  that  are  trans¬ 
formed  into  product  terms  that  define  the 
LMait  model. 


1st  row  of  LMait  model: 
product  terms 

D1X1,D1X2,...,D1Xp 
coefficients  §'n,. .  .,8'lp 

1st  row  of  LM  model 

predictors  X1;  X2,  . . .,  Xp 
coefficients  p1(. . .,  Pp 


The  primary  difference  in  the  two  formulas 
is  that  the  first  row  of  the  exponential  term  in 
the  LMait  model  contains  product  terms  DiXi, 
D!X2„ . .,  DjXp  with  coefficients  denoted 
8'n,. .  .,8'lp  whereas  the  first  row  in  the  LM 
model  contains  the  predictors  X1;  X2,  . . .,  Xp 
without  product  terms  and  coefficients 
denoted  jh,  . . .,  (3p. 


General  Stratified  Cox 
LMait  Model 

g= 

hg(tjX)  =  h'g(t) 

x  exp[8'nDiXi  +  5'I2D1X2  +  •  •  •  +  S'^DjXp 
+  5' ,  D2X[  +  6'2D2X2  +  •  •  •  +  6^pD2Xp 

+  83^3X1  +  +  •  •  •  +  §3pD3Xp 

+  ••• 


The  general  form  of  the  LMait  model  is  shown 
at  the  left.  We  have  used  a  superscript  prime  (') 
to  distinguish  the  hazard  model  formula  for 
the  LMait  model  from  the  corresponding 
formula  for  the  LM  model  given  earlier. 


+8^1DcX1  +  S^DcXt  +  •  •  •  +  S^pDcXp 
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•  LMait  and  LM  models  are 
different 

•  Estimated  regression 
coefficients  will  not  be  identical 

•  Estimated  HRs,  test  statistics, 
and  interval  estimates  are 
identical 

•  Computational  formulas  are 
different 


Because  the  LMait  model  and  the  LM  model  are 
different  hazard  models,  their  estimated 
regression  coefficients  will  not  be  identical. 
Nevertheless,  when  used  on  the  same  dataset, 
the  estimated  HRs  of  interest  and  their 
corresponding  test  statistics  and  interval  esti¬ 
mates  are  identical  even  though  the  formulas 
used  to  compute  these  statistics  are  different 
for  the  two  models. 


LMait  Hazard.  Model  for 
Event-Type  1 

h'i(t.X)  =  h'01(t) 

x  exp  [8'nXi  +  §12X2  +  •  •  •  +  SjpXpJ 


For  g  =  1  (i.e.,  event-type  1),  the  LMait  model 
simplifies  to  the  expression  shown  at  the  left. 
Note  that  because  g  =  1,  the  values  of  the 
dummy  variables  are  =  1,  and  D2  =  D3  = 
. .  .=  Dc  =  0. 


(Dj  =  1,  D2  =  D3=-=Dc  =  0) 
HRg=i(Xi  =  1  vs.  Xj  =  0)  =  exp[5'n] 

(no  products  X,  X:  in  model) 


Thus,  for  g  =  1,  if  X]  is  a  (0,1)  variable,  the 
other  Xs  are  covariates,  and  there  are  no  prod¬ 
uct  terms  of  the  form  XjXj  in  the  model,  the 
formula  for  the  HR  for  the  effect  of  Xj  adjusted 
for  the  covariates  is  exp[5'n]. 


LM  HR  =  expfpj] 


Recall  that  for  the  LM  model,  the  correspon¬ 
ding  HR  formula  also  involved  the  coefficient 
of  the  X!  variable,  denoted  as  pi. 


LMait  Hazard  Model  for 
Event-Type  g  (>  1) 


h'i(t,X)  =  h'  (t) 


x  exp 


5'iXi  +  S^2  X2  +  •  •  •  +  SgnX, 


&2 


gp''p 


(Dg  =  1  and  Dg<  =  0  for  g’  7^  g) 
HRg(X]  =  1  vs.  Xi  =  0)  =  exp 

(no  products  XjXt  in  model) 


8' 


gi 


For  any  g  greater  than  1,  the  general  hazard 
model  simplifies  to  a  hazard  function  formula 
that  contains  only  those  product  terms  involv¬ 
ing  the  subscript  g,  because  Dg  =  1  and  Dg'  =  0 
for  g'  7^  g. 

Thus,  for  g  >  1,  if  X]  is  a  (0,1)  variable,  the 
other  Xs  are  covariates,  and  there  are  no  pro¬ 
ducts  Xj  X|  in  model,  the  formula  for  the  HR 
for  the  effect  of  X|  adjusted  for  the  covariates  is 
exP[Sg,]- 


LM  HR  =  exp[p!  +  8gl] 


Recall  that  for  the  LM  model,  the  exponential 
in  the  HR  formula  involved  the  sum  (p!  +  §gi). 
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Statistical  inferences  (i.e.,  Wald 
test,  95%  Cl) 

LMait  model:  need  standard  error  for 
5' j  (directly  provided  by  output) 

LM  model:  standard  error  of 
(Pi+Sgi).  (more  complicated 
computation) 

Next:  Byar  data  example  of  LMait 
model 

LMajt  SC  Model  for  Byar  Data 

g  =  1,2,  3 
hg(t,X)  =  h'0g(t) 

x  exp[8'nDiRx  + - b§'18DiSG 

Si^Rx  -f-  •  •  •  -(-  89gD2SG 
+  831D3Rx  +  •  •  •  +  538D3SG] 


Dj  =  CA,  D2  =  CVD,  and  D3  =  OTH 
are  (0,1)  dummy  variables  for  the  3 
event-types 

1st  row:  products 

DtRx,  D]Age,  . . .,  D]SG 
(LM  predictors,  Rx,  Age,. . .,  SG) 
2nd  row:  products 

D2Rx,  D2Age,  . . .,  D2SG 
3rd  row:  products 

D3Rx,  D3Age,  . . .,  D3SG 


HRCa(Rx  =  1  vs.  Rx  =  0)  =  expfSjJ 
HRcvd(Rx  =  1  vs.  Rx  =  0) 

=  exp[§21] 

HR0th(Rx  =  1  vs.  Rx  =  0) 

=  exp[8'31] 


Thus  for  g  >  1 ,  statistical  inferences  about  HRs 
using  the  LMait  model  only  require  use  of  the 
standard  error  for  5' j  that  is  directly  provided 
in  the  output. 


In  contrast,  the  LM  model  requires  computing 
the  more  complicated  standard  error  of  the 
sum  (Pj  +  5gi). 


We  now  illustrate  the  above  general  LMait 
model  formation  using  the  Byar  data. 


The  stratified  Cox  (SC)  LM^t  model  that  incor¬ 
porates  the  C  =  3  event-types  is  shown  at  the 
left.  The  strata,  denoted  by  g  =  1,  2,  3,  identify 
the  three  event-types,  Cancer,  CVD,  and 
Other. 


Notice  that  in  the  exponential  part  of  the 
model,  the  first  row  contains  product  terms  of 
the  dummy  variable  Di  (the  CA  indicator)  with 
each  of  the  8  predictors  Rx,  Age,  Wt,  PF,  HX, 
HG,  SZ,  SG.  Recall  that  in  the  LM  version  of 
this  model,  the  first  row  contained  main  effects 
of  the  predictors  instead  of  product  terms. 

The  second  and  third  rows,  as  in  the  LM  model, 
contain  product  terms  of  the  dummy  variable 
D2  (the  CVD  indicator)  and  D3  (the  OTH 
indicator),  respectively,  with  each  of  the  8  - 
predictors. 

From  the  above  model,  it  follows  that  the  HR 
formulas  for  the  effects  of  Rx  corresponding  to 
each  event-type  are  of  the  form  exp(8gl),  where 
Sgjis  the  coefficient  of  the  product  term  DgRx 
in  the  LMait  model. 
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Waldg  = 


2 


SEA, 

.  5ei 


g  =  1  (CA),  2  (CVD),  3  (OTH) 


Consequently,  Wald  test  statistics  (shown  at 
the  left)  and  confidence  intervals  for  these 
HRs  use  standard  errors  that  are  directly 
obtained  from  the  standard  error  column 
from  the  output  obtained  for  the  LMalt  model. 


Statistical  inference  information 

LMait  model:  directly  provided  by 
output 

LM  model:  not  directly  provided  by 
output  (requires  additional  com¬ 
puter  code) 


Thus,  the  LMalt  model  allows  the  user  to  per¬ 
form  statistical  inference  procedures  using  the 
information  directly  provided  in  the  computer 
output,  whereas  the  LM  model  requires  addi¬ 
tional  computer  code  to  carry  out  more  com¬ 
plicated  computations. 


Table  9.15.  Edited  Output  for 
SC  LMait  Model — Byar  Data 

Std.  Haz. 

Var  DF  Coef  Err.  p  >  |z|  Ratio 


RxCa  1 
AgeCa  1 
WtCa  1 
PFCa  1 
HxCa  1 
HGCa  1 
SZCa  1 
SGCa  1 

RxCVD  1 
AgeCVD  1 
WtCVD  1 
PFCVD  1 
HxCVD  1 
HGCVD  1 
SZCVD  1 
SGCVD  1 

RxOth  1 
AgeOth  1 
WtOth  1 
PFOth  1 
HxOth  1 
HGOth  1 
SZOth  1 
SGOth  1 


-0.550 

0.170 

0.005 

0.142 

0.187 

0.138 

0.253 

0.262 

-0.094 

0.179 

0.467 

0.177 

1.154 

0.203 

1.343 

0.202 

0.354 

0.174 

0.337 

0.134 

0.041 

0.150 

0.475 

0.270 

1.141 

0.187 

0.018 

0.202 

-0.222 

0.364 

-0.023 

0.186 

-0.578 

0.279 

0.770 

0.204 

0.532 

0.227 

0.541 

0.422 

0.023 

0.285 

0.357 

0.296 

0.715 

0.423 

-0.454 

0.298 

0.001 

0.577 

0.970 

1.005 

0.173 

1.206 

0.334 

1.288 

0.599 

0.910 

0.008 

1.596 

0.000 

3.170 

0.000 

3.830 

0.042 

1.429 

0.012 

1.401 

0.783 

1.042 

0.079 

1.608 

0.000 

3.131 

0.929 

1.018 

0.542 

0.801 

0.900 

0.977 

0.038 

0.561 

0.000 

2.159 

0.019 

1.702 

0.200 

1.718 

0.935 

1.023 

0.228 

1.428 

0.091 

2.045 

0.127 

0.635 

log  likelihood  =  -  1831.916 


Table  9.15  (LM^)  output 
identical  to 

Tables  9.1,  9.2,  9.3  (Method  1) 

output  combined 


Table  9.15  shows  edited  output  obtained  from 
fitting  the  above  LMaU  model. 

The  first  eight  rows  of  output  in  this  table  are 
identical  to  the  eight  rows  of  output  in  the 
previously  shown  Table  9.1  obtained  from 
Method  1,  which  fits  a  separate  model  for 
Cancer  deaths  only,  treating  CVD  and  Other 
deaths  as  censored. 

The  next  eight  rows  in  the  table  are  identical 
to  the  eight  rows  of  output  in  the  previous 
Table  9.2,  which  fits  a  separate  model  for 
CVD  deaths  only,  treating  Cancer  and  Other 
deaths  as  censored. 

The  last  eight  rows  in  the  table  are  identical 
to  the  eight  rows  of  output  in  the  previous 
Table  9.3,  which  fits  a  separate  model  for 
Other  deaths  only,  treating  Cancer  and  CVD 
deaths  as  censored. 


Thus,  the  output  in  Table  9.15  using  the  single 
LMalt  model  gives  identical  results  to  what  is 
obtained  from  fitting  3  separate  models  in 

Tables  9.1,  9.2,  and  9.3. 
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X.  Method  1  (Separate 
Models)  versus 
Method  2 
(LM  Approach) 

Why  bother  with  LM  or  LMait 
models  when  you  can  simply  fit 
3  separate  models? 

Answer:  Can  perform  statistical 
inferences  that  cannot  be  done 
when  fitting  3  separate  models 


The  reader  may  have  the  following  question  at 
this  point:  Why  bother  with  the  LM  or  LMaIl 
models  as  long  as  you  can  get  the  same  results 
from  fitting  three  separate  models  using 
Method  1  ?  The  answer  is  that  the  LM  or  LM^ 
model  formulation  allows  for  performing 
statistical  inferences  about  various  features  of 
the  competing  risk  models  that  cannot  be  con¬ 
veniently  assessed  when  fitting  three  separate 
models  using  Method  1 . 


LM  Model  for  Byar  Data 

g=  1.2,3 
hg(t,X)  =  h*g(t) 

x  exp[|31RX+  (i2 Age  +  •  •  •  +  |38SG 
+  82i  D2Rx  I-  822D2 Age  4-  •  •  •  +  S28D2SG 
+  831 D3RX  +  83^3  Age  H - b  838D3SG] 


We  illustrate  such  “extra”  inference-making 
using  the  LM  model  previously  described  for 
the  Byar  data  example.  This  model  is  shown 
again  at  the  left.  Equivalent  inferences  can  be 
made  using  the  LMa]t  model  (see  Exercises  at 
end  of  this  chapter). 


Inference  question:  Byar  data 

No-interaction  SC  LM  model 
versus 

interaction  SC  LM  model 


One  inference  question  to  consider  for  the  Byar 
data  is  whether  a  no-interaction  SC  LM  model 
is  more  appropriate  than  the  interaction  SC 
LM  model  defined  above. 


No-interaction  SC  model 
g=  1.2,  3 
hg(t)X)  =  hog(t) 

x  exp[pjRX  +  p2Age  3 - b  PgSG] 


The  no-interaction  SC  model  is  shown  here  at 
the  left. 


Assumes 

HRcA(Xi)  =  HRcvD(Xi) 

=  HRoTH(Xi) 

=  HR(Xj)  for  any  Xi  variable 


This  model  assumes  that  the  hazard  ratio  for 
the  effect  of  a  single  predictor  (say,  binary)  X4 
adjusted  for  the  other  variables  in  the  model  is 
the  same  for  each  event-type  of  interest. 


for  example,  Rx  =  0  vs  Rx  =  1 : 
HRca(Rx)  =  HRcvd(Rx) 

=  HRoth(Rx) 

=  exp[pt] 


For  example,  in  the  above  no-interaction  SC 
LM  model  the  hazard  ratio  for  the  effect  of  Rx 
is  exp[P]]  for  each  g,  where  p]  is  the  coefficient 
of  Rx. 
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H0:  all  5gj  =  0, 
g  =  2,  3;  j  =  1,  2,  . . 8 
where  5gj  is  coefficient  of  DgXj  in 
the  interaction  SC  LM  model 


Likelihood  Ratio  Test 


To  carry  out  the  comparison  of  the  interaction 
with  the  no-interaction  SC  LM  models,  the  null 
hypothesis  being  tested  is  that  the  coefficients 
of  the  16  product  terms  (5gj)  in  the  interaction 
SC  model  are  equal  to  zero. 


LR  =  —  2  log  Lr  —  (—2  log  Lp) 
approx  x  i6  under  Ho 
R  =  no  interaction  SC  (reduced) 
model 

F  =  interaction  SC  (full)  model 


This  null  hypothesis  is  conveniently  tested 
using  the  LM  model  with  a  likelihood  ratio 
test  statistic  obtained  by  subtracting  —2  log  L 
statistics  from  the  two  models  being  compared. 
The  degrees  of  freedom  being  tested  is  16,  the 
number  of  5gj  coefficients  being  set  equal  to 
zero  under  H0. 


Table  9.16.  Edited  Output  -  No- 
Interaction  SC  LM  Model-Byar  Data 


Var 

DF 

Coef 

Std. 

Err. 

p  >  M 

Haz. 

Ratio 

Rx 

1 

-0.185 

0.110 

0.092 

0.831 

Age 

1 

0.287 

0.087 

0.001 

1.332 

Wt 

1 

0.198 

0.093 

0.032 

1.219 

PF 

1 

0.402 

0.170 

0.018 

1.495 

Hx 

1 

0.437 

0.112 

0.000 

1.548 

HG 

1 

0.292 

0.120 

0.015 

1.339 

SZ 

1 

0.672 

0.159 

0.000 

1.958 

SG 

1 

0.399 

0.115 

0.001 

1.491 

log  likelihood  =  -1892.091 


Table  9.16  gives  the  output  resulting  from  the 
no-interaction  SC  LM  model  for  the  Byar  data¬ 
set.  In  this  table,  there  is  one  coefficient 
corresponding  to  each  of  the  eight  predictors 
in  the  model,  as  should  be  the  case  for  a  no¬ 
interaction  SC  model.  Nevertheless,  baseline 
hazard  functions  hjg(t)  are  allowed  to  be  differ¬ 
ent  for  different  g  even  if  the  coefficients  are 
the  same  for  different  g. 


Table  9.16:  Log  likelihoodR 
=  -1892.091 

Table  9.14:  Log  likelihoodp 
=  -1831.916 

LR  =  —2  log  Lr  —  (—2  log  LF) 

=  —2(— 1892.091) 

-  ( — 2( — 1831.916)) 

=  120.35  approx  y(6  under  Ho 
(P  <  0.001) 


From  Table  9.16,  we  find  that  the  log-likelihood 
statistic  for  the  reduced  (no-interaction  SC) 
model  is  —1892.091.  From  Table  9.14  (or  9.15), 
the  log-likelihood  statistic  for  the  full  (interac¬ 
tion  SC)  model  is  —1831.916. 

The  likelihood  ratio  test  statistic  (LR)  is  then 
calculated  to  be  120.35,  as  shown  at  the  left. 
This  statistic  has  an  approximate  chi-square  dis¬ 
tribution  with  1 6  degrees  of  freedom  under  H0. 


Reject  H0:  interaction  SC  model 
more  appropriate  than 

no-interaction  SC  model  The  P-value  is  less  than  .001,  which  indicates  a 

highly  significant  test  result,  thus  supporting 
use  of  the  full-interaction  SC  model. 
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Cancer  and  CVD  very  different 
clinically 

a- 

HRCa(Rx  =  1  vs.  0) 

HRCVd(Rx  =  1  vs.  0) 


For  the  Byar  dataset,  the  decision  to  reject  the 
no-interaction  SC  model  makes  sense  when 
considering  that  two  of  the  competing  risks 
are  Cancer  deaths  and  CVD  deaths.  Because 
Cancer  and  CVD  are  clinically  very  different 
diseases,  one  would  expect  the  effect  of  any  of 
the  predictors,  particularly  Rx,  on  time  to  failure 
to  be  different  for  these  different  disease  entities. 


DIFFERENT  STUDY  EXAMPLE 


Competing  risks:  Stage  1  vs.  Stage  2 
Breast  Cancer 

If 

HRstgi  (Rx  =  0  vs.  1) 

=  HRstg2  (Rx  =  0  vs.  1) 

4 

No-interaction  SC  Cox  reasonable 
depending  on  similarity  of  competing 
risks 


Suppose,  however,  the  competing  risks  for  a 
different  study  had  been,  say,  two  stages  of 
breast  cancer.  Then  it  is  plausible  that  the 
effect  from  comparing  two  treatment  regimens 
might  be  the  same  for  each  stage.  That  is,  a  no¬ 
interaction  SC  LM  model  may  be  (clinically) 
reasonable  depending  on  the  (clinical)  similar¬ 
ity  between  competing  risks. 


Unstratified  LM  model  (LMu): 

h* *(t,X)=h$(t) 
x  exp  [y!  CVD  +  y2OTH 
+  p,Rx  +  P2Age  +  •  •  ■  +  P8SG 
-f-  S21D2Rx  -j-  S22D2Age  +■■■-}-  62gD2SG 
+  5*1D3Rx  +  5;2D3Age  +  •  ■  •  +  5*gD3SG] 


Returning  again  to  the  Byar  data  example, 
another  variation  of  the  LM  model  is  shown 
at  the  left  and  denoted  LMy.  This  is  a  Cox  PH 
model  applied  to  the  augmented  data  of  Table 
9.12  that  is  not  stratified  on  the  competing 
risks  (i.e.,  there  is  no  subscript  g  in  the  model 
definition).  We  have  used  a  superscript  bullet 

(•)  to  distinguish  the  LMu  model  from  the  LM 
and  LMait  models. 


LMu  model:  CVD  and  OTH 

included  in  model 

LM  model:  CVD  and  OTH  not 
included  in  model 
(Both  LMu  and  LM  models  use 
augmented  dataset) 


The  LMu  model  includes  the  two  event-type 
dummy  variables  CVD  and  OTH  in  the 
model,  rather  than  stratifying  on  these  vari¬ 
ables.  As  for  the  LM  model,  the  fit  of  the  LMu 
model  is  based  on  the  augmented  dataset  given 
in  Table  9.12. 


LMu  model:  need  to  check  PH 
assumption  (Chapter  4) 

PH  assumption  not  satisfied 

If 

Use  LM  instead  of  LMu  model 


Because  the  LMu  model  is  an  unstratified  Cox 
PH  model,  we  would  want  to  use  the  methods 
of  Chapter  4  to  assess  whether  the  PH  assump¬ 
tion  is  satisfied  for  the  CVD  and  OTH  variables 
(as  well  as  the  other  variables).  If  the  PH 
assumption  is  found  wanting,  then  the  (strati¬ 
fied  Cox)  LM  model  should  be  used  instead. 
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PH  assumption  satisfied 

II 

Determine  HRs  using  exponential 
formula  (Chapter  3) 


If  the  PH  assumption  is  satisfied,  hazard  ratios 
for  the  effects  of  various  predictors  in  the  LMW 
model  can  be  determined  using  the  standard 
exponential  formula  described  in  Chapter  3  for 
the  Cox  PH  model. 


Cancer  survival  (CVD  =  OTH  =  0): 

HRCa(Rx  =  1  vs.  Rx  =  0)  =  exp[pj] 

CVD  survival  (CVD  =  1,  OTH  =  0): 
HRcvd(Rx  =  1  vs.  Rx  =  0) 

=  exp[y,+P*  +§*j] 

Other  survival  (CVD  =  0,  OTH  =  1 ): 
HRoth(Rx  =  1  vs.  Rx  =  0) 

=  exp[y2+P*  +  8'3l] 


In  particular,  to  obtain  the  hazard  ratio  for  the 
effect  of  Rx  on  Cancer  survival,  we  would  spec¬ 
ify  CVD  =  OTH  =  0  in  the  model  and  then 
exponentiate  the  coefficient  of  the  Rx  variable 
in  the  model,  as  shown  at  the  left. 

Similar  HR  expressions  (but  involving  y1  and 
y2  also)  are  obtained  for  the  effect  of  Rx  when 
CVD  deaths  and  Other  deaths  are  the  event- 
types  of  interest. 


Essential  point 

Use  of  single  LM-type  model  offers 
greater  flexibility  for  the  analysis 
than  allowed  using  Method  1 


At  this  point,  we  omit  further  description  of 
results  from  fitting  the  LMu  model  to  the 
Byar  dataset.  The  essential  point  here  is  that 
the  use  of  a  single  LM-type  model  with  aug¬ 
mented  data  allows  greater  flexibility  for  the 
analysis  than  can  be  achieved  when  using 
Method  1  to  fit  separate  hazard  models  for 
each  event-type  of  interest. 


XI.  Summary 


Competing  Risks 


Each  subject  can  experience  only 
one  of  several  different  types  of 
events  over  follow-up 


This  chapter  has  considered  survival  data  in 
which  each  subject  can  experience  only  one  of 
several  different  types  of  events  over  follow-up. 
The  different  events  are  called  competing  risks. 


Typical  approach 

•  Cox  PH  model 

•  Separate  model  for  each  event- 
type 

•  Other  (competing)  event-types 
treated  as  censored 


We  have  described  how  to  model  competing  risks 
survival  data  using  a  Cox  PH  model.  The  typical 
approach  for  analyzing  competing  risks  data  is 
to  perform  a  survival  analysis  for  each  event-type 
separately,  where  the  other  (competing)  event- 
types  are  treated  as  censored  categories. 
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Drawbacks  There  are  two  primary  drawbacks  to  the  above 

method.  One  problem  is  the  requirement  that 

1 .  Require  independent  competing  competing  risks  be  independent.  This  assump- 

risks  that  is,  censored  subjects  tion  will  not  be  satisfied  if  those  subjects  cen- 
have  same  risk  as  non-censored  sored  from  competing  risks  do  not  have  the 
subjects  in  risk  set  same  risk  for  failing  as  subjects  who  are  not 

censored  from  the  cause-specific  event  of  inter¬ 
est  at  that  same  time. 

2.  Product-limit  (e.g.,  KM)  curve  A  second  drawback  is  that  the  estimated 
has  questionable  interpretation  product-limit  survival  curve  obtained  from  fit¬ 
ting  separate  Cox  models  for  each  event-type 
has  questionable  interpretation  when  there  are 
competing  risks. 

Several  alternative  strategies  Regarding  the  independence  assumption, 
regarding  independence  assump-  several  alternative  strategies  for  addressing 
tion:  No  single  strategy  is  always  best  this  issue  are  described,  although  no  single 

strategy  is  always  best. 

Sensitivity  analysis:  worst-case  vio-  A  popular  strategy  is  a  sensitivity  analysis, 
lations  of  independence  assumption  which  allows  the  estimation  of  parameters  by 

considering  worst-case  violations  of  the  inde- 
For  example,  subjects  censored  pendence  assumption.  For  example,  subjects 
from  competing  risks  treated  in  censored  from  competing  risks  might  be  trea- 
analysis  as  if  ted  in  the  analysis  as  either  all  being  event-free 

or  all  experiencing  the  event  of  interest. 

•  All  event-free 

•  All  experience  event  of  interest 

•  Independence  assumption  not  Unfortunately,  the  independence  assumption 

easily  verifiable  is  not  easily  verifiable.  Consequently,  the  typi- 

•  Typical  analysis  assumes  cal  competing  risks  analysis  assumes  that  the 

independence  assumption  is  independence  assumption  is  satisfied  even  if 
satisfied  this  is  not  the  case. 


CIC  Alternative  to  KM  To  avoid  a  questionable  interpretation  of  the 

KM  survival  curve,  the  primary  alternative  to 

•  Derived  from  cause-specific  using  KM  is  the  Cumulative  Incidence  Curve 

hazard  function  (CIC),  which  estimates  the  marginal  probabil- 

•  Estimates  marginal  ity  of  an  event.  Marginal  probabilities  are  rele- 

probability  when  competing  vant  for  assessing  treatment  utility  whether 
risks  are  present  competing  risks  are  independent. 

•  Does  not  require  independence 
assumption 

•  Useful  to  assess  treatment 
utility  in  cost-effectiveness 
analyses 
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f 


cic(t(f))  =  ^ic(tf,) 

f=i 

f'=i 


The  formula  for  the  calculating  the  CIC  is  shown 
at  the  left.  The  hc(tf)  in  the  formula  is  the 
estimated  hazard  at  survival  time  tf  for  the 
event-type  (c)  of  interest.  The  term  S(tf_f) 
denotes  the  overall  survival  probability  of  previ¬ 
ous  time  (tf_i),  where  “overall  survival”  indicates 
a  subject  that  survives  all  competing  events. 


hc(tf)  =  estimated  hazard  at  ordered 
failure  time  tf  for  the  event-type  (c) 

S(tf_i)  =  overall  survival  probabil¬ 
ity  of  previous  time  (tf_i) 

CIC 

•  Does  not  use  product  limit 
formulation 

•  Not  included  in  mainstream 
commercially  available 
statistical  packages  (e.g.,  SAS, 
STATA,  SPSS,  R) 

PH  model  used  to  obtain  CIC 

Independence  of  competing  risks 
required 


Modeling  CIC  with  covariates  using 
PH  model:  Fine  and  Gray  (1999) 

Software  available  (Gebski,  1997) 
Fine  and  Gray  model  analogous  to 
Cox  PH  model 

Alternative  to  CIC 

CPCc  =  Pr(Tc  <  t  |T  >  t) 

where  Tc  =  time  until  event  c 
occurs 

T  =  time  until  any 

competing  risk  event 
occurs 

CPCc  =  CICc/(l  -  CICcd 

where  CICc'  =  CIC  from  risks  other 
than  c 


As  the  formula  indicates,  the  CIC  is  not 
estimated  using  a  product-limit  formulation. 
Also,  its  computation  is  not  included  in  main¬ 
stream  commercially  available  standard  statisti¬ 
cal  packages. 


If  a  proportional  hazard  model  is  used  to 
obtain  hazard  ratio  estimates  for  individual 
competing  risks  as  an  intermediate  step  in  the 
computation  of  a  CIC,  the  assumption  of  inde¬ 
pendent  competing  risks  is  still  required. 

Recent  work  of  Fine  and  Gray  ( 1 999)  provides 
methodology  for  modeling  the  CIC  with  covari¬ 
ates  using  a  proportional  hazards  assumption. 
Software  is  available  for  this  method  (Gebski, 
1997,  Tai  et  al.,  2001),  although  not  in  standard 
commercial  packages. 

An  alternative  to  the  CIC  is  the  Conditional 
Probability  Curve  (CPC).  For  risk  type  c, 
CPCc  is  the  probability  of  experiencing  an 
event  c  by  time  t,  given  that  an  individual  has 
not  experienced  any  of  the  other  competing  risks 
by  time  t. 


The  CPC  can  be  computed  from  the  CIC 
through  the  formula  CPCc  =  CICc/(l  —  CICc), 
where  CICc'  is  the  cumulative  incidence  of 
failure  from  risks  other  than  risk  c  (i.e.,  all 
other  risks  considered  together). 
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Tests  to  compare  CPCs:  Pepe-Mori  provide  a  test  to  compare  two 

CPC  curves.  Lunn  (1998)  extended  this  test  to 
Pepe  and  Mori  (1993)  -  2  curves  g-groups  and  allows  for  strata. 

Lunn  (1998)  -  g  curves 


Method  2:  LM  Approach 

•  Uses  a  single  Cox  (PH)  model 

•  Gives  identical  results  as 
obtained  from  Method  1 

•  Allows  flexibility  to  perform 
statistical  inferences  not 
available  from  Method  1 


We  have  also  described  an  alternative  approach, 
called  the  Lunn-McNeil  (LM)  approach,  for 
analyzing  competing  risks  data.  The  LM 
approach  allows  only  one  model  to  be  fit  rather 
than  separate  models  for  each  event-type 
(Method  1).  This  method  is  equivalent  to  using 
Method  1.  The  LM  model  also  allows  the  flexi¬ 
bility  to  perform  statistical  inferences  to  deter¬ 
mine  whether  a  simpler  version  of  an  initial  LM 
model  is  more  appropriate. 


Augmented  Data  for  ith  Subject  at  Time  t 
Using  LM  Approach 


Subj  Stime 

Status 

D, 

d2 

d3. 

..Dc 

Xi. 

■Xp 

i  ti 

ei 

1 

0 

0  .. 

.  0 

X„  . 

..xip 

i  tj 

e2 

0 

1 

0  .. 

.  0 

x„  . 

,.Xip 

i  ti 

e3 

0 

0 

1  .. 

.  0 

Xn  . 

,.xip 

i  ti 

ec 

0 

0 

0  .. 

.  1 

Xn  . 

,.xip 

To  carry  out  the  LM  approach,  the  data  layout 
must  be  augmented.  If  there  are  C  competing 
risks,  the  original  data  must  be  duplicated  C 
times,  one  row  for  each  failure  type. 


g=  1,2 . C 

hg(t,  X)  =  hgg(t) 

x  exp  [PjXj  +  P2X2  +  •  •  •  +  PpXp 
+  52iD2X!  +  522D2X2  +  •  •  •  +  82pD2Xp 
+  53JD3XJ  +  832D3X2  +  •  •  ■  +  83pD3Xp 
H - 

+SciDcXi  +  8q2DcX2  +  •  •  •  +  ScpDcXp] 


To  use  the  LM  approach  with  augmented  data  to 
obtain  identical  results  from  fitting  separate 
models  (Method  1),  an  interaction  version  of 
a  stratified  Cox  (SC)  PH  model  is  required. 
A  general  form  for  this  model  is  shown  at  the  left. 


LM  model:  need  standard  error  of 
(Pi  +  §gi) 

(special  syntax  required  for  com¬ 
putation) 


The  LM  model  can  be  used  to  obtain  Wald  test 
statistics  and  95%  confidence  intervals  for 
HRs  separately  for  each  competing  risk.  These 
statistics  require  obtaining  standard  errors  of 
the  sums  of  estimated  regression  coefficients 
(e.g.,  fii+S gi).  Such  computations  require 
special  syntax  available  in  standard  computer 
packages  such  as  SAS,  STATA,  SPSS,  and  R. 
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Alternative  LM  formulation:  LMait 
model 

LMait  yields  output  identical  to 
Method  1 


1st  row  of  LMait  model 

D 

Product  terms  DiXi,  DiX2, 
D|Xp  Coefficients  8'u, . . . ,  8'ip 

1st  row  of  LM  model 

a- 

Predictors  X1;  X2,  . . Xp 
Coefficients  p1(  . . Pp 


LMait  model:  Wald„  = 


8' 


gi 


SEs, 

Ogl. 


directly  obtained  from  output 


Statistical  inference  information 

LMait  model:  directly  provided  by 
output 

LM  model:  not  directly  provided  by 
output  (requires  additional  com¬ 
puter  code) 


Nevertheless,  there  is  an  alternative  formation 
of  the  LM  model  that  avoids  the  need  for  spe¬ 
cial  syntax.  This  alternative  formulation,  called 
the  LMait  model,  yields  output  that  is  identical 
to  the  output  from  the  separate  models 
(Method  1)  approach  for  analyzing  competing 
risk  data. 

The  primary  difference  in  the  two  formulas  is 
that  the  first  row  of  the  exponential  term  in  the 
LMait  model  contains  product  terms  DiXi, 
DjXz,  . . .,  DiXp]  with  coefficients  denoted 
8'n, . . . ,  8'ip  whereas  the  first  row  in  the  LM 
model  contains  the  predictors  X1(  X2,  . . .,  Xp 
without  product  terms  and  coefficients 
denoted  p1(  . . .,  pp. 


Using  the  LMait  model,  Wald  test  statistics 
(shown  at  the  left)  and  confidence  intervals 
use  standard  errors  that  are  directly  obtained 
from  the  standard  error  column  from  the  out¬ 
put  obtained  for  the  LMait  model. 

Thus,  the  LMait  model  allows  the  user  to  per¬ 
form  statistical  inference  procedures  using  the 
information  directly  provided  in  the  computer 
output,  whereas  the  LM  model  requires  addi¬ 
tional  computer  code  to  carry  out  more  com¬ 
plicated  computations. 


Advantage  of  LM  (Method  2)  over 
method  1: 

LM  offers  flexibility  for  statistical 
inferences  to  consider  simpler 
models 


An  advantage  of  using  either  the  LM  or  LMait 
approach  instead  of  fitting  separate  models 
(Method  1)  is  the  flexibility  to  perform  statisti¬ 
cal  inferences  that  consider  simpler  versions  of 
an  interaction  SC  LM  model. 
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For  example, 

No-interaction  SC  LM  model 
versus 

interaction  SC  LM  model 

Unstratified  LM  model 
versus 

SC  LM  model 


For  example,  one  inference  question  to  consider 
is  whether  a  no-interaction  SC  LM  model  is 
more  appropriate  than  an  interaction  SC 
model.  A  different  question  is  whether  an 
unstratified  LM  model  is  more  appropriate 
than  a  stratified  LM  model.  These  questions 
can  be  conveniently  addressed  using  a  single 
(i.e.,  LM)  model  instead  of  fitting  separate  mod¬ 
els  (Method  1). 


Overall, 

•  Can  use  standard  computer 
packages 

•  Independence  assumption 
required 


Overall,  in  this  chapter,  we  have  shown  that 
competing  risks  data  can  be  analyzed  using 
standard  computer  packages  provided  it  can  be 
assumed  that  competing  risks  are  independent. 


474  9.  Competing  Risks  Survival  Analysis 


Detailed 

Outline 


I.  Overview  (page  430) 

A.  Focus:  competing  risks  -  analysis  of  survival 
data  in  which  each  subject  can  experience  only 
one  of  different  types  of  events  over  follow-up. 

B.  Analysis  using  Cox  PH  model. 

C.  Drawbacks  to  typical  approach  that  uses  Cox 
model. 

D.  Alternative  approaches  for  analysis. 

II.  Examples  of  Competing  Risks  Data 

(pages  430-432) 

A.  Dying  from  either  lung  cancer  or  stroke. 

1.  Assess  whether  lung  cancer  death  rate  in 
exposed"  persons  is  different  from  lung  can¬ 
cer  death  rate  in  “unexposed,”  allowing  for 
competing  risks. 

2.  Also,  compare  lung  cancer  with  stroke  death 
rates  controlling  for  predictors. 

B.  Advanced  cancer  patients  either  dying  from 
surgery  or  getting  hospital  infection. 

1.  If  focus  on  hospital  infection  failure,  then 
death  from  surgery  reduces  burden  of  hos¬ 
pital  infection  control  required. 

C.  Soldiers  dying  in  accident  or  in  combat. 

1 .  Focus  on  combat  deaths. 

2.  If  entire  company  dies  from  accident  on  way 
to  combat,  then  KM  survival  probability  for 
combat  death  is  undefined. 

3.  Example  illustrates  that  interpretation  of 
KM  curve  may  be  questionable  when  there 
are  competing  risks. 

D.  Limb  sarcoma  patients  developing  local  recur¬ 
rence,  lung  metastasis,  or  other  metastasis. 

1.  None  of  failure  types  involves  death,  so 
recurrent  events  are  possible. 

2.  Can  avoid  problem  of  recurrent  events  if 
focus  only  on  time  to  first  failure. 

3.  Analysis  of  recurrent  events  and  competing 
risks  in  same  data  not  addressed. 

III.  Byar  Data  (pages  433-434) 

A.  Randomized  clinical  trial  comparing  treat¬ 
ments  for  prostate  cancer. 

B.  Three  competing  risks:  deaths  from  prostate 
cancer;  CVD,  or  other  causes. 
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C.  Covariates  other  than  treatment  are  Age, 
Weight  (Wt),  Performance  Status  (PF),  History 
of  CVD  (Hx),  Hemoglobin  (Hg),  Lesion  size 
(SZ),  and  Gleeson  score  (SG). 

D.  Competing  risks  considered  independent,  for 
example,  death  from  CVD  independent  of 
death  from  death  from  cancer. 

IV.  Method  1:  Separate  Models  for  Different  Event 

Types  (pages  434-437) 

A.  Use  Cox  (PH)  model  to  estimate  separate 
hazards  and  HRs  for  each  failure  type,  where 
other  competing  risks  are  treated  as  censored  in 
addition  to  usual  reasons  for  censoring:  loss  to 
follow-up,  withdrawal  from  study,  or  end  of 
study. 

B.  Cause-specific  hazard  function: 

hc(t)  =  limAt^0P(t  <  Tc  <  t  +  At|Tc  >  t)/At 
where  Tc  =  time-to-failure  from  event  c,  c  =  1 , 
2,  . . .,  C  (#  of  event  types). 

C.  Cox  PH  cause-specific  model  (event-type  c): 


hc(t,  X)  =  h0c(t)  exp 


i  — - 1 


where  c  =  1,. . .,  C,  and  (3ic  allows  effect  of  X;  to 
differ  by  event-type. 

D.  Byar  data  example:  Cancer,  CVD,  Other  Deaths 
are  C  =  3  competing  risks. 

1.  Cause-specific  (no-interaction)  model  for 

Cancer: 

hCa(t,X)  =  hoca(t)  exp[p1CaRx  +  P2CaAge 
+  P3CaWt  +  P4CaPF  +  Pscaffr 

+  p6CaHG  +  p7CaSZ  +  PscaSG] 

where  CVD  and  Other  deaths  treated  as  cen¬ 
sored  observations 

HRCa(RX  =  1  vs.  RX  =  0)  =  exp[p1Ca] 

2.  Separate  cause-specific  (no-interaction) 
models  for  CVD  and  Other. 

3.  Edited  output  presented  for  each  cause-spe¬ 
cific  model: 

a.  Cause-specific  Cancer  results  for  RX 
(with  CVD  and  Other  censored): 

fr^Ca(RX  =  1  vs.  Rx  =  0)  =  0.575  (P  =  0.001) 
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b.  Cause-specific  CVD  results  for  RX  (with 
Cancer  and  Other  censored): 

/^cvd(RX  =  1  vs.  Rx  =  0)  =  1 .429  (P  =  0.040) 

c.  Cause-specific  Other  results  for  RX 
(with  Cancer  and  CVD  censored): 

HRotu(RX  =  1  vs.  Rx  =  0)  =  0.560  (P  =  0.038) 

V.  The  Independence  Assumption  (pages  437-443) 

A.  Independent  censoring:  h(t)  for  censored  sub¬ 
jects  at  time  t  is  the  same  as  for  non-censored 
subjects  in  the  same  subgroup  at  time  t 

1.  Typical  (chapter  1)  context:  no  competing 
risks; 

2.  Informative  censoring  can  lead  to  bias 
results. 

B .  (Independent)  censoring  with  competing  risks. 

Censored  subjects  in  the  risk  set  at  time  t  with  a 
given  set  of  covariates  have  the  same  failure  rate 
as  non-censored  subjects  in  the  risk  set  at  time 
t  with  the  same  set  of  covariates  regardless  of 
whether  the  reason  for  censoring  is  a  competing 
risk,  withdrawal  from  study,  or  loss  to  follow-up. 

1 .  Non-independent  censoring:  Subjects  in  the 
risk  set  at  time  t  who  are  censored  from  a 
competing  risk  do  not  have  the  same  failure 
rate  as  non-censored  subjects  in  the  risk  set 
at  time  t. 

2.  Synonym:  Competing  risks  are  independent. 

C.  Assessing  the  independence  assumption. 

1.  No  method  available  to  directly  assess  the 
independence  assumption  nor  guarantee 
unbiased  estimates  if  independence 
assumption  is  violated. 

2.  Consequently,  the  typical  analysis  of  com¬ 
peting  risks  assumes  that  the  independence 
assumption  is  satisfied,  even  if  not. 

3.  Strategies  for  considering  independence 
assumption 

a.  Decide  that  assumption  holds  on  clini¬ 
cal/biological/other  grounds: 

b.  Include  in  your  model  variables  that  are 
common  risk  factors  for  competing 
risks. 
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c.  Use  a  frailty  model  containing  a  random 
effect  that  accounts  for  competing  risks. 

d.  Perform  a  sensitivity  analysis  by  consid¬ 
ering  worst-case  violations  of  indepen¬ 
dence  assumption. 

e.  All  of  above  strategies  rely  on  assump¬ 
tions  that  cannot  be  verified  from 
observed  data. 

4.  Example  of  sensitivity  analysis  using  Byar 

data. 

a.  Treat  all  subjects  that  die  of  competing 
risks  CVD  and  Other  as  Cancer  deaths. 

b.  Treat  all  subjects  that  die  of  competing 
risks  CVD  and  Other  as  surviving  as  long 
as  the  largest  survival  time  in  the  study. 

c.  Results  suggest  that  if  competing  risks  are 
not  independent,  then  conclusions  about 
the  effect  of  Rx  could  be  very  different. 

d.  Alternative  sensitivity  approach:  ran¬ 
domly  select  a  subset  (e.g.,  50%)  of  sub¬ 
jects  who  have  CVD  or  Other  deaths  and 
assume  everyone  in  subset  dies  of  Cancer. 

VI.  Cumulative  Incidence  Curves  (CIC) 

(pages  444-453) 

A.  Hypothetical  study:  n  =  100  subjects,  all  sub¬ 
jects  with  prostate  cancer 

Survt  (months)  #  Died  Cause 

3  99  CVD 

5  1  Cancer 

Study  goal:  cause-specific  Cancer  survival 

Censored:  CVD  deaths 

KMCa:  SCa(t  =  5)  =  0  and  Riskca(T  <  5)  =  1 

B.  How  many  of  99  deaths  from  CVD  would  have 
died  from  Cancer  if  not  dying  from  CVD? 

1.  No  answer  is  possible  because  those  with 

CVD  deaths  cannot  be  observed  further. 

2.  Sensitivity  analysis  A:  99  CVD  deaths  die  of 

Cancer  at  t  =  5. 

a.  KMCa:  SCa(t  =  5)  =  0  and  RiskCa(T  <  5)  =  1 
because  KM  assumes  independent  cen¬ 
soring;  that  is,  those  censored  at  t  =  3 
were  as  likely  to  die  from  cancer  at  t  =  5 
as  those  who  were  in  the  risk  set  at  t  =  5. 

b.  Same  KM  result  as  obtained  for  actual 
data. 
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3.  Sensitivity  analysis  B:  99  CVD  deaths 
survive  past  t  =  5. 

a.  KMCa:  SCa(t  =  5)  =  0.99  and 
Riskca  (T  <  5)  =  0.01. 

b.  Different  KM  result  than  actual  data. 

c.  Can  be  derived  directly  from  actual  data 
as  a  marginal  probability. 

4.  Main  point:  KM  survival  curve  may  not  be 
very  informative. 

C.  Cumulative  Incidence  Curve  (CIC):  alternative 
to  KM  for  competing  risks. 

1 .  Derived  from  cause-specific  hazard  function. 

2.  Estimates  marginal  probability. 

3.  Does  not  require  independence  assumption. 

4.  Has  a  meaningful  interpretation  in  terms 
of  treatment  utility. 

5.  CIC  formula: 

f  f 

cic(tf)  =  ]Tic(tfO  =  ^s^.ob c(tP) 

t=i  F=i 

6.  Calculation  of  CIC  for  another  hypothetical 
dataset. 

7.  Tests  have  been  developed  (Pepe  and  Mori, 
1992)  for  comparing  the  equality  of  CICs 
for  two  or  more  groups:  analogous  to  log 
rank  test. 

8.  When  a  PH  model  is  used  to  obtain  hazard 
ratio  estimates  in  the  computation  of  a 
CIC,  the  independence  of  competing  risks 
is  required. 

9.  Fine  and  Gray  (1999)  provide  methodology 
for  modeling  the  CIC  (also  called  subdis¬ 
tribution  function)  with  covariates  using 
a  proportional  hazards  assumption:  analo¬ 
gous  to  fitting  Cox  PH  model. 

10.  Example  of  Fine  and  Gray  output  compared 
with  Cox  PH  output  for  Byar  data. 

VII.  Conditional  Probability  Curves  (CPC) 

(pages  453-455) 

A.  CPCc  =  Pr(Tc  <  t|T  >  t)  where  Tc  =  time  until 
event  c  occurs,  T  =  time  until  any  competing 
risk  event  occurs. 

B .  Formula  in  terms  of  CIC :  CPCc  =  CICc/ ( 1  —  CICc' ) 
where  CICc'  =  CIC  from  risks  other  than  c. 

C.  Graphs  of  CPCs  can  be  derived  from  graphs  of 
CICs. 

D.  Tests  to  compare  CPCs:  Pepe  and  Mori  (1993)  - 
2  curves;  Lunn  (1998)  -  g  curves. 
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VIII.  Method  2 — The  Lunn-McNeil  (LM)  Approach 

(pages  455-461) 

A.  Allows  only  one  Cox  PH  model  to  be  fit  rather 
than  fitting  separate  modes  for  each  event  type 
(Method  1). 

B.  LM  uses  an  augmented  data  layout. 

1 .  for  ith  subject  at  time  ti(  layout  has  C  rows  of 
data,  where  C  =  #  event-types. 

2.  Dummy  variables  D1(  D2,. . .,  Dc  are  created 
to  distinguish  the  C  event- types. 

3.  The  Status  variable,  ec,  c  =  1,. . .,  C,  equals  1 
for  event-type  c  and  0  otherwise. 

4.  Predictors  are  denoted  by  X1(. . .,  Xp. 

5.  Example  of  data  layout  for  Byar  dataset. 

C.  General  form  of  LM  model  (interaction  SC 
model). 

hg(t)X)  =  h)lg(t)  exptax!  +  p2X2  +  •  •  •  +  ppXp 

g=l,2,...,C 

+  S21D2X1  +  822D2X2  +  •  •  •  +  82pD2Xp 

+  831D3X1  +  832D3X2  -| - h  83PD3XP 

+  ••• 

+  SciDcXj  +  SC2DcX2  +  •  •  •  +  8cpDcXp] 

1 .  LM  model  for  event-type  g  =  1 : 

a.  hj  (t,X)  =  hgj  (t) 

x  exp  PjXi  +  P2X2  +  •  •  •  +  ppXp 

b.  D2  =  D3  =  . . .  =  Dq  —  0 

c.  HRg=1(X!  =  1  vs.  Xj  =  0)  =  exp[pi] 

2.  LM  model  for  event-type  g  (>  1): 

a.  h*(t,X)  =  h*g(t) 

x  exp[(P[  +  8gl)X!  +  (p2  +  8g2)X2 

5 - f  (Pp  +  Sgp)xp 

b.  HRg(Xi  =  1  vs.  Xj  =  0  =  exp[(p!  +  8gl)] 

D.  LM  model  for  Byar  data. 

1.  h*(t,x)  =h;g(t) 

x  exp[P[  Rx  +  P2Age  +  •  •  •  +  P8SG 
+  82[D2Rx  +  822D2Age  +  •  •  •  +  828D2SG 
+831D3RX  +  832D3Age  +  •  •  •  +  838D3SG] 
g  =  1,2,3 

2.  D2  =  CVD  and  D3  =  OTH  are  (0,1)  dummy 
variables  for  3  event-types. 
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3.  HRCa(Rx  =  1  vs.  Rx  =  0)  =  exp[fh] 
HRCvd(Rx  =  1  vs.  Rx  =  0)  =  exp[Pi  +  82i] 
HR0th(Rx  =  1  vs.  Rx  =  0)  =  exp[p]  +  §31] 

4.  Mathematical  formulas  for  Wald  tests  and 
confidence  intervals  require  standard  errors 
for  sums  +  Sgi)  for  g  =  2  and  3;  requires 
special  syntax  for  computer. 

5.  Output  provided  and  compared  to  output 
from  Method  1 . 

IX.  Method  2a:  Alternative  Lunn-McNeil  (LMait) 
Approach  (pages  461-464) 

A.  An  alternative  LM  formulation  that  gives  iden¬ 
tical  output  to  that  obtained  from  Method  1 
(separate  models  approach). 

B.  Same  data  layout  as  for  LM  model,  but  only 
product  terms  in  LMait  model. 

C.  General  form  of  LMait  (interaction  SC)  model: 

g=  1,...,C 

h'g(t,X)  =  h'0g(t) 

x  expfS'nDjXj  +  5'12DiX2  +  •  •  •  +  §'iPDiXp 
+  S'21D2Xi  +  S'22D2X2  +  •  •  •  +  8'2pD2Xp 
H - 

+  S^iDcXi  +  8'c2DcX2  +  •  •  •  +  8'cpD2Xp] 

D.  Hazard  ratio  formula  involves  only  coefficients 
of  product  terms  for  all  g: 

HRg(X!  =  1  vs.  Xi  =  0)  =  exp[8'gl],  g  =  1,  2,  3 

a.  Statistical  inference  information  directly 
provided  by  LM^  output. 

E.  Example  of  output  for  LMa]t  model  using  Byar 
data. 

X.  Method  1  (Separate  Models)  versus 

Method  2  (LM  Approach)  (pages  465-468) 

A.  LM  and  LMait  models  allow  flexibility  to  perform 
statistical  inferences  about  features  of  competing 
risks  model  not  conveniently  available  using 
separate  models  (Method  1)  approach. 

B.  LM  and  LMait  models  can  assess  whether  a  no¬ 
interaction  SC  model  is  more  appropriate  than 
the  initial  interaction  SC  LM  model. 
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C.  Example  of  comparison  of  no-interaction  with 
interaction  SC  model  using  Byar  data. 

D.  LM  and  LMajt  models  can  assess  whether  an 
unstratified  LM  model  (called  LMu)  is  more 
appropriate  than  a  stratified  LM  model. 

E.  Example  of  LMu  model  involving  Byar  data. 

XI.  Summary  (pages  468-473) 

A.  Competing  risks  survival  data  can  be  analyzed 
using  Cox  PH  models  and  standard  computer 
packages. 

B.  There  are  two  alternative  methods  that  use  a 
Cox  PH  model  formulation. 

1.  Fit  separate  models  for  each  cause-specific 
event  type,  treating  the  remaining  event 
types  as  censored. 

2.  Use  the  Lunn-McNeil  (LM)  approach  to  fit 
a  single  model  that  incorporates  the  analysis 
for  each  cause-specific  event. 

C.  Each  of  the  above  approaches  requires  that 
competing  risks  be  independent  (i.e.,  indepen¬ 
dent  censoring). 

D.  Without  the  independence  assumption,  methods 
for  competing  risks  analysis  are  unavailable. 

E.  The  Cumulative  Incidence  Curve  (CIC)  or  the 
Conditional  Probability  Curve  (CPC)  are 

alternatives  to  the  KM  curve,  when  use  of  a 
KM  curve  has  questionable  interpretation. 


Practice  Answer  questions  1  to  15  as  true  or  false  (circle  T  or  F). 

Exercises 

T  F  l.A  competing  risk  is  an  event-type  (i.e.,  failure 
status)  that  can  occur  simultaneously  with 
another  event  of  interest  on  the  same  subject. 

T  F  2.  An  example  of  competing  risks  survival  data  is 
a  study  in  which  patients  receiving  radiother¬ 
apy  for  head  and  neck  cancer  may  either  die 
from  their  cancer  or  from  some  other  cause  of 
death. 

T  F  3.  if  all  competing  risks  in  a  given  study  are  dif¬ 
ferent  causes  of  death,  then  it  is  possible  to 
have  both  competing  risks  and  recurrent 
events  in  the  same  study. 
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T  F  4. 

T  F  5. 

T  F  6. 

T  F  7. 

T  F  8. 

T  F  9. 

T  F  10. 

T  F  11. 


T  F  12. 
T  F  13. 

T  F  14. 


Suppose  patients  with  advanced-stage  cancer 
may  die  after  surgery  before  their  hospital  stay 
is  long  enough  to  get  a  hospital  infection,  then 
such  deaths  from  surgery  reduce  the  hospital’s 
burden  of  infection  control. 

The  typical  approach  for  analyzing  competing 
risks  using  a  Cox  PH  model  involves  fitting 
separate  models  for  each  competing  risk  ignor¬ 
ing  the  other  competing  risks. 

Suppose  that  a  cause-specific  risk  of  interest  is 
development  of  lung  metastasis,  and  a  compet¬ 
ing  risk  is  local  recurrence  of  a  lung  tumor.  Then 
a  patient  who  develops  a  local  recurrence  is 
treated  as  a  failure  in  a  competing  risk  analysis. 
When  there  are  no  competing  risks,  then  any 
study  subject  in  the  risk  set  at  a  given  time  has 
the  same  risk  for  failing  as  any  other  subject  in 
the  risk  set  with  the  same  values  for  covariate 
predictors  at  time  t. 

If,  when  analyzing  competing  risks  survival 
data,  it  is  assumed  that  censoring  is  indepen¬ 
dent,  then  a  subject  in  the  risk  set  at  time  t  is  as 
likely  to  fail  from  any  competing  risk  as  to  be 
lost  to  follow-up. 

When  a  sensitivity  analysis  indicates  that  a 
worst-case  scenario  gives  meaningfully  differ¬ 
ent  results  from  an  analysis  that  assumes  inde¬ 
pendence  of  competing  risks,  then  there  is 
evidence  that  the  independence  assumption  is 
violated. 

The  typical  competing  risk  analysis  assumes 
that  competing  risks  are  independent  even  if 
this  assumption  is  not  true. 

The  Cumulative  Incidence  Curve  (CIC)  pro¬ 
vides  risk  estimates  for  the  occurrence  of  a 
cause-specific  event  in  the  presence  of  compet¬ 
ing  risks. 

CIC  =  1  —  KM,  where  KM  denotes  the  Kaplan- 
Meier  curve. 

A  CIC  for  a  cause-specific  event  that  ignores 
the  control  of  covariates  does  not  require  the 
assumption  of  independent  competing  risks. 

A  Cumulative  Probability  Curve  (CPC)  gives 
the  probability  of  experiencing  an  event  c  by 
time  t,  given  that  an  individual  has  experienced 
any  of  the  other  competing  risks  by  time  t. 
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T  F  15.  If  CICc  =  .4,  then  CPC  =  .4/. 6  =  .667. 

T  F  16.  The  Lunn-McNeil  (LM)  approach  fits  a  single 
stratified  Cox  model  using  an  augmented  data¬ 
set  to  obtain  the  same  results  as  obtained  by 
fitting  separate  Cox  models  for  each  cause- 
specific  competing  risk. 

T  F  17.  An  advantage  of  the  Lunn-McNeil  (LM) 
approach  over  the  approach  that  fits  separate 
Cox  models  is  that  the  LM  approach  allows  for 
testing  whether  a  no-interaction  SC  model 
might  be  preferable  to  an  interaction  SC 
model. 

T  F  18.  Given  the  LM  model  stratified  on  two  cause- 
specific  events,  Cancer  and  CVD: 


hg(t,  X)  =  h*g(t)  exp[P,Rx  +  p2Age 

+  §!(D  x  Rx)  +  82(D  x  Age)], 
g  =  1 ,  2  where 
D  =  0  if  Ca  and  =  1  if  CVD 

then 

HRCvd(Rx  =  1  vs.  Rx  =  0)  =  expfPi  +  8]] 
T  F  19.  Given  the  LMait  model  for  two  cause-specific 
events,  Cancer  and  CVD: 

h'g(t.X)  =  h'og  (t)  x  exp[8'nDiRx  +  S'i2DiAge 
+  8'21D2Rx  +  8'22D2Age] , 
g  =  1 , 2  where 

D]  =  1  if  Ca  or  0  if  CVD,  and 
D2  =  0  if  Ca  or  1  if  CVD, 

then 

HRcvd(Rx  =  1  vs.  Rx  =  0)  =  exp[S'2i] 

T  F  20.  The  LMu  model  that  would  result  if  the  LM 
model  of  Question  18  were  changed  to  an 
unstratified  Cox  PH  model  can  be  written  as 
follows. 

h*  (t,  X)  =  hg  (t)  exp  [  p*  Rx  +  P*  Age  +  8*  j  (D  x  Rx) 
+  8*2  (D  x  Age)] 
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Consider  a  hypothetical  study  of  the  effect  of  a  bone 
marrow  transplant  for  leukemia  on  leukemia-free  survival, 
where  transplant  failures  can  be  of  one  of  two  types: 
relapse  of  leukemia  and  nonrelapse  death  (without  prior 
relapse  of  leukemia).  Suppose  that  in  hospital  A,  100 
patients  undergo  such  a  transplant  and  that  within  the 
first  4  years  post-transplant,  60  die  without  relapse  by 
year  2  and  20  relapse  during  year  4.  Suppose  that  in  hospi¬ 
tal  B,  100  patients  undergo  such  a  transplant  but  post¬ 
transplant,  there  are  20  non-relapse  deaths  by  year  1 ,  15 
relapses  during  year  2,  40  non-relapse  deaths  between 
years  3  and  4,  and  5  relapses  during  year  4. 

2 1 .  What  are  the  competing  risks  in  this  study? 

22.  What  is  the  proportion  of  initial  patients  in  hospitals  A 
and  B,  respectively,  that  have  leukemia  relapse  by  4 
years? 

The  following  tables  provide  the  Kaplan-Meier  curves  for 
relapse  of  leukemia  for  each  study. 


Hospital  A  Hospital  B 


tj 

nj 

mj 

<Jj 

S(tj) 

tj 

nj 

mj 

9j 

S(tj) 

0 

100 

0 

60 

1 

0 

100 

0 

20 

1 

2 

40 

0 

0 

1 

1 

80 

0 

0 

1 

4 

40 

20 

20 

.5 

2 

80 

15 

0 

0.8125 

3 

65 

0 

40 

0.8125 

4 

25 

5 

20 

0.65 

23.  How  have  both  tables  treated  the  competing  risk  for 
nonrelapse  death  in  the  calculation  of  the  KM  prob¬ 
abilities? 

24.  Why  are  the  KM  probabilities  different  at  4  years  for 
each  hospital? 
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25.  Compute  the  CIC  curves  for  each  hospital  using  the 
following  tables. 

Hospital  A 


tf 

nf 

mf 

hca(tf) 

s  (tf_i) 

Ica(tf) 

ClC(tf) 

0 

100 

0 

0 

— 

— 

2 

40 

0 

0 

! 

0 

0 

4 

40 

20 

- 

- 

- 

- 

Hospital  B 

tf 

nf 

mf 

hca(tf) 

S  (tf_f) 

Ica(tf) 

ClC(tf) 

0 

100 

0 

0 

— 

— 

— 

1 

80 

0 

0 

1 

0 

0 

2 

80 

15 

- 

- 

- 

- 

3 

65 

0 

- 

- 

- 

- 

4 

25 

5 

- 

- 

- 

- 

26.  Why  are  the  CIC  probabilities  the  same  at  4  years? 

Consider  a  hypothetical  study  to  assess  the  effect  of  a  new 
hospital  infection  control  strategy  for  patients  who 
undergo  heart  transplant  surgery  in  a  given  hospital.  The 
exposure  variable  of  interest  is  a  binary  variable  Group 
(G):  G  =  0  for  those  patients  receiving  heart  transplants 
from  1992  through  1995  when  the  previous  hospital  con¬ 
trol  strategy  was  used;  G  =  1  for  those  patients  receiving 
heart  transplants  from  1996  through  1999  when  the  new 
hospital  infection  control  strategy  was  adopted.  The  pri¬ 
mary  event  of  interest  is  getting  a  hospital  infection  after 
surgery.  A  competing  risk  is  death  during  recovery  from 
surgery  without  getting  a  hospital  infection.  Control  vari¬ 
ables  being  considered  are  tissue  mismatch  score  (TMS)  at 
transplant  and  AGE  at  transplant.  The  outcome  variable  of 
interest  is  time  (DAYS  after  surgery)  until  a  patient  devel¬ 
oped  a  hospital  infection. 

27.  State  a  cause-specific  no-interaction  Cox  PH  model 
for  assessing  the  effect  of  group  status  (G)  on  time 
until  a  hospital  infection  event. 

28.  When  fitting  the  model  given  in  Question  27,  which 
patients  should  be  considered  censored? 

29.  Describe  or  provide  a  table  that  would  show  how  the 
data  on  the  ith  patient  should  be  augmented  for  input 
into  a  Lunn-McNeil  (LM)  model  for  this  analysis. 
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Test 


30.  State  a  LM  model  that  can  be  used  with  an  augmented 
dataset  that  will  provide  identical  results  to  those 
obtained  from  using  the  model  of  Question  27. 

3 1 .  For  the  LM  model  of  Question  30,  what  is  the  formula 
for  the  hazard  ratio  for  the  group  effect  G,  controlling 
for  TMS  and  AGE. 

32.  Describe  how  you  would  test  whether  a  no-interaction 
SC  LM  model  would  be  more  appropriate  than  an 
interaction  SC  LM  model. 

33.  State  a  LMait  model  that  can  be  used  with  an  aug¬ 
mented  dataset  that  will  provide  identical  results  to 
those  obtained  from  using  the  model  of  Question  27. 

34.  For  the  LMait  model  of  Question  33,  what  is  the  for¬ 
mula  for  the  hazard  ratio  for  the  group  effect  G, 
controlling  for  TMS  and  AGE? 


The  dataset  shown  below  describes  a  hypothetical  study  of 
recurrent  bladder  cancer.  The  entire  dataset  contained  53 
patients,  each  with  local  bladder  cancer  tumors  who  are 
followed  for  up  to  30  months  after  transurethral  surgical 
excision.  Three  competing  risks  being  considered  are  local 
recurrence  of  bladder  cancer  tumor  (event  =  1),  bladder 
metastasis  (event  =  2),  or  other  metastasis  (event  =  3). 
The  variable  time  denotes  survival  time  up  to  the  occur¬ 
rence  of  one  of  the  three  events  or  censorship  from  loss  to 
follow-up,  withdrawal,  or  end  of  study.  The  exposure  vari¬ 
able  of  interest  is  drug  treatment  status  (tx,  0  =  placebo, 
1  =  treatment  A),  The  covariates  listed  here  are  initial 
number  of  tumors  (num)  and  initial  size  of  tumors  (size) 
in  centimeters. 


id 

event 

time 

tx 

num 

size 

1 

1 

8 

1 

i 

1 

2 

0 

1 

0 

i 

3 

3 

0 

4 

1 

2 

1 

4 

0 

7 

0 

1 

1 

5 

0 

10 

1 

5 

1 

6 

2 

6 

0 

4 

1 

7 

0 

10 

1 

4 

1 

8 

0 

14 

0 

1 

1 

9 

0 

18 

1 

1 

1 

10 

3 

5 

0 

1 

3 

11 

0 

18 

1 

1 

3 

12 

1 

12 

0 

1 

1 

13 

2 

16 

1 

1 

1 

14 

0 

18 

0 

1 

1 

15 

0 

23 

1 

3 

3 

( Continued  on  next  page) 
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(i Continued l) 


id 

event 

time 

tx 

num 

size 

16 

3 

10 

0 

i 

3 

17 

1 

15 

1 

i 

3 

18 

0 

23 

0 

i 

3 

19 

2 

3 

1 

i 

1 

20 

3 

16 

0 

i 

1 

21 

1 

23 

1 

i 

1 

22 

1 

3 

0 

3 

1 

23 

2 

9 

1 

3 

1 

24 

2 

21 

0 

3 

1 

25 

0 

23 

1 

3 

1 

26 

3 

7 

0 

2 

3 

27 

3 

10 

1 

2 

3 

28 

1 

16 

0 

2 

3 

29 

1 

24 

1 

2 

3 

30 

1 

3 

0 

1 

1 

31 

2 

15 

1 

1 

1 

32 

2 

25 

0 

1 

1 

33 

0 

26 

1 

1 

2 

34 

1 

1 

0 

8 

1 

35 

0 

26 

1 

8 

1 

36 

1 

2 

0 

1 

4 

37 

1 

26 

1 

1 

4 

38 

1 

25 

0 

1 

2 

39 

0 

28 

1 

1 

2 

40 

0 

29 

0 

1 

4 

41 

0 

29 

1 

1 

2 

42 

0 

29 

0 

4 

1 

43 

3 

28 

1 

1 

6 

44 

1 

30 

0 

1 

6 

45 

2 

2 

1 

1 

5 

46 

1 

17 

0 

1 

5 

47 

1 

22 

1 

1 

5 

48 

0 

30 

0 

1 

5 

49 

3 

3 

1 

2 

1 

50 

2 

6 

0 

2 

1 

51 

3 

8 

1 

2 

1 

52 

3 

12 

0 

2 

1 

53 

0 

30 

1 

2 

1 

1.  Suppose  you  wish  to  use  these  data  to  determine 
the  effect  of  tx  on  survival  time  for  the  cause-specific 
event  of  a  local  recurrence  of  bladder  cancer.  State  a 
no-interaction  Cox  PH  model  for  assessing  this  rela¬ 
tionship  that  adjusts  for  the  covariates  num  and  size. 
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2.  When  fitting  the  model  given  in  Question  1,  which 
subjects  are  considered  censored? 

3.  How  would  you  modify  your  answers  to  Questions  1 
and  2  if  you  were  interested  in  the  effect  of  tx  on 
survival  time  for  the  cause-specific  event  of  finding 
metastatic  bladder  cancer? 

4.  For  the  model  considered  in  Question  1,  briefly 
describe  how  to  carry  out  a  sensitivity  analysis  to  deter¬ 
mine  how  badly  the  results  from  fitting  this  model 
might  be  biased  if  the  assumption  of  independent  com¬ 
peting  risks  is  violated. 

5.  The  following  two  tables  provide  information  neces¬ 
sary  for  calculating  CIC  curves  for  local  recurrence  of 
bladder  cancer  (event  =1)  separately  for  each  treat¬ 
ment  group.  The  CIC  formula  used  for  both  tables  is 
given  by  the  expression 

f  f 

ciCi(tf)  =  ^ii(t'f)  =  £s(tp  _  i)hl(tp) 
p=i  f=i 

where  hi(tf)  =  mlf/nf,  mlf  is  the  number  of  local  recur¬ 
rent  failures  at  time  tf  and  S(tf_i)  is  the  overall  (event- 
free)  survival  probability  for  failure  from  either  of  the 
two  competing  risks  at  time  tf_i. 


tx  =  1  (Treatment  A) 


tf 

nf 

du 

hi(b) 

S(tf_!) 

Ii(tf) 

CICfrtf) 

0 

27 

0 

0 

— 

— 

— 

2 

27 

0 

0 

1 

0 

0 

3 

26 

0 

0 

.9630 

0 

0 

4 

24 

0 

0 

.8889 

0 

0 

8 

23 

1 

.0435 

.8889 

.0387 

.0387 

9 

21 

0 

0 

.8116 

0 

.0387 

10 

20 

0 

0 

.7729 

0 

.0387 

15 

17 

1 

.0588 

.7343 

.0432 

.0819 

16 

15 

0 

0 

.6479 

0 

.0819 

18 

14 

0 

0 

.6047 

0 

.0819 

22 

12 

1 

.0833 

.6047 

.0504 

.1323 

23 

11 

1 

.0910 

.5543 

.0504 

.1827 

24 

8 

1 

.1250 

.5039 

.0630 

.2457 

26 

7 

1 

.1429 

.4409 

.0630 

.3087 

28 

4 

0 

0 

.3779 

0 

.3087 

29 

2 

0 

0 

.2835 

0 

.3087 

30 

1 

0 

0 

.2835 

0 

.3087 
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tx  =  1  (Placebo) 


tf 

nf 

du 

hi(tf) 

S(tf_f) 

Ii(tf) 

CICf(tf) 

0 

26 

0 

0 

— 

— 

— 

1 

26 

1 

.0400 

1 

.0400 

.0400 

2 

24 

1 

.0417 

.9615 

.0400 

.0800 

3 

23 

2 

.0870 

.9215 

.0801 

.1601 

5 

21 

0 

0 

.8413 

0 

.1601 

6 

20 

0 

0 

.8013 

0 

.1601 

7 

18 

0 

0 

.7212 

0 

.1601 

10 

16 

0 

0 

.6811 

0 

.1601 

12 

15 

1 

.0667 

.6385 

.0426 

.2027 

14 

13 

0 

0 

.6835 

0 

.2027 

16 

12 

1 

.0833 

.5534 

.0461 

.2488 

17 

10 

1 

.1000 

.4612 

.0461 

.2949 

18 

9 

0 

0 

.4150 

0 

.2949 

21 

8 

0 

0 

.4150 

0 

.2949 

23 

7 

0 

0 

.3632 

0 

.2949 

25 

6 

1 

.1667 

.3632 

.0605 

.3554 

29 

4 

0 

0 

.2421 

0 

.3554 

30 

2 

1 

0 

.2421 

0 

.3554 

a.  Verify  the  CICi  calculation  provided  at  failure  time 
tf  =  8  for  persons  in  the  treatment  group  (tx  =  1); 
that  is,  use  the  original  data  to  compute  hi(tf), 
S(tf_i),  Ii(tf),  and  CICi(tf),  assuming  that  the  cal¬ 
culations  made  up  to  this  failure  time  are  correct. 

b.  Verify  the  CICi  calculation  provided  at  failure  time 
tf  =  25  for  persons  in  the  placebo  group  (tx  =  0). 

c.  Interpret  the  CICf  values  obtained  for  both  the 
treatment  and  placebo  groups  at  tf  =  30. 

d.  How  can  you  calculate  the  CPCi  values  for  both 
treatment  and  placebo  groups  at  tf  =  30? 
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6.  The  following  output  was  obtained  using  separate 
models  for  each  of  the  3  event-types. 


Event  =  1 


Var 

DF 

Coef 

Std.Err. 

P  >  M 

Haz.ratio 

tx 

1 

-0.6258 

0.5445 

0.2504 

num 

1 

0.0243 

0.1900 

0.8983 

size 

1 

0.0184 

0.1668 

0.9120 

1.125 

Event  = 

2 

Var 

DF 

Coef 

Std.Err. 

P  >  M 

Haz.ratio 

tx 

1 

-0.0127 

0.6761 

0.9851 

0.987 

num 

1 

-0.1095 

0.2281 

0.6312 

0.896 

size 

1 

-0.6475 

0.3898 

0.0966 

0.523 

Event  = 

3 

Var 

DF 

Coef 

Std.Err. 

P  >  M 

Haz.ratio 

tx 

1 

-0.3796 

0.6770 

0.5750 

0.684 

num 

1 

-0.1052 

0.3135 

0.7372 

0.900 

size 

1 

-0.0238 

0.2177 

0.9128 

0.976 

a.  What  is  the  effect  of  treatment  on  survival  from 
having  a  local  recurrence  of  bladder  cancer,  and 
is  it  significant? 

b.  What  is  the  effect  of  treatment  on  survival  from 
developing  metastatic  bladder  cancer,  and  is  it  sig¬ 
nificant? 

c.  What  is  the  effect  of  treatment  on  survival  from 
other  metastatic  cancer,  and  is  it  significant? 
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7.  Below  is  the  output  from  fitting  a  LM  model  to  the 
bladder  cancer  data. 


Var  DF  Coef  Std.Err.  p  >  |z|  Haz. ratio 


txd2 

1 

0.6132 

0.8681 

0.4800 

1.846 

txd3 

1 

0.2463 

0.8688 

0.7768 

1.279 

numd2 

1 

-0.1337 

0.2968 

0.6523 

0.875 

numd2 

1 

-0.1295 

0.3666 

0.7240 

0.879 

sized2 

1 

-0.6660 

0.4239 

0.1162 

0.514 

sized3 

1 

-0.0423 

0.2742 

0.8775 

0.959 

tx 

1 

-0.6258 

0.5445 

0.2504 

0.535 

num 

1 

0.0243 

0.1900 

0.8983 

1.025 

size 

1 

0.0184 

0.1668 

0.9120 

1.125 

a.  State  the  hazard  model  formula  for  the  LM  model 
used  for  the  above  output. 

b.  Determine  the  estimated  hazard  ratios  for  the 
effect  of  each  of  the  3  cause-specific  events  based 
on  the  above  output. 

c.  Verify  that  the  estimated  hazard  ratios  computed 
in  Part  b  are  identical  to  the  hazard  ratios  com¬ 
puted  in  Question  6. 

8.  Below  is  the  output  from  fitting  a  LMait  model  to  the 
bladder  cancer  data. 


Var  DF  Coef 

Std.Err. 

p  >  M 

Haz.ratio 

txdl  1 

-0.6258 

0.5445 

0.2504 

0.535 

txd2  1 

-0.0127 

0.6761 

0.9851 

0.987 

txd3  1 

-0.3796 

0.6770 

0.5750 

0.684 

numdl  1 

0.0243 

0.1900 

0.8983 

1.025 

numd2  1 

-0.1095 

0.2281 

0.6312 

0.896 

numd3  1 

-0.1052 

0.3135 

0.7372 

0.900 

sized  1  1 

0.0184 

0.1668 

0.9120 

1.125 

sized2  1 

-0.6475 

0.3898 

0.0966 

0.523 

sized3  1 

-0.0238 

0.2177 

0.9128 

0.976 

a.  State  the  hazard  model  formula  for  the  LM^ 
model  used  for  the  above  output. 

b.  Determine  the  estimated  hazard  ratios  for  the 
effect  of  each  of  the  3  cause-specific  events  based 
on  the  above  output. 

c.  Verify  that  the  estimated  hazard  ratios  computed 
in  Part  b  are  identical  to  the  hazard  ratios  com¬ 
puted  in  Questions  6  and  7. 
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Answers  to 

Practice 

Exercises 


9.  State  the  formula  for  a  no-interaction  SC  LM  model  for 
these  data. 

10.  Describe  how  you  would  test  whether  a  no-interaction 
SC  LM  model  would  be  more  appropriate  than  an 
interaction  SC  LM  model. 


1 .  F:  Only  one  competing  risk  event  can  occur  at  a  given 
time. 

2.  T 

3.  F:  You  can  die  only  once. 

4.  T 

5.  F:  Competing  risks  must  be  treated  as  censored  obser¬ 
vations,  rather  than  ignored. 

6.  F:  A  patient  who  develops  a  local  recurrence  will  be 
treated  as  censored. 

7.  F:  The  statement  would  be  true  providing  censoring  is 
independent. 

8.  T 

9.  F:  A  sensitivity  analysis  can  never  provide  explicit 
evidence  about  whether  the  independence  assump¬ 
tion  is  satisfied;  it  can  only  suggest  how  biased  the 
results  might  be  if  the  assumption  is  not  satisfied. 

10.  T 

11.  T 

12.  F:  The  formula  is  correct  only  if  there  is  one  risk.  See 
Section  V  in  the  text  for  the  general  formula. 

13.  T 

14.  F:  The  correct  statement  should  be:  CPC  gives  the 
probability  of  experiencing  an  event  c  by  time  t, 
given  that  an  individual  has  not  experienced  any  of 
the  other  competing  risks  by  time  t. 
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15.  F:  the  correct  formula  for  CPC  is: 

CPCc  =  CICc/(l  -  CICc')  where  CICc  =  .4  and  CICc'  = 
CIC  from  risks  other  than  c,  where  the  latter  is  not 
necessarily  equal  to  .4. 

16.  T 

17.  T 

18.  T 

19.  T. 

20.  F:  The  correct  LMu  model  is 

h*(t,X)  =  hj(t)  exp [y XD  +  p*Rx  +  p*  Age  +  8*j (D  x  Rx) 
+8*2  (D  x  Age)] 

21.  The  competing  risks  are  (1)  relapse  of  leukemia  and 
(2)  nonrelapse  death. 

22.  20/100  =  0.2. 

23.  Both  tables  have  treated  the  competing  risks  as  if  they 
were  censored  observations. 

24.  The  KM  probabilities  are  different  for  the  two  hospi¬ 
tals  because  the  competing  risks  contribute  a  different 
pattern  of  censorship  in  the  two  hospitals. 

25.  The  CIC  curves  for  each  hospital  are  calculated  as 
follows. 


Hospital  A 


tf 

nf 

mf 

hca(tf) 

S(tf_j) 

Ica(tf) 

CIC(tf) 

0 

100 

0 

0 

— 

— 

— 

2 

40 

0 

0 

1 

0 

0 

4 

40 

20 

0.5 

0.4 

0.20 

0.20 

Hospital  B 

tf 

nf 

mf 

hca(tf) 

S(tf_f) 

Ica(tf) 

CIC(tf) 

0 

100 

0 

0 

— 

— 

— 

1 

80 

0 

0 

1 

0 

0 

2 

80 

15 

0.1875 

0.8 

0.15 

0.15 

3 

65 

0 

0 

0.65 

0 

0.15 

4 

25 

5 

0.20 

0.25 

0.05 

0.20 
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26.  The  CIC  probabilities  are  the  same  at  4  years  because 
they  give  marginal  probabilities  that  are  not  influ¬ 
enced  by  the  pattern  of  censorship  of  the  competing 
risks  that  are  treated  as  censored.  In  hospital  B,  for 
example,  the  marginal  probability  of  0.15  at  year  2  is 
given  by  the  proportion  of  the  initial  risk  set  of  100 
subjects  that  had  a  relapse  of  leukemia  by  year  2, 
regardless  of  the  number  of  nonrelapse  deaths  prior 
to  year  2.  Similarly  for  hospital  B,  the  marginal  prob¬ 
ability  of  .20  at  year  4  adds  to  the  marginal  probability 
at  year  2  the  proportion  of  the  initial  risk  set  of  100 
subjects  that  had  a  relapse  between  year  2  and  year  4, 
regardless  of  the  number  of  nonrelapse  deaths  that 
occurred  between  year  2  and  4. 

27.  hm(t,X)  =  h0(t)  exp[p1HIG  +  P2hiTMS  +  p3HiAGE] 
where  HI  denotes  a  hospital  infection  event 

28.  Patients  who  die  after  surgery  without  developing  a 
hospital  infection  are  censored.  Also  censored  are  any 
patients  who  are  either  lost  to  follow-up  or  withdraw 
from  the  study,  although  such  patients  are  unlikely. 

29.  Augmented  Data  for  LM  Approach 


Subj 

Stime 

Status 

Di 

D2 

G 

TMS 

AGE 

i 

ti 

en 

1 

0 

Gi 

TMS; 

AGE; 

i 

h 

^2i 

0 

0 

Gi 

TMS; 

AGEi 

where  en  =  1  if  the  ith  subject  develops  a  hospital 
infection,  0  otlierwi.se 
e2i  =  1  if  ith  subject  dies  after  surgery, 

0  otherwise 

D  i  =  indicator  for  hospital  infection  event 
D2  =  indicator  for  death  after  surgery  event 

30.  h*(t,  X)  =  h*g(t)  expjpj  G  +  p2  TMS  +  p3  AGE 

g=i,  2 

+82iD2G  +  822D2TMS  +  823D"2AGE] 


31.  HRm(RX  =  1  vs.  RX  =  0)  =  expfpj 
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32.  Carry  out  a  likelihood  ratio  test  to  compare  the  follow¬ 
ing  two  models.  Full  (SC  Interaction  LM)  model: 

h*(t,  X)  =  h*g(t)  expfp!  G  +  p2  TMS  +  p3  AGE 

g=l,2 

+82iD2G  +  822D2TMS  +  823D2AGE] 

Reduced  (no-interaction  SC  LM)  model: 
hg(t,  X)  =  h5g(t)  exp[p!  G  +  p2  TMS  +  p3  AGE] 

g=l,2 

LR  test  statistic  =  —  2  In  LR  —  (— 2  In  LF)  is  distributed 
X3  under  H0:  no-interaction  model 

33.  h'g(t,  X)  =  h'og(t)  exp [8' 11  DiG  +  8'12  DjTMS  +  S'i3  Di AGE 

g=i,  2 

-t-S'aiDaG  +  8'22D2TMS  +823D2AGE] 

34.  HRffl(RX  =  1  vs.  Rx  =  0)  =  exp[8'n] 
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Introduction 


Abbreviated 

Outline 


In  this  chapter,  we  consider  issues  involved  when  designing 
studies  with  time-to-event  outcomes.  This  includes  deter¬ 
mining  the  expected  number  of  events  to  occur  over  the 
study  period,  the  total  number  of  subjects  to  be  recruited 
into  the  study,  the  power  of  the  study,  the  time  period  over 
which  to  accrue  the  study  subjects,  the  time  period 
over  which  enrolled  subjects  will  be  followed,  and  how  to 
adjust  sample  size  requirements  to  allow  for  subjects  who 
might  be  lost  to  follow-up  and/or  who  might  switch  thera¬ 
pies  from  the  one  they  were  originally  allocated  during  the 
study  period. 

Our  primary  focus  will  consider  prospective  randomized 
trials  of  two  groups  (control  and  treatment)  with  equal 
allocation  into  each  group.  We  will  also  consider  how  to 
modify  sample  size  and/or  power  calculations  when  there 
is  unequal  allocation  into  the  two  groups. 

The  outline  below  gives  the  user  a  preview  of  the  material 
to  be  covered  by  the  presentation.  A  detailed  outline  for 
review  purposes  follows  the  presentation. 

I.  Introduction:  Background  to  Statistical  Decisions 

(pages  500-501) 

II.  Design  Considerations:  Time-to-Event  Outcomes 

(pages  502-503) 

III.  Determining  the  Required  Number  of  Events 

(Nev)  (pages  504-505) 

IV.  Determining  the  Required  Total  Number  of 
Participants  (N)  (pages  505-511) 

V.  Other  Design  Issues  ( pages  511-514) 

VI.  Summary  (pages  514-517) 
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Objectives 


Upon  completing  this  chapter,  the  learner  should  be  able  to: 

1.  Define  and  illustrate  the  following  terms:  significance 
level,  type  I  error,  type  II  error,  power,  effect  size,  accrual 
period,  follow-up  period,  loss  to  follow-up,  crossover 
subject,  drop-in,  drop-out. 

2.  Given  a,  (3,  and  A  for  a  planned  time-to-event  study  in  a 
clinical  to  compare  two  treatments,  determine  the 
number  of  events  required  for  the  study. 

3.  Given  a,  (!,  A,  and  either  the  hazard  rate,  survival 
probability,  or  median  survival  time  expected  for  the 
control  group  in  a  clinical  trial  to  compare  two 
treatments,  determine  the  study  size  required  for  an 
allocation  ratio  of  1 : 1  between  treatment  and  control 
groups. 

4.  For  the  same  situation  as  in  objective  3,  determine  the 
study  size  required  for  an  allocation  ratio  of  R:  1  for  a 
specified  R,  where  R  =  N|/N0,  Nj  is  the  study  size  in  the 
i-th  group,  and  N  is  the  total  study  size. 

5.  For  the  same  situation  as  in  objective  4,  describe  and/or 
illustrate  how  to  adjust  the  study  size  to  account  for 
possible  loss  to  follow-up. 

6.  For  the  same  situation  as  in  objective  4,  describe  and/or 
illustrate  how  to  adjust  the  study  size  to  account  for 
subjects  who  crossover  from  treatment  to  control 
group  or  from  control  to  treatment  group. 


500  10.  Design  Issues  for  Randomized  Trials 


Presentation 


I.  Introduction: 
Background  to 
Statistical  Decisions 


•  How  many  subjects? 

•  Accrual  period  for  recruitment? 

•  Follow-up  period  after  accrual? 

•  What  to  do  if  subjects  switch  therapies? 


The  preceding  chapters  have  been  focused  on 
developing  a  methodological  and  statistical 
framework  for  analysis  of  time  to  event  or  sur¬ 
vival  problems.  In  this  chapter,  we  concentrate 
on  the  issues  which  are  needed  when  designing 
studies  with  time  to  event  outcomes. 

These  issues  typically  include  questions  such 
as:  how  many  subjects  should  be  recruited  into 
the  study;  over  what  period  should  the  study 
accrue  the  subjects;  for  what  period  after  accrual 
should  the  subjects  be  followed-up  for  an  event; 
what  is  the  consequence  of  subjects  switching 
therapies  from  the  one  they  were  originally  allo¬ 
cated  during  the  study  period?  Addressing  these 
issues  is  key  to  designing  a  study  which  has 
sound  scientific/statistical  principles  and  adds 
to  the  credibility  of  the  final  results  among  the 
scientific  community. 


Review:  “power”  and 
"level  of  significance” 


Before  we  develop  these  issues,  we  need  to 
review  the  key  statistical  concepts  of  statistical 
power  and  level  of  significance. 


Most  survival  studies: 

•  Compare  two  or  more  groups 

•  H0:  survival  experience  is  the 
same  in  groups  being 
compared 

•  Ha:  survival  experience  is 
different  in  groups  being 
compared 


Most  survival  studies  are  designed  to  compare 
the  time  to  event  of  two  or  more  groups.  We 
typically  begin  with  the  assumption  that  the  sur¬ 
vival  experience  is  the  same  in  these  groups.  This 
assumption  is  called  the  null  hypothesis  (H0). 

We  then  seek  (through  the  study)  to  determine 
whether  there  is  sufficient  evidence  to  demon¬ 
strate  that  the  survival  experience  in  the  groups 
is  different.  This  is  termed  the  alternative 
hypothesis  (HA). 


P-value: 

•  Gives  evidence  against  H0 

•  P  small  =>•  statistical 
significance 


The  amount  of  evidence  found  against  the  null 
hypothesis  is  determined  by  the  P-value  (e.g. 
from  a  logrank  or  proportional  hazards  tests). 
If  the  P-value  is  small  enough,  we  conclude 
that  there  is  statistically  sufficient  evidence  to 
conclude  that  the  groups  are  different. 
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How  small? 

Traditionally: 

P  <  0.05  =>■  Evidence  vs.  Ho 
(i.e..  Reject  Ho) 

a  =  level  of  significance: 
Pr(RejectHojH0)  =  Pr(Type  I  Error)  =  a 


How  small  is  small  enough?  Traditionally  a  P- 
value  less  than  0.05  is  regarded  as  providing 
sufficient  evidence  against  the  null  hypothesis 
of  no  difference  between  the  two  groups. 

This  threshold  value,  referred  to  as  the  level  of 
significance,  is  denoted  by  a.  For  a  specified 
value  of  a,  we  are  saying  that  there  is  a  proba¬ 
bility  of  a  that  we  may  decide  to  reject  the  null 
hypothesis  when  in  fact  the  null  hypothesis  is 
correct.  In  statistical  terms,  this  type  of  error  is 
referred  to  as  a  Type  I  error. 


Pr(Do  not  reject  Ho  |Ha) 

=  Pr(Type  II  Error)  =  [> 


The  other  error  in  statistical  decision  making  is  to 
declare  that  there  is  insufficient  evidence  to  say 
the  groups  are  different  (associated  with  P-values 
>  0.05)  when  in  fact  they  are  actually  different, 
This  is  referred  to  statistically  as  a  Type  II  error. 


Power  =  Pr(RejectHo|HA) 

=  1  —  Pr(Typelerror) 


The  power  of  the  study  is  1-  probability  of  a 
j>  Type  II  error,  also  denoted  1-P,  where  p  is  the 
probability  of  a  Type  II  error. 


Sample  size  =>  Power 
(information)  (declare  H^j 


The  power  will  typically  increase  as  the  sample 
size  increases,  i.e.,  the  more  information  we 
gather,  the  more  likely  we  will  declare  the  two 
groups  being  compared  as  different  when  they 
truly  are  different. 


Strength  of  evidence,  i.e.,  effect 
size  (A): 

•  Different  ways  to  quantify  A: 

A  =  0i  —  9q  or  0i  /90 where 

1 .  0j  are  hazard  rates 

2.  0;  are  survival  probabilities 

3.  Oj  are  mean  survival  times 

•  A  large  =>■  smaller  sample  size 
required 


The  final  quantity  to  consider  is  the  strength  of 
evidence  of  interest,  typically  referred  to  as  the 
effect  size,  and  denoted  as  A.  This  measure  (A) 
can  be  either  in  the  form  of  a  difference  or  ratio 
of  hazard  rates,  survival  probabilities,  or 
median  survival  times.  If  the  effect  size  is  large 
(e.g.  a  60%  increase  in  disease  free  survival  at  5 
years),  then  fewer  subjects  would  be  required  to 
make  a  decision  that  the  groups  are  different. 


EXAMPLES:  RATIO  MEASURES  OF 
EFFECT  SIZE 


a)  Reduction  in  annual  event  rate  from 
10%  to  5%:  A  =  Vfr  =  10/5  =  2 

b)  Increase  in  3-year  survival  from  74%  to 
86%:  A  =  (Si/So)  =  0.86/0.74  =  1.16  or 
A  =  (lnSo/lnSO  =  ln(0.74)/ln(0.86)  =  2 

c)  Increase  in  median  survival  from  7 
to  14  months:  A  =  (m]/m0)  =  14/7  =  2 

Sample  size  (N)  related  to 
a,  1  -  p,  and  A 


Examples  of  three  types  of  effect  sizes  that  are 
all  ratio  measures  are  shown  at  the  left.  We 
typically  define  A  to  be  greater  than  1  in  order 
to  reflect  a  “positive”  effect.  For  example,  in  a) 
at  the  left,  a  reduction  in  event  rates  indicates  a 
positive  effect,  so  the  numerator  in  the  ratio 
formula  for  A  contains  k0  rather  than  /., . 

At  this  point,  we  have  identified  three  crucial 
inputs  required  for  sample  size  calculation:  the 
level  of  significance  (a),  the  power  (1-P)  and  the 
effect  size  (A). 
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II.  Design  Considerations: 
Time-to-Event 
Outcomes 


Primary  focus: 

•  Prospective  randomized  trials 

•  Comparing  two  groups 

•  Equal  allocation 

•  Unequal  allocation 

Assume: 

A  =  accrual  period 
F  =  follow-up  period 


A  +  F  =  total  study  period 


F 

0 


F 

0 


Observed  event 

accural  follow-up 

^  \/ - ^ 


Censored  event 


accural 


T 

entry 


A 


follow-up 


~y 


— I 
A+F 


— I 
A+F 


Information  on  each  subject: 


•  Occurrence  of  event  (yes  or  no)? 

•  Time  to  event  or  censoring 

•  Treatment  group  allocation 


Design  issues: 

•  Effect  size  of  interest  (plausible 
and  clinically  relevant) 

•  Sufficient  funds 

•  Adequate  resources 

•  Sufficient  #  of  patients 
available  to  participate 


Our  discussion  of  design  and  sample  size 
considerations  in  survival  analysis  studies  will 
primarily  focus  on  prospective  randomized 
trials  of  two  groups  (control  and  intervention) 
with  equal  allocation  into  each  group.  We  will 
also  consider  how  to  modify  sample  size  and/or 
power  calculations  when  there  is  unequal  allo¬ 
cation  into  the  two  groups. 


Consider  a  study  where  we  plan  to  accrue  sub¬ 
jects  for  a  period  of  A  (days,  months,  years)  time 
units,  and  then  observe  or  follow  them  up  for  a 
further  F  time  units.  This  defines  the  total  study 
period  to  have  A  +  F  time  units. 

The  diagrams  on  the  left  show  how  subjects 
who  experience  an  event  or  are  censored  are 
represented  in  the  total  study  time  A+F.  The 
top  diagram  illustrates  a  subject  experiencing 
the  event  T  units  after  recruitment  into  the 
study. 

The  bottom  diagram  illustrates  a  subject  not 
experiencing  the  event  (i.e.,  censored)  after 
recruitment  into  the  study. 

For  each  subject,  we  require  information  on 
whether  or  not  the  event  of  interest  had 
occurred,  the  time  from  recruitment  into  the 
study  to  either  the  time  that  the  event  occurs  or 
the  time  of  censoring,  and  the  treatment  group 
to  which  the  subject  was  allocated. 


A  key  design  issue  for  clinical  trials  already 
mentioned  is  the  effect  size  of  interest  for 
comparing  the  two  treatments.  This  difference 
should  be  both  plausible  and  clinically  worth¬ 
while.  Other  issues  include  the  need  for  suffi¬ 
cient  funds  to  support  the  study,  adequate 
resources  to  cany  out  the  study  (number  of 
institutions  participating,  etc.),  and  sufficient 
number  of  patients  with  the  disease  who 
would  be  prepared  to  participate  in  the  pro¬ 
posed  study. 
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Study  size  (N)  versus  #  of  Events  When  planning  a  clinical  trial  involving  time- 
(Nev)  to-event  outcomes,  it  is  important  to  distin¬ 

guish  the  study  size  required  from  the  number 
of  events  required. 


N  =  total  #  of  study  participants  The  study  size  refers  to  the  total  number  of 
•  includes  those  who  get  event  and  subjects  chosen  to  participate  in  the  study, 
those  who  do  not  get  the  event  including  those  who  get  the  event  and  those 

who  do  not  get  the  event. 


NEv  =  #  of  participants  who  get  The  number  of  events  refers  to  those  study 
event  during  study  period  participants  who  actually  get  the  event  during 

the  study  period. 


Process  for  determining  N  in  time- 
to-event  studies: 

Step  1 :  Determine  NEV  using  a, 

1— P,  A 

Step  2:  Determine  N  from  NEV 
by  extrapolation  using 
N  =  NEV/pEV 

where 

pEV  =  Pr(  study  participant  gets 
event) 

In  practice: 

recruit  study  participants  (N) 

rather  than 

events  (NEV)-  occur  after  study 
begins 


When  determining  sample  size  and/or  power  in 
such  studies,  we  typically  first  determine  the 
expected  (or  required)  number  of  events 
(Nev),  after  which  we  then  determine  the  study 
size  required  (N).  This  means  that  a,  P  and  A  are 
needed  to  determine  NEv,  which  then  gets  “extra¬ 
polated”  to  N  by  dividing  by  (a  guestimate  of)  the 
probability  that  a  study  participant  gets  an  event. 
(Details  on  “extrapolation”  in  Section  IV.) 

Note  that  the  recruitment  of  study  participants 
(rather  than  only  those  participants  who 
become  cases)  is  what  actually  happens  in 
practice,  since  events  do  not  start  occurring 
until  after  the  study  begins. 


Section  IV  (page  ): 

determining  N  from  NEv  using 
assumptions  on 

study  duration 

and  distribution  of  event  times 


Nevertheless,  the  study  size  required  can  be 
determined  without  waiting  for  events  to  start 
occurring,  as  we  will  show  in  Section  IV,  by 
making  certain  assumptions  about  study  dura¬ 
tion  (i.e.,  accrual  and  follow-up  periods)  and  the 
(hazard  or  survival)  distribution  of  event  times. 


Study  duration 

•  long  enough  to  observe  NEv 

•  specify  in  advance: 

accrual  and  follow-up  periods, 
and  NEv 


The  study  duration  must  be  determined  to  be 
sufficiently  long  to  observe  the  expected  num¬ 
ber  of  events.  If  the  accrual  and  follow-up  per¬ 
iods  as  well  as  expected  number  of  events  are 
determined  in  advance,  it  is  possible  to  project 
a  sufficiently  long  study  duration. 
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III.  Determining  The 
Required  Number  of 
Events  (NEV) 

Assumptions:  For  convenience,  we  describe  how  to  determine 

1.  Ni  =  N0  i.e.,  equal  study  sizes  in  the  required  number  of  events  by  making  the 

each  group  assumptions  shown  at  the  left. 

2.  Ha  is  two  tailed 

3.  Nj  and  N0  large  enough  to  use 
normal  approximation,  i.e.,  Z  ~ 

N(0,1) 

4.  Proportional  hazards  satisfied, 
i.e.,  XJXq  constant  over  time 


Assumption  4  (PH  satisfied)  allows 
ratio  measures  of  effect  size  mea¬ 
sures  of  any  type: 

A  =  Ai/20,  A  =  Si/S0,or  A  =  mi/m0 


If  we  assume  proportional  hazards,  then  the 
effect  size  can  be  calculated  based  on  any  of  the 
three  survival  attributes:  event  rates  (/.;),  survival 
probabilities  (Si)  or  median  survival  times  ( m). 


nev  = 


(zl-a/2  +  zl-fi')(A+  l)'' 


(A-l) 


For  a  given  effect  size  (A),  level  of  significance 
^  a,  and  power  (l-(3),  the  number  of  events 
required  for  both  groups  combined  is  given 
(Freedman  1982)  by  equation  (1)  on  the  left. 


Power  =  Pr(Z  <  zev) . (2) 


where  zEV 


The  power  corresponding  to  a  specified  number 
of  events  can  be  obtained  using  equation  (2). 
Note  that  Pr(Z  <  zEV)  is  the  cumulative  pro¬ 
bability  below  the  value  zEV  of  the  standard 
normal  distribution. 


EXAMPLE 


A  (=  mj/mo)  =  14/7  =  2 
a  =  0.05,  zj_a/2  =  1.96,  p  =  0.20, 
Zi_p  =  0.84 


Nev  = 


/  (1.96  +  0.84)(2  +  l)\ 

l  (2-1)  ) 


-2, 

fe  70.56  ^  M 


Power:  Pr(Z  <  zEV)  where 


1.373 


so  that 

Pr(Z  <  1.373)  =  0.92 


As  an  example,  suppose  we  are  interested  in  the 
increase  in  median  survival  time  defined  by  the 
effect  size  A  (=  irq/mo)  =  14/7  =  2.  Suppose 
further  we  set  the  level  of  significance  at  5%, 
so  Z\—<yj2  =  1-96,  and  we  desire  80%  power,  so 
zi-p  =  0.84,  where  the  z  values  are  obtained 
from  the  standard  normal  distribution. 

The  number  of  events  required  using  equation 
( 1 )  is  given  by  the  calculation  shown  at  the  left; 
thus,  71  events  are  required. 

Suppose  we  commence  the  study  and  find  that 
the  event  rate  is  occurring  much  faster  than 
anticipated,  and  we  have  observed  100  events. 
Then  the  power  of  our  study  using  equation  (2) 
is  given  by  the  calculation  shown  at  the  left. 
Thus  with  100  events,  our  study  has  92% 
power  to  detect  a  increase  in  median  survival 
time  from  7  months  to  14  months. 
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Unequal  group  sizes: 

•  Requires  modifying  formula  ( 1 ) 

•  One  treatment  is  more 
expensive  than  the  other 

•  Gain  more  experience  with  a 
new  intervention 


Formula  (1)  can  be  modified  to  allow  for 
unequal  group  sizes. 

Unequal  group  sizes  may  be  considered  when 
one  treatment  is  more  expensive  than  the  other 
or  to  gain  more  experience  with  a  new  inter¬ 
vention. 


R  =  Nj/Nq  where 
N  =  Nj  +  N0,  Nj=#  of  events  in 
group  i,  i=0,  1 


NEv  = 


(zl-a/2  +zl-/?)  (RA+1)N 


Vr(a-i) 


(3) 


We  let  R  denote  the  ratio  Nj/Nq,  where  N; 
denote  the  number  of  subjects  (with  and  with¬ 
out  events)  in  the  i-th  treatment  group,  i=0,  1 , 
so  that  N=  NrfNo  gives  the  total  sample  size 
for  the  study. 

The  modified  formula  for  the  required  number 
of  events  (NEV)  is  modified  to  equation  (3)  on 
the  left. 


EXAMPLE:  R=2  I 

Nev  =  | 

/'(1.96  +  0.84)(2(2)  +  l)N\ 

2 

=  98 

^  v^(2pl)  ) 

Here,  we  illustrate  calculations  using  formula 
(3)  for  R=2.  Note  that  the  value  obtained  for 
Nev  (i.e.,  98)  when  R  =  2  is  somewhat  higher 
than  the  value  previously  obtained  (i.e.,  71) 
when  we  used  equal  sample  sizes  (i.e.,  R=l). 


IV.  Determining  the 
Required  Total 
Number  of 
Participants  (N) 


accrual  period 


follow-up  (F) 


time  of  entry  into  study 


To  calculate  the  number  of  participants 
required  in  designing  a  clinical  study,  some 
additional  pieces  of  information  are  required 
(see  left  diagram); 

1 .  the  accrual  period,  A; 

2.  the  follow-up  period,  F,  after  the  last 
subject  has  been  entered  onto  the  study; 

3.  the  median  follow-up  time  MF  =  A/2  +  F 

4.  X  denotes  the  time  point  at  study  entry  for 
any  subject  entering  between  time  0  and 
time  A 

5 .  A  +  F  —  X  denotes  the  maximum  follow-up 
since  study  entry  for  a  subject  entering  at 
time  X 

6.  the  distribution  of  the  time(s)  to  event  T: 

specify  either 

the  survivor  functions  S;(t),  i=l ,2,  for  each 
group 

or  assume 

constant  event  rates  fy,  so  that  Sj(t)  =  e-u 
(exponential  distribution). 
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Determining  N  (cont’d) 

Relating  number  of  events  to  total 
sample  size: 

NEv  =  N  x  pEV 
where 

N=  total  sample  size 
pEV  =  Pr (event  occurs  since 
study  entry) 

Pev  =  Pr(EV|i  =  1)  Pr(i  =  1) 

+  Pr(EV|i  =  0)  Pr(i  =  0) 

=  Wj  pEvl  +  Wo  pEVo 
where 

PEVi  =  Pr(EV|i)  and  Wj  =  Pr(i),  i  =  0,1 


A  general  formula  for  the  total  sample  size 
(subjects  with  and  without  events)  is  derived 
by  assuming  that  the  number  of  events  needed 
(Nev)  is  the  product  of  the  total  sample  size  (N) 
times  the  probability  (pEv)  that  a  subject  will 
get  an  event  since  study  entry. 

Since  we  are  assuming  two  treatment  groups, 
we  can  use  basic  conditional  probability  rules 
to  express  pEV  as  the  sum  of  the  two  compo¬ 
nents  shown  at  the  left. 

The  term  pEVi  denotes  the  probability  of  an 
event  occurring  given  a  subject  is  in  treatment 
group  i,  i=0,l.  The  term  w,  denotes  the  proba¬ 
bility  of  being  in  treatment  group  i. 


If  R  =  Nj/Nq  and  Ni+N0=N 
where  N,  is  the  sample  size  in 
group,  i=0,l  then 
w;  =  Nj/N,  i  =  0,  1 


If  the  allocation  ratio  for  subjects  into  the  two 
treatment  groups  is  determined  in  advance  to 
be  R  =  N|/N0,  where  Nj  is  the  total  sample  size 
in  group  i,  then  w;,  which  is  effectively  a 
weight,  denotes  the  proportion  of  the  total 
sample  size  in  group  i  (i.e. ,  N/N). 


Algebra: 

Wl  =N1/(N1+N0)  =  R/(R+1) 
w0=N0/(N1+No)  =  1/(R+1) 


N  = 


NEv 


R 


"Pevi 


1 


R  +  l^vl  R  +  1 
Special  Case:R  =  1 
NEv 


Pevo 


N  = 


(Pevi  +  Pevo)/2 


With  algebra,  we  can  divide  both  the  numera¬ 
tor  and  denominator  of  wj  by  N0  to  express  the 
weights  (wi)  in  terms  of  R,  as  shown  on  the  left. 

We  can  thus  write  the  formula  for  total  sample 
size  (N)  in  terms  of  the  number  of  events  (NEV) 
and  the  allocation  ratio  R. 

As  a  special  case,  if  the  allocation  ratio  is  R=  1 , 
i.e.,  Ni  =N0,  then  the  denominator  in  the  for¬ 
mula  for  N  simplifies  to  the  arithmetic  average 
of  the  pEVi. 


Alternative  formula: 
Express  (pEvi)  in  terms  of 
parameters  of  Si(t) 


An  alternative  formula  for  the  total  sample  size 
can  be  obtained  if  we  can  express  the  event 
probabilities  (pEvi)  in  terms  of  survival  curve 
parameters. 


pEVi  =  1  —  Pr(no  event  since  study  entry  |  i) 
=  1  —  Pr  (surviving  since  study  entry  |  i) 
«1  — Si(MF) 

where 

S;(t)  =  survival  function  at  time 
t  for  group  i 

Mf  =  A/2  +  F  =  median  follow¬ 
up  time 


We  first  write  pEVi  as  1  minus  the  probability  of 
no  event  occurring  since  study  entry,  that  is, 
one  minus  the  probability  of  surviving  since 
study  entry. 

The  latter  survival  probability  can  be  approxi¬ 
mated  as  the  probability  that  a  subject  survi¬ 
ves  past  the  median  follow-up  time  (i.e.,  MF  = 
A/2  +  F)  for  all  subjects. 


Presentation:  IV.  Determining  the  Required  Total  Number  of  Participants  (N)  507 


Total  Sample  Size:  Formula  1  Thus,  a  general  formula  for  the  total  sample 

size  can  be  given  in  terms  of  the  survival 
model  assumed  by  the  investigators,  as  shown 
on  the  left. 

In  particular,  if  the  survival  function  is  expo¬ 
nential,  the  formula  can  be  further  specified  as 
shown  here. 

As  an  example,  suppose  the  accrual  period  is 
2  years,  and  the  follow-up  period  is  4  years,  so 
that  the  median  follow-up  time  (MF)  is  5.  Sup¬ 
pose  further  that  R=l,  a=.05,  p=.10,  the  con¬ 
trol  group  hazard  rate  (A,0)  is  0. 1 0  and  the  effect 
size  (A  =  Xq/Xi)  is  2,  i.e.,  we  consider  a  reduc¬ 
tion  in  hazard  rates  from  0. 1 0/year  to  0.05/year. 

Substituting  this  information  into  the  sample 
size  formula,  we  obtain  a  total  required  sample 
size  of  308  subjects,  which  will  yield  approxi¬ 
mately  95  events,  with  an  approximate  overall 
event  probability  of  0.31.  Note  that  the  latter 
probability  estimate  is  calculated  using  the 
denominator  in  the  sample  size  formula. 


EXAMPLE 


A=2,  F=4,  so  Mf=A/2  +  F  =  5 
R=1 

a=0.05,  p=0.10 

X<,  =  0.10,A  =  XoA.i=2 

M_  [( 1.96  +  1.282)  (2  +  l)/(2  —  l)]2 

1  |  !  _  e— 2(0  05X5)  |  +  1  |  j  _  e — (0-05){5)  j 

94  595  — 

=  =  307.8 

0.307 


Alternative  approach  for 
computing  N: 

1 .  Assumes  exponential  survival 
Sj(t)  =  exp(— 2it),  i  =  0, 1 

2.  Does  not  use  Sj(MF) 


We  now  describe  an  alternative  approach  for 
deriving  a  total  sample  size  formula  that  assumes 
an  exponential  survival  curve,  but  doesn’t  use  the 
median  follow-up  time  (MF)  to  approximate  the 
calculation  of  survival  probabilities. 


Exponential  survival  function: 
typically  yields  reasonable  sample 
size  estimates 

(note:  Cox  model  has  exponential 
component) 


N  = 


NEv 


R  1 

rTTPevi  +  rTTPevo 


Parametric  survival  distributions  other  than 
exponential  are  always  possible  for  survival 
data.  Nevertheless,  by  assuming  an  exponential 
function,  we  typically  obtain  reasonable  sample 
size  estimates.  This  makes  sense  because  of  the 
wide  applicability  of  the  Cox  model,  whose 
parametric  component  has  exponential  form. 

Recall  that  we  previously  provided  a  formula 
for  the  total  sample  size  (shown  again  here) 
that  involved  the  number  of  events  required 
(N|.;v),  the  allocation  ratio  (R),  and  the  prob¬ 
abilities  (pEVi)  of  an  event  occurring  in  each 
treatment  group. 


Next:  calculating  pEvi,  i=0,l. 


We  now  describe  how  to  calculate  event  prob¬ 
abilities  (pEvi)  in  terms  of  exponential  survival 
parameters. 
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Pr/EV  and  entry  at  X) 

=  Pij(EV  |  entry  at  X)Pr(entry  at  X) 

|  Pr(A  and  B)  =  Pr(A|Bj  Pr(B)  ; 


Assume  uniform  distribution  for 
each  i: 


We  first  write  the  probability  that  a  subject 
entering  treatment  group  i  at  time  X  gets  the 
event  as  the  product  of  probabilities.  This  fol¬ 
lows  from  the  basic  probability  rule  that  the 
joint  probability  of  two  events  A  and  B  equals 
the  conditional  probability  of  A  given  B  times 
the  probability  of  B. 


X  ~  U[0,  A]  =>  Pr( entry  at  X)  =  1/A 
so  that 

Pi'i(EV  and  entry  at  X)  =  ip+EV]  entry  at  X) 

-^-Pri(EV|  entry  at  X) 

=  -^[1  —  Pri(No  EV|entry  at  X)] 

=  [1  —  Prj(Survive  past  A  +  F  —  X|entry  at  X)] 


We  can  simplify  the  above  formula  if  we 
assume  that  the  time  X  at  which  any  subject 
enters  the  study  has  the  uniform  distribution 
over  the  accrual  period. 

The  probability  of  an  event  is  equal  to  1  minus 
the  probability  of  no  event,  which  is  1  minus  the 
probability  of  surviving  since  entry.  Thus,  we 
can  rewrite  the  right  hand  side  of  above  formula 
as  shown  here.  Note  that  A+F— X  gives  the  sur¬ 
vival  time  for  a  subject  entering  at  time  X  and 
surviving  to  the  end  of  the  study  (at  time  A+F). 


Integration  formula  for  pevp 


PEVi  — 


[1  —  S(A  +  F  —  X)]  dx. 


Recognizing  that  a  subject’s  entry  time  X 
ranges  between  0  and  A,  we  can  now  integrate 
over  this  range  to  obtain  pKvi,  the  probability 
that  any  subject  entering  treatment  group  i  at 
time  X  gets  the  event  since  study  entry.  This 
integral  formula  is  shown  at  the  left. 


Let  u  =  A+F— X,  so 


X  =  A  =>  u  =  F  and  X  =  0=>u  =  A  +  F 

Then, 


PEVi  —  1  ~  ^ 


A+F 

S(u)du 

F 


Letting  u  denote  A+F— X,  and  noting  that  u  will 
range  from  F  to  A+F  as  X  ranges  between  A 
and  0,  we  can  further  rewrite  this  integral  as 
shown  here. 


Further  simplification: 
Assume  S(t)  =  exp(— At) 


PEVi  —  l  - 

=  1  - 
=  1  - 


1 

A 


A+F 


+u)du 


F 

^[-exp(-liu)]£+F 

;.'A  fexp(-AiF)  -  exp(— Ai(A  +  F))] 


We  can  further  simplify  this  formula  if  we  are 
willing  to  assume  that  the  survival  curve  fol¬ 
lows  an  exponential  distribution. 

We  can  then  carry  out  the  integration  and 
corresponding  algebra  to  obtain  an  expression 
for  pEVi  that  involves  the  constant  hazard  rate  A,- 
for  treatment  group  i,  the  accrual  period  A,  and 
the  follow-up  period  F  as  shown  on  the  left. 
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Total  Sample  Size:  Formula  2 


N  = 


NEv 


R 


1 


R+  1 


Pevi 


R  +  1 


Pevo 


1 


e-AiF  _  e— ^(A+F) 


where 

PEVi  =  1~M  r 

/.]  =  constant  hazard  for  group  i 
R  =  allocation  ratio, 

A  =  accrual  period, 

F  =  follow-up  period 


We  then  obtain  the  general  formula  shown  at 
the  left  for  the  required  number  of  total  parti¬ 
cipants  (N)  in  terms  of  the  total  expected  events 
(N|.;v),  exponential  hazard  rates  (/,j)  in  each 
group,  effect  size  (A  =  (Aj/^q),  allocation  ratio 
R,  accrual  period  (A),  and  follow-up  period  (F). 


EXAMPLE 


A=2,  F— 4 
R=1 

a=0.05,  |3=0.10 

=  0.10,  A  =  Xo/Xx  =  2 
Nev  =  94.595 


„  ,  -  1 _ 1 _  L-(0.05)(4)  _  -(0.05)(2+4)l 

Pev1  “  1  (0.05)  (2)  F  6  J 

=  1  -  0.7791  =  0.2207 


„  -  I _ 1 _  L-(0.10)(4)  _  -(0.101(244)1 

P^0-1  (0.10)  (2)  F  e  J 

=  1  -  0.6075  =  0.3925 
94.595 


N  = 


|(0.2207)  +1(0.3925) 
94.595 


0.3066 


-  =  308.52  «  309 


We  again  consider  the  same  example  we  previ¬ 
ously  used  with  Formula  1 ,  but  now  applied  to 
Formula  2.  Recall  that  here  we  consider  a 
reduction  in  hazard  rates  from  0.10/year  to 
0.05/year,  so  that  the  effect  size  (A)  is  2.  The 
allocation  ratio  is  assumed  to  be  1:1,  i.e.,  R  =  1. 

We  previously  found  for  this  example  that  the 
number  of  events  required  (NEV)  was  94.595. 

Substituting  the  above  information  into  for¬ 
mula  for  Pevu  find  that  pEVi  =  0.2207  and 
Pevo  =  0.3925. 


We  then  substitute  these  values  for  pEVi  into 
Formula  2,  as  shown  at  the  left  to  obtain  a  total 
sample  size  (N)  of  309,  which  is  essentially  the 
same  as  the  308  obtained  from  Formula  1. 


SECOND  EXAMPLE 


Patients  with  metastatic  colorectal 
cancer 

Intervention  group:  molecular 
targeted  therapy 
vs. 

Control  group  standard  therapy 


As  a  second  example,  suppose  we  wish  to 
determine  the  sample  size  for  a  clinical  trial 
looking  at  a  molecular  targeted  therapy  (inter¬ 
vention  group)  for  patients  with  metastatic 
colorectal  cancer. 
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The  study  aim  is  to  increase  the  median  time  to 
disease  progression  survival  from  6  months  in 
the  control  group  to  10  months  in  the  interven¬ 
tion  group.  Assume  a  15-month  accrual  period 
and  a  12-month  follow-up  period,  90%  power 
and  a  5%  significance  level.  Also  subjects  are  to 
be  allocated  in  a  1:2  ratio  (2  subjects  in  the 
intervention  for  each  control),  i.e.,  R=2. 

Note  that  in  this  example,  we  are  given  the 
desired  increase  in  median  survival  times  (mi) 
from  control  to  intervention  group,  but  not  the 
corresponding  set  of  hazard  rates  (Xj)  for  these 
two  groups,  which  we  need  to  use  in  our  sam¬ 
ple  size  formula. 

Nevertheless,  we  can  determine  the  values  for 
>w  from  their  corresponding  median  survival 
times  mj  by  setting  the  exponential  survival 
probability  S^mO  =  exp(— ^m;)  equal  to  0.5 
for  group  i,  and  then  solving  for  Xb  as  shown 
at  the  left.  Note  that  the  value  of  any  survival 
function  at  its  median  survival  time  is,  by  defi¬ 
nition,  always  equal  to  0.5. 

Thus,  substituting  the  values  of  m0=6  and 
mi=10  in  the  above  formula  for  Xb  we  obtain 
the  values  A,0  =  0.1 16  and  Xj  =  0.069,  as  shown 
at  the  left. 

Consequently,  the  effect  size  (A  =  ^o/?u)  needed 
for  our  sample  size  formula  is  0.1155/0.0693, 
which  equals  1.667. 

To  determine  the  sample  size  required  for  this 
study,  we  first  compute  the  required  number  of 
events  (NEV),  as  shown  on  the  left. 

Thus,  we  require  222  total  events. 

Next:  Computing  Nj  and  N0  using  We  now  use  our  Formula  2  (shown  again  here) 
Formula  2  to  compute  the  sample  sizes  required  in  each 

Nev  treatment  group. 

N  =  ^  I 

rTT  Pevi  +  R+l  Pevo 

where 

p™  =  1-3Me~1F''~1,la+FI- 
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SECOND  EXAMPLE:  (continued) 


Pevi 

=  1  ”  (0.0693)  (15)  l 
-  1  -  0.2707  =  10.72931 


~(0.0693)(12)  _  „  — (0.0693)  (15+12) 


Pevo 

=  1  - 


1 


(0. 1 155) (15)  I 
=  1  -0.1188  -  0.8812 
221.6 


-(0.1155)(12) 


-(0.1 155)(15+12) 


N  = 


^-(0.7293)  +  2^  (0-8812) 


=  284.12; 


Ni  =  [R7(R+1)]N  and  N0  =  Nj/2 

R=2  and  N  =  285  =>Ni  =  190  and 

N0  =  95 


The  computation  of  pKvi,  i=0, 1  is  shown  on  the 
left. 

We  obtain  pEVi  =  0.7293 
and 


Pevo  —  0.8812. 


We  now  substitute  NEV=221.6,  pEVi  =  0.7293, 
Pevo  =0.8812,  and  R=2  into  the  formula  for  N 
as  shown  at  the  left. 

We  obtain  a  total  required  sample  size  (N) 
of  285. 

Now,  given  N  and  R,  we  can  solve  for  Nj  and  N0 
using  the  formula  shown  at  the  left. 


Since  R  =2,  so  that  N!=2N0,  it  follows  that 
N |  =  1 90  and  N0=  95  subjects  are  required  in 
groups  1  and  0  respectively. 


V.  Other  Design  Issues  Other  issues  to  consider  in  the  design  of  time- 

to-event  studies  are  shown  at  the  left. 

•  Choosing  accrual  and  follow¬ 
up  times; 

•  Adjusting  for  loss  to  follow-up; 

•  Adjusting  for  cross-over  (drop- 
in/drop-out) 


Accrual  and  follow-up  times: 

A  =  accrual  period 
F  =  follow-up  period 

•  Need  balance  between  the 
choices  of  accrual  and  follow¬ 
up  periods 

•  Cost  of  Accrual  >> 

Cost  of  follow-up 


We  have  seen  the  accrual  period  (A)  and  length 
of  follow-up  period  (F)  contribute  in  the  expres¬ 
sions  for  the  probability  of  an  event  as  well  as  to 
total  sample  size.  In  any  design,  there  needs  to 
be  a  balance  between  the  choices  of  accrual  and 
follow-up  periods.  The  cost  of  accruing  subjects 
into  a  study  is,  in  general,  far  greater  than  the 
cost  of  follow-up. 
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Impact  of  accrual  and  follow-up 
times  on  the  study  size 

Colorectal  example:  Nev=  161, 
a=0.05, 

1— (3=0.90 


Study  time 

N 

Accrual 

follow-up 

12 

12 

206 

12 

15 

194 

12 

18 

186 

15 

12 

200 

15 

18 

184 

18 

12 

196 

18 

15 

188 

18 

18 

182 

Loss  to  follow-up 

Subjects  leave  study  before  experi¬ 
encing  event 
Reasons: 

•  personal  circumstances 

•  experiencing  unacceptable 
toxicity 

•  loss  of  interest 

•  receiving  newer  therapy 

Follow-up  losses  Nev 

=>  Power 

Adjusting  for  loss  to  follow-up: 
Need 

piof  =  proportion  of  subjects 
expected  to  be  lost  to 
follow-up 

N  =  study  size  with  full  follow-up 
Then 

NLOFadj  =  N/(l  -  Pl0f) 


EXAMPLE 


N  =  270,  plot  =  0.25 
11 

NLOFadj  =  270/(1  -  0.25)  =  360 


The  table  on  the  left  illustrates,  using  the  colorectal 
cancer  example,  how  the  study  size  required 
(N)  varies  depending  on  the  duration  periods 
of  accrual  and  follow-up.  This  table  indicates 
that  a  longer  follow-up  time  has  a  larger 
impact  on  sample  size  than  a  longer  accrual 
time. 

For  example,  for  an  accrual  period  of  1 2  months, 
the  sample  size  needed  is  reduced  from  206  sub¬ 
jects  to  186  subjects  as  the  follow-up  period  is 
increased  from  12  to  18  months.  However,  for  a 
follow-up  period  of  12  months,  the  sample  size 
needed  is  reduced  from  206  subjects  to  only  196 
subjects  as  the  accrual  period  is  increased  from 
12  to  18  months. 


In  large  scale  clinical  trials,  it  is  usually  difficult 
to  retain  complete  follow-up  on  all  subjects.  Inev¬ 
itably,  a  proportion  of  subjects  will  leave  the 
study  before  experiencing  an  event.  Some  rea¬ 
sons  include  personal  circumstances  (moving  to 
another  district/city/state/country),  experiencing 
unacceptable  toxicity  from  the  therapy,  loss  of 
interest,  or  receiving  a  newer  therapy. 

As  more  subjects  leave  the  study,  the  number  of 
events  will  be  reduced  and  the  power  of  the 
study  will  be  compromised.  This  can  be  partially 
adjusted  for  by  increasing  the  sample  size  in  an 
attempt  to  maintain  the  number  of  events. 

We  can  adjust  for  loss  to  follow-up  in  sample 
size  calculations  if  we  have  an  estimate  from 
previous  studies  of  the  proportion  of  subjects 
expected  to  be  lost  to  follow-up  (pi0f).  We 
also  need  the  study  size  required  with  full 
follow-up  (N). 

Then  the  adjusted  sample  size  (NLOFadj)  that 
accounts  for  the  loss  to  follow-up  is  obtained 
by  dividing  N  by  (1  -  piof). 

For  example,  in  a  study  of  270  subjects,  it  is 
observed  during  the  study  that  25%  of  subjects 
are  lost  to  follow-up.  To  ensure  that  the  study 
power  is  maintained,  the  sample  size  would 
need  to  be  increased  to  360  subjects. 
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Cross-over  (drop-in/drop-out) 
adjustments 

One  approach:  Intention-to-treat 
(ITT)  principle- 

subjects  analyzed  according  to  the 
originally  assigned  treatment  group 


In  randomized  controlled  trials,  the  analysis 
is  typically  based  on  the  Intention-to-Treat 
(ITT)  principle.  This  principle  states  that  sub¬ 
jects  will  be  analyzed  to  the  treatment  group 
to  which  they  were  originally  assigned  at  the 
time  of  randomization. 


No  ITT  =>  randomization  =>  confounding 


Not  adhering  to  the  ITT  principle  would  break 
the  randomization  to  some  extent,  which  could 
potentially  lead  to  a  confounded  adjusted 
estimate. 


Another  analysis  approach:  Adjust 
for  crossovers 

Crossover:  a  subject  who  changes 
from  T  to  C  or  from  C  to  T  during 
the  trial 
where 

C  =  control  group, 

T  =  treatment  group 

T  to  C  =>■  drop-out 
C  to  T  =>•  drop-in 

N  =  original  total  sample  size  (w/o 
adjusting  for  cross-overs) 
dc  =  proportion  of  (drop-outs)  from 
T  to  C 

dt  =  proportion  of  (drop-ins)  from 
C  to  T 


Another  approach  to  the  analysis  is  to  adjust 
for  cross-over  from  one  treatment  group  to 
another. 

If  a  subject  changes  groups  during  the  trial, 
from  either  treatment  (T)  to  control  (C)  or 
from  control  to  treatment,  we  say  that  this 
subject  is  a  “cross-over”.  A  change  from  T  to  C 
is  called  a  “drop-out”  and  a  change  from  C  to 
is  a  drop-m. 

Let  N  denote  the  originally  calculated  sample 
size  prior  to  considering  the  possibility  of  cross¬ 
overs.  Also,  let  dc  and  dt  be  the  potential  pro¬ 
portion  of  subjects  who  drop  into  the  control 
and  intervention  groups;  these  are  guestimates. 


Sample-size  formula  adjusted 
for  ITT: 


NlTTadj 


N 

(1-dr  -d,)2 


Then  it  can  be  shown  that  for  an  intention- 
to-treat  analysis,  to  maintain  the  study  power, 
the  total  study  size  should  be  adjusted  to  give  a 

sample  size  inflation  factor.  This  adjustment 
formula  (NITTadj)  is  shown  at  the  left. 


Sample  Size  Inflation  factor 


Drop-out  rate 

Drop-in  rate 

0%  1%  5% 

10% 

15% 

20% 

0% 

1 

1.02 

1.11 

1.23 

1.38 

1.56 

1% 

1.02 

1.04 

1.13 

1.26 

1.42 

1.60 

5% 

1.11 

1.13 

1.23 

1.38 

1.56 

1.78 

10% 

1.23 

1.26 

1.38 

1.56 

1.78 

2.04 

15% 

1.38 

1.42 

1.56 

1.78 

2.04 

2.37 

20% 

1.56 

1.60 

1.78 

2.04 

2.37 

2.78 

The  table  at  the  left  gives  the  sample  size  infla¬ 
tion  required  for  different  levels  of  drop- 
ins/ 'drop-outs  originally  assigned  at  the  time  of 
randomization.  Thus,  subjects  which  receive 
the  opposite  treatment  to  which  they  were  ran¬ 
domized  will  (in  general)  dilute  the  effect  seen 
by  the  intervention  due  to  this  cross-over. 
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EXAMPLE 


Exercise  i - ~ Disease  progression  in 

regimen  ' - - - 1/  colorectal  cancer  patients 

Patients  who  stop  exercise:  drop-outs 
Patients  who  start  exercise:  drop-ins 

N  =  600,  dc  =  0.05,  d,  =  0.15 

T 

NiTTadj  =  600/(1  -  0.05  -  0.15)2 
=  600  x  1.56  =  936 


As  an  example,  we  consider  a  study  investigating 
the  impact  of  exercise  on  time  to  disease  pro¬ 
gression  in  colorectal  cancer  patients,  rando¬ 
mized  to  an  exercise  regimen. 

Those  patients  who  stop  exercising  are  dropping 
out  of  the  exercise  arm  (into  the  control  arm). 
Those  who  commence  an  exercise  program  are 
dropping  into  the  exercise  arm  (out  of  the 
control  arm). 

Suppose  the  study  size  originally  required  600 
subjects,  with  5%  expected  to  drop  out  of  the 
intervention  arm,  and  15%  expected  to  drop 
into  the  intervention  arm.  Then,  using  ITT,  the 
sample  size  (see  the  table  on  the  left)  needs  to 
be  increased  to  936  subjects. 


VI.  Summary 

Issues: 


A  summary  of  this  presentation  is  now  provided. 
We  have  addressed  several  issues,  which  are 
listed  on  the  left. 


•  How  many  events  to  expect? 

•  How  many  subjects? 

•  Impact  of  length  of  accrual  and 
follow-up  periods? 

•  Adjusting  for  loss  of  follow-up? 

•  What  to  do  if  subjects  switch 
therapies? 


Key  statistical  concepts: 

H0,  Ha,  P-value, 

Type  I  error  (a),  Type  II  error  ( (S ) , 
Power  =  1  — (3,  effect  size  (A). 


We  began  by  reviewing  key  statistical  concepts, 
including  describing  the  terms  null  hypothesis 
(H0),  alternative  hypothesis  (HA),  P-value,  Type  I 
error  (a),  Type  II  error  ((3),  Power  =1  —  0,  and 
effect  size  (A). 


A  can  be  0j  —  0O  or  0i/0o  where  0j 
are  hazard  rates  (X,;),  survival 
probabilities  (SO,  or  mean 
survival  times  (mO 


accrual  period  follow-up  (F) 


time  of  entry  into  study 


The  effect  size  (A)  can  be  either  in  the  form  of 
difference  or  ratio  of  hazard  rates,  survival 
probabilities,  or  median  survival  times. 

We  consider  randomized  trials  in  which  subjects 
are  accrued  for  a  period  of  A  (days,  months, 
years)  time  units,  and  then  followed  for  a  further 
period  of  F  time  units.  For  each  subject,  we 
obtain  information  on  whether  or  not  the  event 
of  interest  had  occurred,  the  time  from  study 
entry  (X)  to  the  occurrence  of  the  event  or  cen¬ 
soring,  and  the  treatment  group  allocation. 
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Process  for  Determining  N 

Step  1 :  Determine  NEV  using  a, 
1-p,  A 

Step  2:  Determine  N=  Nev/pev 
where 

pEV  =  Pr(study  participant  gets 
event) 


When  determining  the  sample  size  and/or 
power,  we  typically  first  determine  the  expected 
(i.e.,  required)  number  of  events  (NEV),  after 
which  we  then  determine  the  study  size 
required  (N). 

The  parameters  a,  p,  and  A  are  needed  to  deter¬ 
mine  Nev,  which  then  gets  “extrapolated”  to  N 
by  dividing  by  (a  guestimate  of)  the  probability 
that  a  study  participant  gets  an  event  (pEV). 


Formula  for  NEV: 

(zl-a/2  +  zl-/s)(A+  1)'' 


NEv 


(A-l) 


•(1) 


Assumes:  Nj  =  N0,  both  large, 
PH  assumption 


Power  =  Pr(Z  <  zEv) 


'(2) 


where 


zev  = 


Given  a,  p,  and  A,  the  required  number  of  events 
(Nev)  for  a  trial  involving  equal  study  sizes  in 
two  groups  is  given  by  formula  (1)  shown  at  the 
left.  This  formula  assumes  that  the  sample  sizes 
(N,  and  N0)  in  each  group  are  large  enough  to 
use  a  normal  approximation.  The  formula  also 
assumes  that  the  PH  assumption  is  satisfied, 
i.e.,  A  is  constant  over  time. 

The  power  corresponding  to  a  specified  num¬ 
ber  of  events  (NEV)  can  be  obtained  using  for¬ 
mula  (2),  where  P(Z  <  zEV)  is  the  cumulative 
probability  below  the  value  zEV  of  the  standard 
normal  distribution. 


Unequal  allocations:  R=N!/N0 
where  N^Nq 


nev  = 


(zl— a/2  +  zl-/?)  (RA  +  l)'' 


Vr(a-  i) 


•(3) 


The  formula  for  NEV  can  be  modified  for 
unequal  sample  sizes  as  shown  at  the  left. 


Formulae  for  N:  Two  versions 

Both  versions  assume 
R=Ni/N0  where  N=N!+N0 
and  extrapolate  from  NEV  to  N 
using  N  =  NEV/pEv 
where  pEV  =  Pr( event  occurs) 


There  are  two  approaches  to  determining  the 
total  sample  size  N  and  the  corresponding 
sample  sizes  Nj  and  N0  in  groups  1  and  0, 
respectively,  for  R— N|/N0  where  N)  +N0  =N. 
Both  formulae  are  determined  by  extrapolating 
from  NEV  to  N  by  dividing  by  (a  guestimate  of) 
the  probability  that  a  study  participant  gets  an 
event  (pEV). 
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Formula  using  MF  =  A/2  +  F  (i.e., 
median  FU  time) 


The  formula  shown  at  the  left  computes  pEV 
making  use  of  the  median  follow-up  time  (MF) 
for  all  study  subjects,  where  MF  =  A/2  +  F,  with 
A  denoting  the  accrual  period  and  F  denoting 
the  follow-up  period  for  the  study. 


Formula  assuming  exponential  The  second  formula  shown  here  computes  pEv 
survival  Sj(t)  =  exp(-/Ht),  i=0,l,  making  use  of  guestimates  of  the  two  para- 
and  guestimates  Ai(  S;  or  m;.  meters  being  compared  to  define  the  effect  size 

of  interest.  These  two  parameters  can  be  hazards 
0-\,  i=0,l),  survival  probabilities  (S;,  i  =  0,1)  or 
median  survival  times  (m:,  i=0, 1)  for  each  group. 

Even  if  the  parameters  being  guestimated 
are  Si7  or  m ; ,  rather  than  /H,  one  can  transform 
the  guestimate  into  a  corresponding  value  for 
}.j  if  an  exponential  survival  distribution  is 
assumed.  In  particular,  if  the  guestimates  are 
median  survival  times  (mj),  it  follows  that 
>4  =ln  2/mj. 

Nj  =  [R/(R+1)]N  and  N0  =  Nh/2  To  obtain  Nj  and  N0,  from  N  and  R,  the  for¬ 
mulae  shown  at  the  left  can  be  used. 


Other  issues:  Three  other  issues  were  also  considered  as 

shown  at  the  left. 

•  Choosing  accrual  and  follow¬ 
up  times; 

•  Adjusting  for  loss  to  follow-up; 

•  Adjusting  for  cross-over  (drop- 
in/drop-out) 

Accrual  versus  follow-up  periods:  There  needs  to  be  a  balance  between  the  lengths 
A  =  accrual  period,  F  =  follow-up  of  accrual  and  follow-up  periods.  The  cost 
period  of  accruing  subjects  into  a  study  is,  in  general, 

far  greater  than  the  cost  of  follow-up.  In  partic- 

•  Need  balance  between  lengths  ular,  a  longer  follow-up  time  has  a  larger  impact 
of  accrual  and  follow-up  periods  on  sample  size  than  a  longer  accrual  time 

•  Cost  of  Accrual  >>  Cost  of  (illustrated  by  example), 

follow-up 

Adjusting  for  loss  to  follow-up:  We  can  adjust  for  possible  loss  to  follow-up  in 
Need  sample  size  calculations  if  we  have  an  estimate 

plof  =  proportion  of  subjects  from  previous  studies  of  the  proportion  of  sub¬ 
expected  to  be  lost  to  follow-up  jects  expected  to  be  lost  to  follow-up  (pi0f)- 

N  =  study  size  with  full  follow-up 

- - -  The  adjusted  sample  size  (NLOFadj)  that  accounts 

NLOFadj  —  N/(l  ~  Plof)  for  ]oss  |^()  follow-up  is  obtained  by  dividing  N 

by  (1-pioi). 
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Cross-over  (drop-in  or  drop-out)  The  typical  approach  to  analysis  when  study 
problems:  subjects  drop-in  or  drop-out  of  study  groups 

originally  assigned  is  the  Intention-to-Treat 
Typical  approach  to  analysis  uses  principle  (ITT):  analyze  subjects  according  to 
the  ITT  principle:  their  original  treatment  group  allocation, 

subjects  analyzed  according  to  the 
originally  assigned  treatment  group 

Another  analysis  approach: 

Adjust  for  crossovers 

N  =  original  total  sample  size 

(w/o  adjusting  for  cross-overs) 
dc  =  proportion  of  (drop-outs) 
from  T  to  C 

dt  =  proportion  of  (drop-ins)  from 
C  to  T 

Sample-size  formula  adjusted  Then  it  can  be  shown  that  for  an  intention- 
for  ITT:  to-treat  analysis,  to  maintain  the  study  power, 

the  total  study  size  should  be  adjusted  to  give  a 
sample  size  inflation  factor.  This  adjustment 
formula  (NITTadj)  is  shown  at  the  left. 


Another  approach  to  this  problem  involves 
adjusting  the  sample  size  for  potential  cross¬ 
overs.  An  adjustment  formula  can  be  derived 
by  defining  dc  and  dt  as  the  potential  propor¬ 
tion  of  subjects  who  drop  into  the  control  and 
intervention  groups,  respectively. 
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Detailed 

Outline 


I.  Introduction:  Background  to  Statistical 
Decisions  (pages  500-501) 

A.  Issues  addressed  in  this  chapter 

1 .  How  many  subjects? 

2.  Accrual  period  for  recruitment? 

3.  Follow-up  period  after  accrual? 

4.  What  to  do  if  subjects  switch  therapies? 

B.  Review  of  statistical  concepts 

1.  List  of  concepts:  H0,  HA,  P-value,  Type  I 
eiTor  (a),  Type  II  error  (p),  Power  =  1  — p, 
effect  size  (A). 

2.  A  can  be  9i  -  0O  or  0i/0o  where  9i  are  hazard 
rates  (ifr),  survival  probabilities  (Si),  or  mean 
survival  times  ( m ; ) 

II.  Design  Considerations:  Time-to-Event  Out¬ 
comes  (pages  502-503) 

A.  Randomized  trials  scheduling:  Subjects  are 

1.  accrued  for  a  period  of  A  (days,  months, 
years)  time  units 

2.  followed  for  a  further  period  of  F  time  units. 

3.  observed  as  to  whether  or  not  the  event  or 
censoring  occurred  over  follow-up  from 
study  entry  (X) 

4.  randomly  allocated  into  treatment  groups 

B.  Study  Size  (N)  versus  Number  of  Events  (NEV) 

1 .  N=  total  #  of  study  participants 

2.  Nev  =  #  of  participants  who  get  event  during 
the  study  period 

3.  Process  for  determining  N  in  time-to-event 
studies: 

Step  i:  Determine  NEV  using  a,  1  — P,  A 
Step  ii:  Determine  N  from  NEV  using 
extrapolation  formula 

N  =  NEv/pEv  where 

pEV  =  Pr  (study  participant  gets  event) 

4.  In  practice:  determine  N  first,  since  NEV 
occurs  after  study  begins 

5.  Determine  N  from  NEV  using  assumptions  on 

i.  study  duration 

ii.  distribution  of  event  times. 


III.  Determining  the  Required  Number  of  Events 

(NEV)  (pages  504-505) 

A.  Formula  for  NEV  (Assumes:  N]  =  N0,  both  large, 
PH  assumption): 


NEv 


/  (zl— a/2  +  zl-/s)  +  1) 

V  (a^I) 


2 


(1) 
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B.  Formula  for  Power,  given  NEV,  a,  and  A: 

Power  =  Pr(Z  <  zEv) . (2) 

where  zEV=  v/Nev(stt)  -  z,_a/2  andZ~N(0,l) 

C.  Formula  for  NEV  (Allows  for  unequal  sample 
sizes  Nj  and  N0  where  N  =  Nj  +  N0  and 

R  =  Nj/Nq): 

(zl-a/2  +  Zi_/j)(RA  +  1)\ 


NEv  = 


Vr(a-  i) 


(3) 


D.  Examples  for  a  =  0.05,  1  — p  =  0.80, 

A  (=  mi/m2)  =  2:  R=1  =>■  NEV=71; 

R=2  =►  Nev=98 

IV.  Determining  the  Required  Total  Number  of  Par¬ 
ticipants  (N)  (pages  505-511) 

A.  Two  versions,  each  assume: 

1 .  R=N!/N0  where  N  =  N^Nq 

2.  N  is  extrapolated  from  NEV  to  N  using 

N  =  NEV/pEv  where  pEV  =  Pr( event  occurs). 
Version  1:  Uses  MF  =  A/2  +  F  (i.e.,  median  FU 
time) 

1 .  Any  survival  distribution: 

N  = _ Nev _ 

R+  j  I1  “  Si(Mf)}  +R+  1  {1  -  S0(Mp)} 

2.  Special  case:  assumes  exponential  survival 
Sj(t)  =  exp(-Zjt),  i=0,l: 

NEv 


B. 


N  = 


R 


1 


_{l_e-*MP}+_JL_{i_e-^} 


R  +  1  1  J  R 

C.  Version  2:  Uses  guestimates  for  hazard  func¬ 
tions  /.j,  i=0,l  and  assumes  exponential  survival 
Sj(t)  =  exp(— Zjt),  where  U  =  constant  hazard  for 
group  i=0,l. 

NEv 


N  = 


R 


R  +  1 


where 


Pevi 


1 


1 


R  +  1 


Pevo 


e-tiF  _  e-ti(A+F) 


Pew  -  1  ~  [ 

D.  Formula  for  obtaining  Nt  and  N0  from  N  and  R: 
N,  =  [R/(R+1)]N  and  N0  =  Nj/2. 
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E.  Example  using  A=2,  F=4.  a=0.05,  (3=0.10, 
A  =  A0/A,  =2,  R=l. 

1.  Version  1:  MF=  5,  N«  308 

2.  Version  2:  =  0.10,  ^  =  0.05, 

Pevi  =  0.2207,  pEV0  =  0.3925,  N  w  309 

F.  Second  Example:  A=15,  F=12,  a=0.05,  (3=0.10, 
m0  =  6,  mj  =  10,  R=2. 

1 .  Using  version  2,  must  change  m;  to  using 
the  relationship  )w  =  In  2/nij. 

2.  m0  =  6  =>  =  0.1155;  mt  =  10  => 

Xi  =  0.0693. 

3.  Computed  NEV  «  222. 

4.  Using  version  2,  computed  pEvi=0-7293, 
PevO=0.8812,  and  N  «  285. 

5 .  Obtaining  N ,  and  N0  from  N  and  R:  Nj = 1 90, 
N0=95. 

V.  Other  Design  Issues  ( pages  511-514) 

A.  Three  issues: 

•  Choosing  accrual  and  follow-up  times 

•  Adjusting  for  loss  to  follow-up 

•  Adjusting  for  cross-over  (drop-in  and/or 
drop-out) 

B.  Accrual  versus  follow-up  times 

1 .  Need  balance 

2.  Cost  of  accrual  >>  Cost  of  follow-up 

3.  (Colorectal  cancer)  Example:  longer  follow¬ 
up  time  has  >  impact  on  sample  size  than 
longer  accrual  time. 

C.  Adjusting  for  loss  to  follow-up. 

1.  Formula:  NLOFadj  =  N/(l  — pi(,i)  where 

piof  =  proportion  of  subjects  expected  to  be 
lost  to  follow-up. 

2.  Example:  N=270,  piof  =  0.25  NLOFadj  =  360. 
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D.  Adjusting  for  crossovers  (drop-in/drop-out). 

1.  Typical  analysis  approach:  Intention-  to- 
Treat  (ITT);  analyze  what  you  randomize 

2.  Can  adjust  sample  size  to  allow  for  cross¬ 
overs. 

a.  Formula:  NITTadj  =  N/(l  -  dc  —  dt)2, 
where 

N  =  original  total  sample  size 
dc  =  proportion  of  (drop-outs)  from  T 
to  C 

dt  =  proportion  of  (drop-ins)  from  C  to  T 

b.  Table  of  sample  size  inflation  factors  for 
different  combinations  of  dc  and  dt. 

i.  Subjects  receiving  opposite  treat¬ 
ment  from  randomized  allocation 
"dilute”  effect  of  intervention  unless 
adjusted  for. 

c.  Example  (Exercise  regimen  effect  on 
colorectal  cancer?):  N=600,  dc  =  0.05, 
dt  =  0.15  =>  NiTTadj  =  936. 

VI.  Summary  (pages  514-517) 


Answer  questions  1-10  as  true  or  false  (circle  T  or  F) 

T  F  1.  In  a  randomized  trial  comparing  2  groups,  the 
null  hypothesis  (H0)  assumes  that  the  survival 
experience  in  the  groups  is  different. 

T  F  2.  The  effect  size  is  typically  determined  in  the  form 
of  a  difference  of  hazard  rates. 

T  F  3.  Suppose  that  a  time-to-event  study  is  designed  to 
determine  if  there  is  reduction  in  annual  hazard 
rates  from  5%  to  2.5%.  Then  the  effect  size  of 
interest  defined  in  terms  of  a  ratio  measure  is  2. 

T  F  4.  The  P- value  is  the  probability  that  the  null 
hypothesis  is  true  given  the  observed  study  data. 

T  F  5.  Consider  a  two-group  randomized  clinical  trial, 
in  which  the  randomization  process  is  not  sys¬ 
tematically  flawed.  Then  the  use  of  a  P-value  to 
compare  the  distributions  of  a  known  risk  factor 
(e.g.,  gender)  in  each  group  is  uninformative. 

T  F  6.  When  determining  sample  size  for  a  time-to- 
event  study,  one  typically  first  determines  the 
expected  (i.e.,  required)  number  of  events  (NEV) 
prior  to  determining  the  study  size  required  (N). 
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T  F  7.  When  carrying  out  a  time-to-event  study,  one 
typically  recruits  the  study  participants  prior  to 
observing  the  number  of  events  that  actually 
occur. 

T  F  8.  A  formula  for  the  total  sample  size  (N)  for  a  time- 
to-event  study  can  be  derived  by  multiplying  the 
required  number  of  events  (NEV)  by  the  probabil¬ 
ity  (pEv)  that  a  study  subject  will  get  an  event 
since  study  entry,  i.e.,  N  =  NEV  x  pEV- 
T  F  9.  Suppose  the  allocation  ratio  (R  =  Nj/No)  for  sub¬ 
jects  into  two  treatment  groups  is  1/2,  and  it  is 
determined  that  the  total  study  size  (N)  required 
is  300.  Then  Nj  =  100  and  N0  =  200. 

T  F  10.  Suppose  that  it  is  determined  that  the  total  study 
size  required  (N)  is  300  without  considering  the 
possibility  of  loss  to  follow-up  during  the  study. 
If,  nevertheless,  the  investigators  expect  that  20% 
of  study  subjects  would  be  lost  to  follow-up,  Then 
to  adjust  for  possible  loss  to  follow-up,  the  sam¬ 
ple  size  should  be  increased  to  1500  subjects. 

Consider  a  randomized  trial  in  which  the  accrual  period 

(A)  is  2  years  and  the  follow-up  period  (F)  is  3  years. 

Suppose  further  that  the  allocation  ratio  (R)  is  2,  a=.05, 

P=.20,  and  the  study  aim  is  to  increase  the  median  survival 

time  (m,)  in  the  control  group  from  1.5  years  to  2.2  years  in 

the  intervention  group. 

1 1 .  Assuming  an  exponential  survival  rate  in  both  groups, 
transform  the  given  median  survival  times  (mi)  to 
corresponding  hazard  rates  (A.j). 

12.  Based  on  your  answer  to  question  11,  what  is  A? 

13.  Determine  the  number  of  events  (NEV)  required  for 
the  study. 

14.  Determine  the  probability  of  getting  an  event  (pEvi) 
for  each  group,  i=0,l. 

15.  Determine  the  total  sample  size  (N)  required  for  this 
study. 

16.  Determine  the  sample  sizes  in  each  group  (i.e.,  N0 
and  NO- 

17.  Based  on  your  answer  to  question  14,  how  would 
you  adjust  your  total  sample  size  to  consider  loss-to- 
follow-up  of  25%? 

18.  Based  on  your  answer  to  question  17,  determine  the 
sample  sizes  required  in  each  group  (i.e.,  N0  and  NO- 

19.  Based  on  your  answer  to  question  17,  how  would  you 
adjust  your  total  sample  size  to  consider  cross-over 
percentages  of  dc  =  .05  and  dt  =0.10? 

20.  Based  on  your  answer  to  question  17,  determine  the 
sample  sizes  required  in  each  group  (i.e.,  N0  and  NO. 
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Consider  a  randomized  trial  in  which  the  accrual  period  (A) 
is  2  years  and  the  follow-up  period  (F)  is  2  years.  Suppose 
further  that  the  allocation  ratio  (R)  is  2,  a=.05,  (3=.10,  and 
the  study  aim  is  to  reduce  hazard  rate  (/.;)  in  the  control 
group  from  10%  to  5%  in  the  intervention  group. 

1.  Use  Formula  1  (see  detailed  outline)  to  determine  the 
required  number  of  events  (NEV),  total  study  size  (N), 
and  sample  sizes  required  in  each  group  (i.e.,  N0  and  N^. 

2.  Use  Formula  2  (see  detailed  outline)  to  determine  the 
required  number  of  events  (NEV),  total  study  size  (N), 
and  sample  sizes  required  in  each  group  (i.e.,  N0  and  Nj). 

3 .  How  do  the  total  sample  sizes  (N)  obtained  in  questions  1 
and  2  compare? 

4.  Based  on  your  answer  to  question  2,  how  would  you 
adjust  your  total  sample  size  to  consider  loss-to-follow- 
up  of  25%? 

5.  Based  on  your  answer  to  question  4,  determine  the 
sample  sizes  required  in  each  group  (i.e.,  N0  and  NO. 

6.  Based  on  your  answer  to  question  4,  how  would  you 
adjust  your  total  sample  size  to  consider  cross-over 
proportions  of  dc  =  .05  and  dt  =  0.10? 

7.  Based  on  your  answer  to  question  6,  determine  the 
sample  sizes  required  in  each  group  (i.e.,  N0  and  NO. 

8.  Using  the  total  study  size  (N)  calculated  in  question  6, 
these  study  subjects  will  need  to  be  recruited  at  an 
accrual  rate  of  r=N/A  per  year,  where  A=2.  If  this 
accrual  rate  is  not  feasible,  i.e.,  you  couldn’t  find  r 
subjects  per  year,  how  can  you  adjust  your  sample 
size  to  make  your  study  feasible? 


1.  F:  H0  assumes  that  the  survival  experience  in  the  two 
groups  is  the  same. 

2.  F:  The  effect  size  can  be  either  in  the  form  of  a  differ¬ 
ence  or  ratio  of  hazard  rates,  survival  probabilities,  or 
median  survival  times. 

1.  T 

2.  F:  The  P-value  is  the  probability  of  obtaining  a  result 
as  or  more  extreme  than  the  result  obtained  in  one’s 
study  given  that  the  that  the  null  hypothesis  is  true. 

3.  T 

4.  T 

7.  T 

8.  F:  N  =  NEV/pEv- 

9.  T 
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10. 

11. 

12. 

13. 

14. 


15. 


16. 

17. 

18. 

19. 

20. 


F:  The  sample  size  should  be  increased  to  375  sub¬ 
jects. 

Using  the  formula  )w  =  In  2/m;,  i=0, 1 ,  we  obtain 
m0  =  1.5  =>  =  0.4621;  mp  =  2.2  =>  A,t  =  0.3151. 

A  =  Xq/IX  i  =  0.4621/0.3151  =  1.467. 


XT  f  (1.96  +  0.84)  (2(1 .467 )  +  1) 

EV_y  V5(1.467-l) 
rounds  to  130  required  events. 
Using  Formula  2,  we  obtain 

Pevi  =  1  ~ 


=  129.91,  which 


(0.3 15 1)  (2) 

=  1  —  0.2883  =  0.7117 
1 


e— (0.3151)(3)  _  e-(0.3151)(2+3) 


Pevo  —  1 


(0.462 1 ) (2)  L 
=  1  —  0.1632  =  0.8368 


e  (0.4621)(3)  _  e-(0.4621)(2+3) 


N  = 


130 


^(0.7117)  + +^(0.8368) 
=  172.55  «  173 


Nj  =  [2/(2-4-1)1173  =  115.33  w  115  and 

N0  =  115.33/2  =  57.67  «  58 

NLOFadj  =  173/(1  -  0.25)  =  230.67  w  231 

Nj  =  [2/(2-4-1)1231  =  154  and  N0  =  154/2  =  77 

NiTTadj  =  231/(1  -  0.05  — 0.10)2  =  319.72  w  320 

Nj  =  [2/(2+l)]320  =  213.33  «  213  and  N0  =  213.33/2  = 

106.665  w  107 
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In  this  appendix,  we  provide  examples  of  computer 
programs  for  carrying  out  the  survival  analyses  described 
in  this  text.  This  appendix  does  not  give  an  exhaustive 
survey  of  all  computer  packages  currently  available,  but 
rather  is  intended  to  describe  the  similarities  and  differ¬ 
ences  among  four  of  the  most  widely  used  packages.  The 
software  packages  that  we  describe  are  Stata  (version 
10.0),  SAS  (version  9.2),  SPSS  (PASW  18),  and  R.  A  com¬ 
plete  description  of  these  packages  is  beyond  the  scope  of 
this  appendix.  Readers  are  referred  to  the  built-in  help 
functions  for  each  program  for  further  information. 


Datasets 

Most  of  the  computer  syntax  and  output  presented  in  this 
appendix  are  obtained  from  running  step-by-step  survival 
analyses  on  the  “addicts”  dataset.  The  other  dataset  that  is 
utilized  in  this  appendix  is  the  "bladder  cancer”  dataset  for 
analyses  of  recurrent  events.  The  "addicts”  and  “bladder 
cancer”  data  are  described  below  and  can  be  downloaded 
from  our  website  at  http://www.sph.emory.edu/dkleinb/ 
surv3.htm.  On  this  website,  we  also  provide  many  of  the 
other  datasets  that  have  been  used  in  the  examples  and 
exercises  throughout  this  text.  The  data  on  our  website  are 
provided  in  five  forms  (1)  as  Stata  datasets  (with  a  .dta 
extension),  (2)  as  SAS  datasets  (with  a  .sas7bdat  exten¬ 
sion),  (3)  as  SPSS  datasets  (with  a  .sav  extension),  (4)  as 
R  datasets  (with  an  .rda  extension),  and  (5)  as  text  datasets 
(with  a  .dat  extension). 

Addicts  Dataset  (addicts.dat) 

In  a  1991  Australian  study  by  Caplehorn  et  al.,  two  metha¬ 
done  treatment  clinics  for  heroin  addicts  were  compared 
to  assess  patient  time  remaining  under  methadone  treat¬ 
ment.  A  patient’s  survival  time  was  determined  as  the  time 
(in  days)  until  the  person  dropped  out  of  the  clinic  or  was 
censored.  The  two  clinics  differed  according  to  its  live-in 
policies  for  patients.  The  variables  are  defined  as  follows: 

ID  -  Patient  ID 

SURVT  -  The  time  (in  days)  until  the  patient  dropped  out 
of  the  clinic  or  was  censored 

STATUS  -  Indicates  whether  the  patient  dropped  out  of  the 
clinic  (coded  1)  or  was  censored  (coded  0) 

CLINIC  -  Indicates  which  methadone  treatment  clinic  the 
patient  attended  (coded  1  or  2) 

PRISON  -  Indicates  whether  the  patient  had  a  prison 
record  (coded  1)  or  not  (coded  0) 

DOSE  -  A  continuous  variable  for  the  patient’s  maximum 
methadone  dose  (mg/day) 
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Bladder  Cancer  Dataset  (bladder.dat) 

The  bladder  cancer  dataset  contains  recurrent  event  out¬ 
come  information  for  eighty-six  cancer  patients  followed 
for  the  recurrence  of  bladder  cancer  tumor  after  transure¬ 
thral  surgical  excision  (Byar  and  Green  1 980).  The  exposure 
of  interest  is  the  effect  of  the  drug  treatment  of  thiotepa. 
Control  variables  are  the  initial  number  and  initial  size  of 
tumors.  The  data  layout  is  suitable  for  a  counting  processes 
approach.  The  variables  are  defined  as  follows: 

ID  -  Patient  ID  (may  have  multiple  observations  for  the 
same  subject) 

EVENT  -  Indicates  whether  the  patient  had  a  tumor 
(coded  1)  or  not  (coded  0) 

INTERVAL  -  A  counting  number  representing  the  order  of 
the  time  interval  for  a  given  subject  (coded  1  for  the 
subject’s  first  time  interval,  coded  2  for  a  subject’s 
second  time  interval,  etc.) 

START  -  The  starting  time  (in  months)  for  each  interval 
STOP  -  The  time  of  event  (in  months)  or  censorship  for 
each  interval 

TX  -  Treatment  status  (coded  1  for  treatment  with  thiotepa 
and  0  for  the  placebo) 

NUM  -  The  initial  number  of  tumors 

SIZE  -  The  initial  size  (in  centimeters)  of  the  tumor 


Software 

What  follows  is  a  detailed  explanation  of  the  code  and 
output  necessary  to  perform  the  type  of  survival  analyses 
described  in  this  text.  The  rest  of  this  appendix  is  divided 
into  four  broad  sections,  one  for  each  of  the  following 
software  packages: 

A.  Stata 

B.  SAS 

C.  SPSS 

D.  R  Software 


Each  of  these  sections  is  self-contained,  allowing  the 
reader  to  focus  on  the  particular  statistical  package  of  his 
or  her  interest. 


A.  Stata 

Analyses  using  Stata  are  obtained  by  typing  the  appropri¬ 
ate  statistical  commands  in  the  Stata  Command  window  or 
in  the  Stata  Do-file  Editor  window.  The  key  commands 
used  to  perform  the  survival  analyses  are  listed  below. 
These  commands  are  case  sensitive  and  lower-case  letters 
should  be  used. 
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stset  -  Declares  data  in  memory  to  be  survival  data.  Used  to 
define  the  "time-to-event”  variable,  the  “status”  variable, 
and  other  relevant  survival  variables.  Other  Stata  Com¬ 
mands  beginning  with  st  utilize  these  defined  variables, 
sts  list  -  Produces  Kaplan-Meier  (KM)  or  Cox-adjusted 
survival  estimates  in  the  output  window.  The  default  is 
KM  survival  estimates. 

sts  graph  -  Produces  plots  of  Kaplan-Meier  (KM)  survival 
estimates.  This  command  can  also  be  used  to  produce 
Cox-adjusted  survival  plots. 

sts  generate  -  Creates  a  variable  in  the  working  dataset 
that  contains  Kaplan-Meier  or  Cox  adjusted  survival 
estimates. 

sts  test  -  Used  to  perform  statistical  tests  for  the  equality  of 
survival  functions  across  strata, 
stphplot  -  Produces  plots  of  log-log  survival  against  the  log 
of  time  for  the  assessment  of  the  proportional  hazards 
(PH)  assumption.  The  user  can  request  KM  log-log  sur¬ 
vival  plots  or  Cox  adjusted  log-log  survival  plots, 
stcoxkm  -  Produces  KM  survival  plots  and  Cox  adjusted 
survival  plots  on  the  same  graph, 
stcox  -  Used  to  run  a  Cox  proportional  hazard  model,  a 
stratified  Cox  model,  or  an  extended  Cox  model  (i.e., 
containing  time  varying  covariates), 
stphtest  -  Performs  statistical  tests  on  the  PH  assumption 
based  on  Schoenfeld  residuals.  Use  of  this  command 
requires  that  a  Cox  model  be  previously  run  with  the 
command  stcox  and  the  schoenfeld()  option, 
streg  -  Used  to  run  parametric  survival  models. 

Four  windows  will  appear  when  Stata  is  opened.  These  win¬ 
dows  are  labeled  Stata  Command,  Stata  Results,  Review, 
and  Variables.  The  user  can  click  on  File  — >  Open  to  select 
a  working  dataset  for  analysis.  Once  a  dataset  is  selected, 
the  names  of  its  variables  appear  in  the  Variables  window. 
Commands  are  entered  in  the  Stata  Command  window.  The 
output  generated  by  commands  appears  in  the  Results  win¬ 
dow  after  the  return  key  is  pressed.  The  Review  window 
preserves  a  history  of  all  the  commands  executed  during 
the  Stata  session.  The  commands  in  the  Review  window 
can  be  saved,  copied,  or  edited  as  the  user  desires.  Command 
can  also  be  run  from  the  Review  window  by  double-clicking 
on  the  command.  Commands  can  also  be  saved  in  a  file  by 
clicking  on  the  log  button  on  the  Stata  tool  bar. 

Alternatively,  commands  can  be  typed,  or  pasted  into  the 
Do-file  Editor.  The  Do-file  Editor  window  is  activated  by 
clicking  on  Window  — *  Do-file  Editor  or  by  simply  clicking 
on  the  Do-file  Editor  button  on  the  Stata  tool  bar.  Com¬ 
mands  are  executed  from  the  Do-file  Editor  by  clicking  on 
Tools  — >  Do.  The  advantage  of  running  commands  from 
the  Do-file  Editor  is  that  commands  need  not  be  entered 
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and  executed  one  at  a  time  as  they  do  from  the  Stata 
Command  window.  The  Do-file  Editor  serves  a  similar 
function  as  the  program  editor  in  SAS.  In  fact,  by  typing 
#delim  in  the  Do-file  Editor  window,  the  semicolon 
becomes  the  delimiter  for  completing  Stata  statements 
(as  in  SAS)  rather  than  the  default  carriage  return. 

The  survival  analyses  demonstrated  in  Stata  are  as  follows: 

1 .  Estimating  survival  functions  (unadjusted)  and 
comparing  them  across  strata 

2.  Assessing  the  PH  assumption  using  graphical 
approaches 

3.  Running  a  Cox  PH  model 

4.  Running  a  stratified  Cox  model 

5.  Assessing  the  PH  assumption  with  a  statistical  test 

6.  Obtaining  Cox  adjusted  survival  curves 

7.  Running  an  extended  Cox  model 

8.  Running  parametric  models 

9.  Running  frailty  models 

10.  Modeling  recurrent  events 

The  first  step  is  to  activate  the  addicts  dataset  by  clicking 
on  File  — >  Open  and  selecting  the  Stata  dataset,  addicts, 
dta.  Once  this  is  accomplished,  you  will  see  the  command 
use  “addicts. dta”,  clear  in  the  Review  window  and 
Results  window.  This  indicates  that  the  addicts  dataset  is 
activated  in  Stata’s  memory. 

To  perform  survival  analyses,  you  must  indicate  which 
variable  is  the  “time-to-event”  variable  and  which  variable 
is  the  “status”  variable.  Rather  than  program  this  in  every 
survival  analysis  command,  Stata  provides  a  way  to  pro¬ 
gram  it  once  with  the  stset  command.  All  survival  com¬ 
mands  beginning  with  st  utilize  the  survival  variables 
defined  by  stset  as  long  as  the  dataset  remains  in  active 
memory.  The  code  to  define  the  survival  variables  for  the 
addicts  data  is  as  follows: 

stset  survt,  failure(status==l)  id(id) 

Following  the  word  stset  comes  the  name  of  the  “time-to- 
event”  variable.  Options  for  Stata  Commands  follow  a 
comma.  The  first  option  used  is  to  define  the  variable  and 
value  that  indicates  an  event  (or  failure)  rather  than  a 
censorship.  Without  this  option,  Stata  assumes  that  all 
observations  had  an  event  (i.e.,  no  censorships).  Notice 
that  two  equal  signs  are  used  to  express  equality.  A  single 
equal  sign  is  used  to  designate  assignment.  The  next  option 
defines  the  id  variable  as  the  variable,  ID.  This  is  unneces¬ 
sary  with  the  addicts  dataset  since  each  observation 
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represents  a  different  patient  (cluster).  However  if  there 
were  multiple  observations  and  multiple  events  for  a  single 
subject  (cluster),  Stata  can  provide  robust  variance  esti¬ 
mates  appropriate  for  clustered  data. 

The  stset  command  will  add  four  new  variables  to  the 
dataset.  Stata  interprets  these  variables  as  follows: 

_t  -  The  “time-to-event”  variable 

_d  -  The  “status  variable”  (coded  1  for  an  event  and  0  for 
a  censorship) 

_t0  -  The  beginning  "time  variable.”  All  observations  start 
at  time  0  by  default 

_st  -  Indicates  which  variables  are  used  in  the  analysis.  All 
observations  are  used  (coded  1 )  by  default 

To  see  the  first  10  observations  printed  in  the  output 
window,  enter  the  command: 

list  in  1/10 

The  command  stdes  provides  descriptive  information 
(output  below)  of  survival  time. 


stdes 

failure 

-d: 

status 

analysis  time 

-t: 

survt 

id : 

id 

1 


(. - per  subject 


Category 

total 

mean 

min 

median 

max 

no.  of  subjects 

238 

no.  of  records 

238 

1 

1 

1 

1 

(first)  entry  time 

0 

0 

0 

0 

(final)  exit  time 

402.5714 

2 

367.5 

1076 

subjects  with  gap 

0 

time  on  gap  if  gap 

0 

time  at  risk 

95812 

402.5714 

2 

367.5 

1076 

failures 

150 

.6302521 

0 

1 

1 

The  commands  strate  and  stir  can  be  used  to  obtain  inci¬ 
dent  rate  comparisons  for  different  categories  of  specified 
variables.  The  strate  command  lists  the  incident  rates  by 
CLINIC  while  the  stir  command  gives  rate  ratios  and  rate 
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differences.  Type  the  following  commands  one  at  a  time 
(output  omitted): 

strate  clinic 
stir  clinic 

For  the  survival  analyses  that  follow,  it  is  assumed  that  the 
command  stset  has  been  run  for  the  addicts  dataset,  as 
demonstrated  on  the  previous  page. 


1 .  ESTIMATING  SURVIVAL  FUNCTIONS 
(UNADJUSTED)  AND  COMPARING 
THEM  ACROSS  STRATA 

To  obtain  Kaplan-Meier  survival  estimates  use  the  com¬ 
mand  sts  list.  The  code  and  output  follow: 

sts  list 


failure  -d:  status  ==  1 
analysis  time  -t:  survt 
id :  id 


Time 

Beg . 
Total 

Fail 

Net 

Lost 

Survivor 

Function 

Std. 

Error 

[95%  Conf 

.  Int.] 

2 

238 

0 

2 

1.0000 

7 

236 

1 

0 

0.9958 

0.0042 

0.9703 

0.9994 

13 

235 

1 

0 

0.9915 

0.0060 

0.9665 

0.9979 

17 

234 

1 

0 

0.9873 

0.0073 

0.9611 

0.9959 

19 

233 

1 

0 

0.9831 

0.0084 

0.9555 

0.9936 

26 

232 

1 

0 

0.9788 

0.0094 

0.9499 

0.9911 

28 

231 

0 

2 

0.9788 

0.0094 

0.9499 

0.9911 

29 

229 

1 

0 

0.9745 

0.0103 

0.9442 

0.9885 

30 

228 

1 

0 

0.9703 

0.0111 

0.9386 

0.9857 

33 

227 

1 

0 

0.9660 

0.0118 

0.9331 

0.9828 

905 

8 

0 

1 

0.1362 

0.0364 

0.0748 

0.2159 

932 

7 

0 

2 

0.1362 

0.0364 

0.0748 

0.2159 

944 

5 

0 

1 

0.1362 

0.0364 

0.0748 

0.2159 

969 

4 

0 

1 

0.1362 

0.0364 

0.0748 

0.2159 

1021 

3 

0 

1 

0.1362 

0.0364 

0.0748 

0.2159 

1052 

2 

0 

1 

0.1362 

0.0364 

0.0748 

0.2159 

1076 

1 

0 

1 

0,1362 

0.0364 

0.0748 

0.2159 

If  we  wish  to  stratify  by  CLINIC  and  compare  the  survival 
estimates  side-to-side  for  specified  time  points,  we  use  the 
by()  and  comparef)  option.  The  code  and  output  follow: 
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sts  list,  by(clinic)  compare  at  (0  20  to  1080) 


failure  -d:  status 
analysis  time  -t:  survt 
id :  id 


1 


Survivor  Function 
clinic  1  2 


time 


0 

1.0000 

1.0000 

20 

0.9815 

0.9865 

40 

0.9502 

0.9595 

60 

0.9189 

0.9459 

80 

0.9000 

0.9320 

100 

0.8746 

0.9320 

120 

0.8681 

0.9179 

140 

0.8422 

0.9038 

160 

0.8093 

0.8753 

180 

0.7690 

0.8466 

200 

0.7420 

0.8323 

220 

0.6942 

0.8179 

840 

0.0725 

0.5745 

860 

0.0543 

0.5745 

880 

0.0543 

0.5171 

900 

0.0181 

0.5171 

920 

0.5171 

940 

0.5171 

960 

0.5171 

980 

0.5171 

1000 

0.5171 

1020 

0.5171 

1040 

0.5171 

1060 

0.5171 

1080 

Notice  that  the  survival  rate  for  CLINIC=2  is  higher  than 
CLINIC=  1 .  Other  survival  times  could  have  been  requested 
using  the  compare()  option. 


To  graph  the  Kaplan-Meier  survival  function  (against 
time),  use  the  code: 


sts  graph 
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The  code  and  output  that  provide  a  graph  of  the  Kaplan- 
Meier  survival  function  stratified  by  CLINIC  follow: 

sts  graph,  by( clinic) 


Kaplan-Meier  survival  estimates,  by  clinic 


The  failure  option  graphs  the  failure  function  (the  cumu¬ 
lative  risk)  rather  than  the  survival  (zero  to  one  rather  than 
one  to  zero).  The  code  follows  (output  omitted): 

sts  graph,  by( clinic)  failure 

The  code  to  run  the  log  rank  test  on  the  variable  CLINIC 
(and  output)  follows: 

sts  test  clinic 

failure  -d:  status  ==  1 

analysis  time  -t:  survt 

id :  id 

Log-rank  test  for  equality  of  survivor  functions 

Events  Events 
clinic  observed  expected 

1  122  90.91 

2  28  59.09 


Total  150  150.00 

chi2(l)  =  27.89 

Pr>chi2  =  0.0000 
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The  Wilcoxon,  Tarone-Ware,  Peto,  and  Flemington- 
Harrington  tests  can  also  be  requested.  These  tests  are 
variations  of  the  log  rank  test  that  weight  each  observation 
differently.  The  Wilcoxon  test  weights  the  jth  failure  time 
by  n |  (the  number  still  at  risk).  The  Tarone-Ware  test 
weights  the  jth  failure  time  by  ^nj.  The  Peto  test  weights 
the  jth  failure  time  by  the  survival  estimate,  s(tj)  calculated 
over  all  groups  combined.  This  survival  estimate,  s(tj),  is 
similar  but  not  exactly  equal  to  the  Kaplan-Meier  survival 
estimate.  The  Flemington-Harington  test  uses  the  Kaplan- 
Meier  survival  estimate,  s(t),  over  all  groups  to  calculate  its 
weights  for  the  jth  failure  time,  s^i^fl  —  S((,_[)]?,  so  it 
takes  two  arguments  (p  and  q).  The  code  follows  (output 
omitted): 

sts  test  clinic,  wilcoxon 

sts  test  clinic,  tware 

sts  test  clinic,  peto 

sts  test  clinic,  fh(l,3) 

Notice  that  the  default  test  for  the  sts  test  command  is  the 
log  rank  test.  The  choice  of  which  weighting  of  the  test 
statistic  to  use  (e.g.,  log  rank  or  Wilcoxon)  depends  on 
which  test  is  believed  to  provide  the  greatest  statistical 
power,  which  in  turn  depends  on  how  it  is  believed  the 
null  hypothesis  is  violated.  However,  one  should  make  an 
a  priori  decision  on  which  statistical  test  to  use  rather  than 
fish  for  a  desired  p-value. 

A  stratified  log  rank  test  for  CLINIC  (stratified  by  PRISON) 
can  be  run  with  the  strata  option.  With  the  stratified 
approach,  the  observed  minus  expected  number  of  events 
are  summed  over  all  failure  times  for  each  group  within 
each  stratum  and  then  summed  over  all  strata.  The  code 
follows  (output  omitted): 

sts  test  clinic,  strata(prison) 

The  sts  generate  command  can  be  used  to  create  a  new 
variable  in  the  working  dataset  containing  the  KM  survival 
estimates.  The  following  code  defines  a  new  variable  called 
SKM  (the  variable  name  is  the  user’s  choice)  that  contains 
KM  survival  estimates  stratified  by  CLINIC: 

sts  generate  skm=s,  by(clinic) 

The  ltable  command  produces  life  tables.  Life  tables  are  an 
alternative  approach  to  Kaplan-Meier  that  are  particularly 
useful  if  you  do  not  have  individual-level  data.  The  code 
and  output  that  follows  provide  life  table  survival  esti¬ 
mates,  stratified  by  CLINIC,  at  the  time  points  (in  days) 
specified  by  the  interval()  option: 
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ltable  survt  status,  by(clinic)  interval(60  150  200  280  365  730  1095) 


Interval 

Beg . 
Total 

Deaths 

Lost 

Survival 

Std . 
Error 

[95%  Conf 

.  Int.] 

clinic 

0 

=  1 

163 

13 

4 

0.9193 

0.0215 

0.8650 

0.9523 

60 

150 

146 

14 

6 

0.8293 

0.0300 

0.7609 

0.8796 

150 

200 

126 

13 

3 

0.7427 

0.0352 

0.6661 

0.8043 

200 

280 

110 

17 

2 

0.6268 

0.0393 

0.5446 

0.6984 

280 

365 

91 

10 

6 

0.5556 

0.0408 

0.4720 

0.6313 

365 

730 

75 

41 

15 

0.2181 

0.0367 

0.1509 

0.2934 

730 

1095 

19 

14 

5 

0.0330 

0.0200 

0.0080 

0.0902 

clinic 

0 

=  2 

75 

4 

2 

0.9459 

0.0263 

0.8624 

0.9794 

60 

150 

69 

5 

3 

0.8759 

0.0388 

0.7749 

0.9334 

150 

200 

61 

3 

0 

0.8328 

0.0441 

0.7242 

0.9015 

200 

280 

58 

5 

1 

0.7604 

0.0508 

0.6429 

0.8438 

280 

365 

52 

3 

2 

0.7157 

0.0540 

0.5943 

0.8065 

365 

730 

47 

7 

23 

0.5745 

0.0645 

0.4385 

0.6890 

730 

1095 

17 

1 

16 

0.5107 

0.0831 

0.3395 

0.6584 

2.  ASSESSING  THE  PH  ASSUMPTION  USING 
GRAPHICAL  APPROACHES 

Three  graphical  approaches  for  the  assessment  of  the  PH 
assumption  for  the  variable  CLINIC  are  demonstrated: 

1)  Log-log  Kaplan-Meier  survival  estimates  (stratified  by 
CLINIC)  plotted  against  time  (or  against  the  log  of 
time) 

2)  Log-log  Cox  adjusted  survival  estimates  (stratified  by 
CLINIC)  plotted  against  time 

3)  Kaplan-Meier  survival  estimates  and  Cox  adjusted 
survival  estimates  plotted  on  the  same  graph. 

All  three  approaches  are  somewhat  subjective  yet  hopefully 
informative.  The  first  two  approaches  are  based  on  whether 
the  log  log  survival  curves  are  parallel  for  different  levels  of 
CLINIC.  The  third  approach  is  to  determine  if  the  Cox 
adjusted  survival  curve  (not  stratified)  is  close  to  the  KM 
curve.  In  other  words,  are  predicted  values  from  the  PH 
model  (from  Cox)  close  to  the  “observed”  values  using  KM? 

The  first  two  approaches  use  the  stphplot  command 
while  the  third  approach  uses  the  stcoxkm  command. 
The  code  and  output  for  the  log-log  Kaplan-Meier  survival 
plots  follow: 
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stphplot,  by(clinic)  nonegative 


clinic  =  1  — * —  clinic  =  2 


ln(analysis  time) 


The  left  side  of  the  graph  seems  jumpy  for  CLINIC=  1  but  it 
only  represents  a  few  events.  It  also  looks  like  there  is  some 
separation  between  the  plots  at  the  later  times  (right  side). 
The  nonegative  option  in  the  code  requests  log(-log) 
curves  rather  than  the  default  -log(-log)  curves.  The  choice 
is  arbitrary.  Without  the  option,  the  curves  would  go  down¬ 
ward  rather  than  upward  (left-to-right). 

Stata  (as  well  as  SAS)  plot  log(survival  time)  rather  than 
survival  time  on  the  horizontal  axis  by  default.  As  far  as 
checking  the  parallel  assumption,  it  does  not  matter  if  log 
(survival  time)  or  survival  time  is  on  the  horizontal  axis. 
However,  if  the  log  log  survival  curves  look  like  straight 
lines  with  log(survival  time)  on  the  horizontal  axis,  then 
there  is  evidence  that  the  “time-to-event”  variable  follows  a 
Weibull  distribution.  If  the  slope  of  the  line  equals  one, 
then  there  is  evidence  that  the  survival  time  variable 
(SURVT)  follows  an  exponential  distribution  -  a  special 
case  of  the  Weibull  distribution.  For  these  situations,  a 
parametric  survival  model  can  be  used. 

It  may  be  visually  more  informative  to  graph  the  log  log 
survival  curves  against  survival  time  (rather  than  log  sur¬ 
vival  time).  The  nolntime  option  can  be  used  to  put  sur¬ 
vival  time  on  the  horizontal  axis.  The  code  and  output 
follows: 
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stphplot,  by(clinic)  nonegative  nolntime 


clinic  =1  — * — clinic  =  2 


The  graph  suggests  that  the  curves  begin  to  diverge  over 
time. 

The  stphplot  command  can  also  be  used  to  obtain  log-log 
Cox  adjusted  survival  estimates.  The  code  follows: 

stphplot,  strata(clinic)  adjust(prison  dose)  nonegative  nolntime 

The  log-log  curves  are  adjusted  for  PRISON  and  DOSE 
using  a  stratified  COX  model  on  the  variable  CLINIC.  The 
mean  values  of  PRISON  and  DOSE  are  used  for  the  adjust¬ 
ment.  The  output  follows: 


clinic  =1  — * — clinic  =  2 


The  Cox  adjusted  curves  look  very  similar  to  the  KM 


curves. 
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The  stcoxkm  command  is  used  to  compare  Kaplan-Meier 
survival  estimates  and  Cox  adjusted  survival  estimates 
plotted  on  the  same  graph.  The  code  and  output  follow: 

stcoxkm,  by( clinic) 


Observed:  clinic  =  1  — * —  Observed:  clinic  =  2 


The  KM  and  adjusted  survival  curves  are  very  close 
together  for  CLINIC=1  and  less  so  for  CLINIC=2.  These 
graphical  approaches  suggest  that  there  is  some  violation 
with  the  PH  assumption.  The  predicted  values  are  Cox 
adjusted  for  CLINIC,  and  therefore  assume  the  PH 
assumption.  Notice  that  the  predicted  survival  curves 
are  not  parallel  by  CLINIC  even  though  we  are  adjusting 
for  CLINIC.  It  is  the  log-log  survival  curves,  rather  than 
the  survival  curves,  that  are  forced  to  be  parallel  by  Cox 
adjustment. 

The  same  graphical  analyses  can  be  performed  with 
PRISON  and  DOSE.  However,  DOSE  would  have  to  be 
categorized  since  it  is  a  continuous  variable. 


3.  RUNNING  A  COX  PH  MODEL 

For  a  Cox  PH  model,  the  key  assumption  is  that  the  hazard 
is  proportional  across  different  patterns  of  covariates.  The 
first  model  that  is  demonstrated  contains  all  three  covari¬ 
ates:  PRISON,  DOSE,  and  CLINIC.  In  this  model,  we  are 
assuming  the  same  baseline  hazard  for  all  possible  pat¬ 
terns  of  these  covariates.  In  other  words,  we  are  accepting 
the  PH  assumption  for  each  covariate  (perhaps  incor¬ 
rectly).  The  code  and  output  follow: 
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stcox  prison  clinic  dose,  nohr 


failure  -d:  status 
analysis  time  -t:  survt 
id :  id 


1 


Iteration 

0: 

log 

Iteration 

1: 

log 

Iteration 

2: 

log 

Iteration 

3: 

log 

Iteration 

4: 

log 

Refining  estimates 

Iteration 

0: 

log 

likelihood  = 
likelihood  = 
likelihood  = 
likelihood  = 
likelihood  = 

likelihood  = 


-705 . 6619 
-674.54907 
-673.407 
-673.40242 
-673.40242 

-673.40242 


Cox  regression  --  Breslow  method  for  ties 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  64.52 

Log  likelihood 

-673.40242 

Prob  >  chi2 

=  0.0000 

-t 

-d 

Coef . 

Std.  Err. 

z 

P>lz| 

[95%  Conf. 

Interval] 

prison 

.3265108 

.1672211 

1.95 

0.051 

- .0012366 

.6542581 

clinic 

-1.00887 

.2148709 

-4.70 

0.000 

-1.430009 

- .5877304 

dose 

- .0353962 

.0063795 

-5.55 

0.000 

- .0478997 

- .0228926 

The  output  indicates  that  it  took  five  iterations  for  the  log 
likelihood  to  converge  at  —673.40242.  The  iteration  history 
typically  appears  at  the  top  of  Stata  model  output;  how¬ 
ever,  the  iteration  history  will  subsequently  be  omitted. 
The  final  table  lists  the  regression  coefficients,  their  stan¬ 
dard  errors,  a  Wald  test  statistic  (z)  for  each  covariate,  with 
corresponding  p-value,  and  95%  confidence  interval. 

The  nohr  option  in  the  stcox  command  requests  the 
regression  coefficients  rather  than  the  default  exponen¬ 
tiated  coefficients  (hazard  ratios).  If  you  want  the  expo¬ 
nentiated  coefficients,  omit  the  nohr  option.  The  code  and 
output  follow: 


stcox  prison  clinic  dose 
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Cox  regression  --  Breslow  method  for  ties 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  64.52 

Log  likelihood 

-673.40242 

Prob  >  chi2 

=  0 . 0000 

-t 

_d 

Haz.  Ratio 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

1.386123 

.231789 

1.95 

0.051 

.9987642 

1.923715 

clinic 

.3646309 

.0783486 

-4.70 

0.000 

.2393068 

.5555868 

dose 

.965223 

.0061576 

-5.55 

0.000 

.9532294 

.9773675 

This  table  contains  the  hazard  ratios,  its  standard  errors, 
and  corresponding  confidence  intervals.  Notice  that  you 
do  not  need  to  supply  the  "time-to  event”  variable  or  the 
status  variable  when  using  the  stcox  command.  The  stcox 
command  uses  the  information  supplied  from  the  stset 
command.  A  Cox  model  can  also  be  run  using  the  cox 
command,  which  does  not  rely  on  the  stset  command 
having  previously  been  run.  The  code  follows: 

cox  survt  prison  clinic  dose,  dead(status) 

Notice  that  with  the  cox  command,  we  have  to  list  the 
variable  SURVT.  The  dead()  option  is  used  to  indicate 
that  the  variable  STATUS  distinguishes  events  from  cen¬ 
sorship.  The  variable  used  with  the  dead()  option  needs  to 
be  coded  nonzero  for  events  and  zero  for  censorships.  The 
output  from  the  cox  command  follows: 

Cox  regression 
Entry  time  0 

Log  likelihood 


Breslow  method  for  ties 

Number  of  obs  =  238 

LR  chi2(3)  =  64.52 

Prob  >  chi2  =  0.0000 

-673.40242  Pseudo  R2  =  0.0457 


survt 

status 

Coef . 

Std.  Err. 

z 

P>|z| 

[95%  Conf.  Interval] 

prison 

.3265108 

.1672211 

1.95 

0.051 

-.0012366  .6542581 

clinic 

-1.00887 

.2148709 

-4.70 

0.000 

-1.430009  -.5877304 

dose 

- .0353962 

.0063795 

-5.55 

0.000 

-.0478997  -.0228926 

The  output  is  identical  to  that  obtained  from  the  stcox 
command  except  that  the  regression  coefficients  are 
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given  by  default.  The  hr  option  for  the  cox  command 
supplies  the  exponentiated  coefficients. 

Notice  with  the  output  that  the  default  method  of  handling 
ties  (i.e.,  when  multiple  events  happen  at  the  same  time)  is 
the  Breslow  method.  If  you  wish  to  use  more  exact  meth¬ 
ods,  you  can  use  the  exactp  option  (for  the  exact  partial 
likelihood)  or  the  exactm  option  (for  the  exact  marginal 
likelihood)  in  the  stcox  or  cox  command.  The  exact  meth¬ 
ods  are  computationally  more  intensive  and  typically  have 
a  slight  impact  on  the  parameter  estimates.  However,  if 
there  are  a  lot  of  events  that  occur  at  the  same  time,  then 
exact  methods  are  preferred.  The  code  and  output  follow: 

stcox  prison  clinic  dose,  nohr  exactm 


Cox  regression  --  exact  marginal  likelihood 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  64.56 

Log  likelihood 

-666.3274 

Prob  >  chi2 

=  0 . 0000 

-t 

-d 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

.326581 

.1672306 

1.95 

0.051 

- .0011849 

.6543469 

clinic 

-1.009906 

.2148906 

-4.70 

0.000 

-1.431084 

- .5887285 

dose 

- .0353694 

.0063789 

-5.54 

0.000 

-.0478718 

- .0228669 

Alternatively,  you  could  use  Efron  method  of  handling  ties. 
This  is  the  method  that  the  R  statistical  package  uses  as  its 
default.  The  code  follows  (output  omitted): 


stcox  prison  clinic  dose,  nohr  efron 

Suppose  you  are  interested  in  running  a  Cox  model  with  two 
interaction  terms  with  PRISON.  The  generate  command 
can  be  used  to  define  new  variables.  The  variables  CLIN_PR 
and  CLIN_DO  are  product  terms  that  are  defined  from 
CLINIC  x  PRISON  and  CLINIC  x  DOSE.  The  code  follows: 

generate  cIin_pr=clinic*prison 
generate  c!in_do=clinic*dose 

Type  describe  or  list  to  see  that  the  new  variables  are  in 
the  working  dataset. 
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The  following  code  runs  the  Cox  model  with  the  two 
interaction  terms: 

stcox  prison  clinic  dose  clin_pr  clin_do,  nohr 


Cox  regression  --  Breslow  method  for  ties 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(5) 

=  68.12 

Log  likelihood 

-671.59969 

Prob  >  chi2 

=  0 . 0000 

-t 

-d 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

1.191998 

.5413685 

2.20 

0.028 

.1309348 

2.253061 

clinic 

.1746985 

.893116 

0.20 

0.845 

-1.575777 

1.925174 

dose 

- .0193175 

.01935 

-1.00 

0.318 

-.0572428 

.0186079 

clin.pr 

-  .7379931 

.4314868 

-1.71 

0.087 

-1.583692 

.1077055 

clin.do 

- .0138608 

.0143275 

-0.97 

0.333 

-.0419422 

.0142206 

The  lrtest  command  can  be  used  to  perform  likelihood 
ratio  tests.  For  example,  to  perform  a  likelihood  ratio  test 
on  the  two  interaction  terms,  CLIN_PR  and  CLIN_DO,  in 
the  preceding  model,  we  can  save  the  -2  log  likelihood 
statistic  of  the  full  model  in  the  computer’s  memory  by 
typing  the  following  command: 

lrtest,  saving(O) 

Now,  the  reduced  model  (without  the  interaction  terms) 
can  be  run  (output  omitted)  by  typing: 

stcox  prison  clinic  dose 

After  the  reduced  model  is  ran,  the  following  command 
provides  the  results  of  the  likelihood  ratio  test  comparing 
the  full  model  (with  the  interaction  terms)  to  the  reduced 
model: 
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lrtest 

The  resulting  output  follows: 

Cox:  likelihood-ratio  test  chi2(2)  =  3.61 

Prob  >  chi2  =  0.1648 

The  p-value  of  0.1648  is  not  significant  at  the  alpha  =  0.05 
level. 


4.  RUNNING  A  STRATIFIED  COX  MODEL 

If  the  proportional  hazard  assumption  is  not  met  for  the 
variable  CLINIC,  but  is  met  for  the  variables  PRISON  and 
DOSE,  then  a  stratified  Cox  analysis  can  be  performed. 
The  stcox  command  can  be  used  to  run  a  stratified  Cox 
model.  The  following  code  (with  output)  runs  a  Cox  model 
stratified  on  CLINIC: 

stcox  prison  dose,  strata(clinic) 


Stratified  Cox  regr. 

--  Breslow  method 

for  ties 

No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(2) 

=  33.94 

Log  likelihood 

-597.714 

Prob  >  chi2 

=  0.0000 

-t 

-d 

Haz.  Ratio 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

dose 

1.475192 

.9654655 

.2491827 

.0062418 

2.30 

-5.44 

0.021 

0.000 

1.059418 

.953309 

2.054138 

.977777 

Stratified  by  clinic 


The  strata()  option  allows  up  to  five  stratified  variables. 


A  stratified  Cox  model  can  be  run  including  the  two  inter¬ 
action  terms.  Recall  that  the  generate  command  created 
these  variables  in  the  previous  section.  This  model  allows 
for  the  effect  of  PRISON  and  DOSE  to  differ  for  different 
values  of  CLINIC.  The  code  and  output  follow: 
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stcox  prison  dose  clin_pr  clin_do,  strata( clinic)  nohr 


Stratified  Cox  regr.  --  Breslow  method  for  ties 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(4) 

=  35.81 

Log  likelihood 

-596.77891 

Prob  >  chi2 

=  0.0000 

-t 

-d 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

1.087282 

.5386163 

2.02 

0.044 

.0316135 

2.142951 

dose 

- .0348039 

.0197969 

-1.76 

0.079 

- .0736051 

.0039973 

clin.pr 

- .584771 

.4281291 

-1.37 

0.172 

-1.423889 

.2543465 

clin.do 

- .0010622 

.014569 

-0.07 

0.942 

- .0296169 

.0274925 

Stratified  by  clinic 


Suppose  we  wish  to  estimate  the  hazard  ratio  for 
PRISON=l  vs.  PRISON=0  for  CLINIC=2.  This  hazard 
ratio  can  be  estimated  by  exponentiating  the  coefficient 
for  prison  plus  2  times  the  coefficient  for  the  clinic-prison 
interaction  term.  This  expression  is  obtained  by  substitut¬ 
ing  the  appropriate  values  into  the  hazard  in  both  the 
numerator  (for  PRISON=l)  and  denominator  (for 
PRISON=0)  (see  below): 

HR  _  ho(t)  exp[l^!  +  p2DOSE  +  (2)(1)J?3  +  pACL!N_DO ] 
h0(t)  exppi?!  +  p2DOSE  +  (2)(0)/?3  +  (!4CLIN  DO] 

=  exp(^!  +2^). 

The  lincom  command  can  be  used  to  exponentiate  linear 
combinations  of  parameters.  Run  this  command  directly 
after  running  the  model  to  estimate  the  HR  for  PRISON 
where  CLINIC=2.  The  code  and  output  follow: 


lincom  prison+2*cIin_pr,  hr 


(  1)  prison  +  2.0  clin  pr  =  0.0 

t  |  Haz.  Ratio  Std.  Err. 

z 

P>  1  z  1 

[95%  Conf. 

Interval] 

(1)  t  .9210324  .3539571 

-0.21 

0.831 

.4336648 

1 . 956121 

Software:  A.  Stata  545 


Models  can  also  be  run  on  a  subsetted  portion  of  the  data 
using  the  if  statement.  The  following  code  (with  output) 
runs  a  Cox  model  on  the  data  where  CLINIC=2: 

stcox  prison  dose  if  clinic==2 


Cox  regression  --  Breslow  method  for  ties 


No.  of  subjects  = 

75 

Number  of  obs 

=  75 

No.  of 

failures  = 

28 

Time  at 

risk 

36254 

LR  chi2(2) 

=  9.70 

Log  likelihood 

-104.37135 

Prob  >  chi2 

=  0.0078 

.t 

_d 

Haz.  Ratio 

Std.  Err.  z 

p > | z [  [95%  Conf. 

Interval] 

prison 

.9210324 

.3539571  -0.21 

0.831  .4336648 

1.956121 

dose 

.9637452 

.0118962  -2.99 

0.003  .9407088 

.9873457 

The  hazard  ratio  estimates  for  PRISON=l  vs.  PRISON=0 
(for  CLINIC=2)  are  exactly  the  same  using  the  stratified 
Cox  approach  with  product  terms  and  the  subsetted  data 
approach  (0.9210324). 


5.  ASSESSING  THE  PH  ASSUMPTION 
WITH  A  STATISTICAL  TEST 

The  stphtest  command  can  be  used  to  perform  a  statistical 
test.  A  statistical  test  gives  objective  criteria  for  assessing 
the  PH  assumption  compared  to  using  the  graphical 
approach.  This  does  not  mean  that  this  statistical  test  is 
better  than  the  graphical  approach.  It  is  just  more  objec¬ 
tive.  In  fact,  the  graphical  approach  is  generally  more 
informative  for  descriptively  characterizing  the  form  of  a 
PH  violation. 

The  command  stphtest  outputs  a  PH  global  test  for  all  the 
covariates  simultaneously  and  can  also  be  used  to  obtain  a 
test  for  each  covariate  separately  with  the  detail  option.  To 
run  these  tests,  you  must  obtain  Schoenfeld  residuals  for 
the  global  test  and  scaled  Schoenfeld  residuals  for  separate 
tests  with  each  covariate.  The  idea  behind  the  PH  test  is 
that  if  the  PH  assumption  is  satisfied,  then  the  residuals 
should  not  be  correlated  with  survival  time  (or  ranked 
survival  time).  On  the  other  hand,  if  the  residuals  tend  to 
be  positive  for  subjects  who  become  events  at  a  relatively 
early  time  and  negative  for  subjects  who  become  events  at 
a  relatively  late  time  (or  vice  versa),  then  there  is  evidence 
that  the  hazard  ratio  is  not  constant  over  time  (i.e.,  PH 
assumption  is  violated). 
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Before  the  stphtest  can  be  implemented,  the  stcox 
command  needs  to  be  run  to  obtain  the  Schoenfeld  residuals 
(with  the  schoenfeld()  option)  and  the  scaled  Schoenfeld 
residuals  (with  the  scaledsch()  option).  The  names  of  newly 
defined  variables  are  in  the  parentheses:  schoen*  creates 
SCHOEN1,  SCHOEN2,  and  SCHOEN3  while  scaled*  cre¬ 
ates  SCALED1,  SCALED2,  and  SCALED3.  These  variables 
contain  the  residuals  for  PRISON,  DOSE,  and  CLINIC, 
respectively  (the  order  that  the  variables  were  entered  in 
the  model).  The  user  is  free  to  type  any  variable  name  in 
the  parentheses.  The  Schoenfeld  residuals  are  used  for  the 
global  test  while  the  scaled  Schoenfeld  residuals  are  used  for 
the  testing  of  the  PH  assumption  for  individual  variables: 

stcox  prison  dose  clinic,  schoenfeld(schoen*)  scaledsch(scaled*) 

Once  the  residuals  are  defined,  the  stphtest  command  can 
be  ran.  The  code  and  output  follow: 

stphtest,  rank  detail 


Test  of  proportional  hazards  assumption 
Time:  Rank(t) 


rho 

chi2 

df 

Prob>chi2 

prison 

-0.04645 

0.32 

1 

0.5689 

dose 

0.08975 

1.08 

1 

0.2996 

clinic 

-0.24927 

10.44 

1 

0.0012 

global  test 

12.36 

3 

0.0062 

The  tests  suggest  that  the  PH  assumption  is  violated  for 
CLINIC  with  the  p-value  at  0.0012.  The  tests  do  not  suggest 
violation  of  the  PH  assumption  for  PRISON  or  DOSE. 

The  plot()  option  of  the  stphtest  command  can  be  used  to 
produce  a  plot  of  the  scaled  Schoenfeld  residuals  for 
CLINIC  against  survival  time  ranking.  If  the  PH  assump¬ 
tion  is  met,  the  fitted  curve  should  look  horizontal  since  the 
scaled  Schoenfeld  residuals  would  be  independent  of  sur¬ 
vival  time.  The  code  and  graph  follow: 
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stphtest,  rank  plot(clinic) 

Test  of  PH  Assumption 


The  fitted  curve  slopes  slightly  downward  (not  horizontal). 


6.  OBTAINING  COX  ADJUSTED  SURVIVAL  CURVES 

Adjusted  survival  curves  can  be  obtained  with  the  sts 
graph  command.  Adjusted  survival  curves  depend  on  the 
pattern  of  covariates.  For  example,  the  adjusted  survival 
estimates  for  a  subject  with  PRISON=l,  CLINIC=1,  and 
DOSE=40  are  generally  different  than  for  a  subject  with 
PRISON=0,  CLINIC=2,  and  DOSE=70.  The  sts  graph  com¬ 
mand  produces  adjusted  baseline  survival  curves.  The  fol¬ 
lowing  code  produces  an  adjusted  survival  plot  with 
PRISON=0,  CLINIC=0,  and  DOSE=0  (output  omitted): 

sts  graph,  adjustfor(prison  dose  clinic) 

It  is  probably  of  more  interest  to  create  adjusted  plots  for 
reasonable  patterns  of  covariates  (CLINIC=0  is  not  even  a 
valid  value).  Suppose  we  are  interested  in  graphing  the 
adjusted  survival  curve  for  PRISON=0,  CLINIC=2,  and 
DOSE=70.  We  can  create  new  variables  with  the  generate 
command  that  can  be  used  with  the  sts  graph  command: 

generate  clinic2= clinic-2 
generate  dose70=dose-70 

These  variables  (PRISON,  CLINIC2,  and  DOSE70)  pro¬ 
duce  the  desired  pattern  of  covariate  when  each  is  set  to 
zero.  The  following  code  produces  the  desired  results: 

sts  graph,  adjustfor(prison  dose70  clinic2) 
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Survivor  function 

adjusted  for  prison  dose70  clinic2 


Adjusted  stratified  Cox  survival  curves  can  be  obtained 
with  the  strataf)  option.  The  following  code  creates  two 
survival  curves  stratified  by  clinic  (CLINIC=  1 ,  PRISON=0, 
and  DOSE=70)  and  (CLINIC=2,  PRISON=0,  and 
DOSE=70): 

sts  graph,  strata(clinic)  adjust  for  (prison  dose70) 


Survivor  functions,  by  clinic 
adjusted  for  prison  dose70 


The  adjusted  curves  suggest  that  there  is  a  strong  effect 
from  CLINIC  on  survival. 


Software:  A.  Stata  549 


Suppose  the  interest  is  in  comparing  adjusted  survival 
plots  of  PRISON=l  to  PRISON=0  stratified  by  CLINIC. 
In  this  setting,  the  sts  graph  command  cannot  be  used 
directly  since  we  cannot  simultaneously  define  both  levels 
of  prison  (PRISON=l  and  PRISON=0)  as  the  baseline  level 
(recall  sts  graph  plots  only  the  baseline  survival  function). 
However,  survival  estimates  can  be  obtained  using  the  sts 
generate  command  twice,  once  where  PRISON=0  is 
defined  as  baseline  and  once  where  PRISON=l  is  defined 
as  baseline.  The  following  code  creates  variables  contain¬ 
ing  the  desired  adjusted  survival  estimates: 

generate  prison  l=prison-l 

sts  generate  scoxO=s,  strata(clinic)  adjustfor(prison  dose70) 
sts  generate  scoxl=s,  strata(clinic)  adjustfor(prisonl  dose70) 

The  variables  SCOX1  and  SCOXO  contain  the  survival  esti¬ 
mates  for  PRISON=l  and  PRISON=0,  respectively,  adjust¬ 
ing  for  dose  and  stratifying  by  clinic.  The  graph  command 
is  used  to  plot  these  estimates.  If  you  are  using  a  higher 
version  of  Stata  than  Stata  7.0  (e.g.,  Stata  8.0),  then  you 
should  replace  the  graph  command  with  the  graph7  com¬ 
mand.  The  code  and  output  follow: 

Graph7  scoxO  scoxl  survt,  twoway  symbol( [clinic]  [clinic])  xlabel(365,730,1095) 


symbols  O  for  prison=0,  X  for  prison=1 

1 


.009935 


365  730  1095 

survival  time  in  days 
subsetted  by  clinic==1 

We  can  also  graph  PRISON=l  and  PRISON=0  subsetting 
the  data  where  CLINIC=1.  The  option  twoway  requests  a 
two-way  scatter  plot.  The  options  symbol,  xlabel,  and  title 
request  the  symbols,  axis  labels,  and  title,  respectively: 
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graph7  scoxO  scoxl  survt  if  clinic=  =  l,  twoway  symbol(ox)  xlabel(365, 730, 1095) 
tl(“  symbols  O  for  prison=0,  X  for  prison=l”)  title  (“subsetted  by  clinic==l”) 

S(t+0),  adjusted  S(t+0),  adjusted 


1  -\ 


.009935  -\ 


’i  v 


'U 
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‘22222  2  22 
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A 
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2  2  2 
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730 

survival  time  in  days 


1095 


7.  RUNNING  AN  EXTENDED  COX  MODEL 

If  the  PH  assumption  is  not  satisfied,  a  possible  strategy  is 
to  run  a  stratified  Cox  model.  Another  strategy  is  to  run  a 
Cox  model  with  time-varying  covariates  (an  extended  Cox 
model).  The  challenge  of  running  an  extended  Cox  model  is 
to  choose  the  appropriate  function  of  survival  time  to 
include  in  the  model. 

Suppose  we  want  to  include  a  time  dependent  covariate 
DOSE  times  the  log  of  time.  This  product  term  could  be 
appropriate  if  the  hazard  ratio  comparing  any  two  levels  of 
DOSE  monotonically  increases  (or  decreases)  over  time. 
The  tvc  option(  )  of  the  stcox  command  can  be  used  to 
declare  DOSE  a  time  varying  covariate  that  will  be  multi¬ 
plied  by  a  function  of  time.  The  specification  of  that  func¬ 
tion  of  time  is  stated  in  the  texp  option  with  the  variable  _t 
representing  time.  The  code  and  output  for  a  model  con¬ 
taining  the  time  varying  covariate,  DOSE  x  ln(_t),  follow: 
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stcox  prison  clinic  dose,  tvc(dose)  texp(ln(_t))  nohr 


Cox  regression  - 

-  Breslow  method 

for  ties 

No.  of  subjects 

=  238 

Number  of  obs 

=  238 

No.  of  failures 

=  150 

Time  at  risk 

=  95812 

LR  chi2(4) 

=  66.29 

Log  likelihood 

=  -672.51694 

Prob  >  chi2 

=  0.0000 

-t 


-d 

Coef . 

Std .  Err. 

z 

p>]z[  [95%  Conf. 

Interval] 

rh 

prison 

.3404817 

.1674672 

2.03 

0.042  .012252 

.6687113 

clinic 

-1.018682 

.215385 

-4.73 

0.000  -1.440829 

- .5965352 

dose 

- .0824307 

.0359866 

-2.29 

0.022  -.1529631 

- .0118982 

t 

dose 

.0085751 

.0064554 

1.33 

0.184  -.0040772 

.0212274 

note:  second  equation  contains  variables  that  continuously  vary 
with  respect  to  time;  variables  interact  with  current 
values  of  ln(.t) . 


The  parameter  estimate  for  the  time-dependent  covariate, 
DOSE  x  ln(_t),  is  0.0085751;  however,  it  is  not  statistically 
significant  with  a  Wald  test  p-value  of  0.184. 

A  heaviside  function  can  also  be  used.  The  following  code 
runs  a  model  with  a  time-dependent  variable  equal  to 
CLINIC  if  time  is  greater  than  or  equal  to  365  days  and 
0  otherwise. 

stcox  prison  dose  clinic,  tvc(clinic)  texp(_t>=365)  nohr 

Stata  recognizes  the  expression  (_t>=365)  as  taking  the 
value  1  if  survival  time  is  >365  days  and  0  otherwise.  The 
output  follows: 
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Cox 

regression  --  Breslow  method 

for 

ties 

No. 

of  subjects  = 

238 

Number  of  obs 

=  238 

No. 

of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(4) 

=  74.17 

Log 

likelihood 

-668.57443 

Prob 

>  chi2 

=  0.0000 

-t 

-d  Coef. 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

rh 

prison  .377704 

.1684024 

2.24 

0.025 

.0476414 

.7077666 

dose  -.0355116 

.0064354  - 

5.52 

0.000 

- .0481247 

-.0228985 

clinic  -.4595628 

.2552911  - 

1.80 

0.072 

- .959924 

.0407985 

t 

clinic  -1.368665 

.4613948  - 

2.97 

0.003 

-2.272982 

- .464348 

note:  second  equation  contains  variables  that  continuously  vary 
with  respect  to  time;  variables  interact  with  current 
values  of-t>  =  365. 


Unfortunately,  the  texp  option  can  only  be  used  once  in  the 
stcox  command.  This  makes  it  more  difficult  to  run  the 
equivalent  model  with  two  heaviside  functions.  However,  it 
can  be  accomplished  using  the  stsplit  command,  which 
adds  extra  observations  to  the  working  dataset.  The  follow¬ 
ing  code  creates  a  variable  called  V 1  and  adds  new  obser¬ 
vations  to  the  dataset: 

stsplit  vl,  at(365) 

After  the  above  stsplit  command  is  executed,  any  subject 
followed  more  than  365  days  is  represented  by  two  obser¬ 
vations  rather  than  one.  For  example,  the  first  subject 
(ID=1)  had  an  event  on  the  428th  day;  the  first  observation 
for  that  subject  shows  no  event  between  0  and  365  days 
while  the  second  observation  shows  an  event  on  the  428th 
day.  The  newly  defined  variable  vl  has  the  value  365  for 
observations  with  survival  time  exceeding  or  equal  to  365 
and  0  otherwise.  The  following  code  lists  the  first  ten 
observations  for  the  requested  variables  (output  follows): 
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list  id  _tO  _t  _d  clinic  vl  in  1/10 


id 

-to 

-t 

-d 

clinic 

vl 

1. 

1 

0 

365 

0 

1 

0 

2, 

1 

365 

428 

1 

1 

365 

3. 

2 

0 

275 

1 

1 

0 

4, 

3 

0 

262 

1 

1 

0 

5. 

4 

0 

183 

1 

1 

0 

6. 

5 

0 

259 

1 

1 

0 

7. 

6 

0 

365 

0 

1 

0 

8. 

6 

365 

714 

1 

1 

365 

9. 

7 

0 

365 

0 

1 

0 

10, 

7 

365 

438 

1 

1 

365 

With  the  data  in  this  form,  two  heaviside  functions  can 
actually  be  defined  in  the  data  using  the  following  code: 


generate  hv2 = clinic*  (v 1/3 65) 
generate  hvl=clinic*(l-(vl/365)) 


The  following  code  and  output  list  a  sample  of  the  observa¬ 
tions  (in  159/167)  with  the  observation  number  sup¬ 
pressed  (the  noobs  option): 

list  id  _t0  _t  clinic  vl  hvl  hv2  in  159/167,  noobs 


id 

.to 

-t 

clinic 

vl 

hvl 

hv2 

100 

0 

365 

1 

0 

1 

0 

100 

365 

749 

1 

365 

0 

1 

101 

0 

150 

1 

0 

1 

0 

102 

0 

365 

1 

0 

1 

0 

102 

365 

465 

1 

365 

0 

1 

103 

0 

365 

2 

0 

2 

0 

103 

365 

708 

2 

365 

0 

2 

104 

0 

365 

2 

0 

2 

0 

104 

365 

713 

2 

365 

0 

2 

With  the  two  heaviside  functions  defined  in  the  split  data,  a 
time  dependent  model  using  these  functions  can  be  run 
with  the  following  code  (the  output  follows): 
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stcox  prison  dose  hvl  hv2,  nohr 


stcox  prison  clinic  dose  hvl  hv2,  nohr 


No.  of  subjects  = 

238 

Number  of  obs 

=  360 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(4) 

=  74.17 

Log  likelihood 

-668.57443 

Prob  >  chi2 

=  0.0000 

-t 


-d 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

.377704 

.1684024 

2.24 

0.025 

.0476414 

.7077666 

dose 

- .0355116 

.0064354 

-5.52 

0.000 

- .0481247 

- .0228985 

hvl 

- .4595628 

.2552911 

-1.80 

0.072 

- .959924 

.0407985 

hv2 

-1.828228 

.385946 

-4.74 

0.000 

-2.584668 

-1.071788 

The  stsplit  command  is  complicated  but  it  offers  a  power¬ 
ful  approach  for  manipulating  the  data  to  accommodate 
time  varying  analyses. 


If  you  wish  to  return  the  data  to  its  previous  form,  drop  the 
variables  that  were  created  from  the  split  and  then  use  the 
stjoin  command: 

drop  vl  hvl  hv2 
stjoin 


It  is  possible  to  split  the  data  at  every  single  failure  time, 
but  this  uses  a  large  amount  of  memory.  However,  if  there 
is  only  one  time  varying  covariate  in  the  model,  the  sim¬ 
plest  way  to  run  an  extended  Cox  model  is  by  using  the  tvc 
and  texp  options  with  the  stcox  command. 


One  should  not  confuse  an  individual’s  survival  time  vari¬ 
able  (the  outcome  variable)  with  the  variable  used  to  define 
the  time  dependent  variable  (_t  in  Stata).  The  individual’s 
survival  time  variable  is  a  time  independent  variable.  The 
time  of  the  individual’s  event  (or  censorship)  does  not 
change.  A  time-dependent  variable,  on  the  other  hand,  is 
defined  so  that  it  can  change  its  values  over  time. 


8.  RUNNING  PARAMETRIC  MODELS 

The  Cox  PH  model  is  the  most  widely  used  model  in  sur¬ 
vival  analysis.  A  key  reason  why  it  is  so  popular  is  that  the 
distribution  of  the  survival  time  variable  need  not  be  spe¬ 
cified.  However,  if  it  is  believed  that  survival  time  follows  a 
particular  distribution,  then  that  information  can  be  uti¬ 
lized  in  a  parametric  modeling  of  survival  data. 
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Many  parametric  models  are  accelerated  failure  time 
(AFT)  models.  Whereas  the  key  assumption  of  a  PH 
model  is  that  hazard  ratios  are  constant  over  time,  the 
key  assumption  for  an  AFT  model  is  that  survival  time 
accelerates  (or  decelerates)  by  a  constant  factor  when  com¬ 
paring  different  levels  of  covariates. 

The  most  common  distribution  for  parametric  modeling  of 
survival  data  is  the  Weibull  distribution.  The  Weibull  dis¬ 
tribution  has  the  desirable  property  that  if  the  AFT 
assumption  holds,  then  the  PH  assumption  also  holds. 
The  exponential  distribution  is  a  special  case  of  the  Wei¬ 
bull  distribution.  The  key  property  for  the  exponential 
distribution  is  that  the  hazard  is  constant  over  time  (not 
just  the  hazard  ratio).  The  Weibull  and  exponential  model 
can  be  run  as  a  PH  model  (the  default)  or  an  AFT  model. 

A  graphical  method  for  checking  the  validity  of  a  Weibull 
assumption  is  to  examine  Kaplan-Meier  log-log  survival 
curves  against  log  survival  time.  This  is  accomplished 
with  the  sts  graph  command  (see  Section  2  of  this  appen¬ 
dix).  If  the  plots  are  straight  lines,  then  there  is  evidence 
that  the  distribution  of  survival  times  follows  a  Weibull 
distribution.  If  the  slope  of  the  line  equals  one,  then  the 
evidence  suggests  that  survival  time  follows  an  exponential 
distribution. 

The  streg  command  is  used  to  run  parametric  models. 
Even  though  the  log  log  survival  curves  obtained  using 
the  addicts  dataset  are  not  straight  lines,  the  data  will  be 
used  for  illustration.  First,  a  parametric  model  using  the 
exponential  distribution  will  be  demonstrated.  The  code 
and  output  follow: 

streg  prison  dose  clinic,  dist(exponential)  nohr 


Exponential  regression  --  log  relative-hazard  form 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of 

failures  = 

150 

Time  at 

risk 

95812 

LR 

chi2(3) 

=  49.91 

Log  likelihood 

-270.47929 

Prob  >  chi2 

=  0.0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

.2526491 

.1648862 

1.53 

0.125 

- .070522 

.5758201 

dose 

-.0289167 

.0061445 

-4.71 

0.000 

-  .0409596 

- .0168738 

clinic 

-.8805819 

.210626  - 

-4.18 

0.000 

-1.293401 

- .4677625 

-cons 

-3.684341 

.4307163 

-8.55 

0.000 

-4.528529 

-2.840152 
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The  distribution  is  specified  with  the  dist()  option.  The 
stcurv  command  can  be  used  following  the  streg  com¬ 
mand  to  obtain  fitted  survival,  hazard,  or  cumulative  haz¬ 
ard  curves.  The  following  code  obtains  the  estimated 
hazard  function  for  PRISON=0,  DOSE=40,  and  CLINIC=1: 

stcurv,  hazard  at  (prison=0  dose=40  clinic=l) 


1.00327  -\ 


°  .2 


o 
c n 

Cl 


-.996726 

2 


"  1076 

analysis  time 
Exponential  regression 


The  graph  illustrates  the  fact  that  the  hazard  is  constant  over 
time  if  survival  time  follows  an  exponential  distribution. 


Next,  a  Weibull  distribution  is  run  using  the  streg  com¬ 
mand: 


streg  prison  dose  clinic,  dist(weibull)  nohr 


Weibull 

regression 

--  log  relative-1 

hazard  form 

No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of 

failures  = 

150 

Time  at 

risk 

95812 

LR 

chi2(3) 

=  60.89 

Log  likelihood 

-260.98467 

Prob  >  chi2 

=  0.0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

.3144143 

.1659462 

1.89 

0.058 

-  .0108342 

.6396628 

dose 

- .0334675 

.006255 

-5.35 

0.000 

- .0457272 

- .0212079 

clinic 

-  .9715245 

.2122826  ■ 

-4.58 

0.000 

-1.387591 

- .5554582 

.cons 

-5.624436 

.6588041  ■ 

-8.54 

0.000 

-6.915668 

-4.333203 

/ln-p 

.3149526 

.0675583 

4.66 

0.000 

.1825408 

.4473644 

P 

1.370194 

.092568 

1.200263 

1.564184 

1/P 

.7298235 

.0493056 

.6393109 

.8331507 

Software:  A.  Stata  557 


Notice  that  the  Weibull  output  has  a  parameter  p  that  the 
exponential  distribution  does  not  have.  The  hazard  func¬ 
tion  for  a  Weibull  distribution  is  /ptn  1 .  If  p  =  1,  then  the 
Weibull  distribution  is  also  an  exponential  distribution 
(h  (t)  =  1).  Hazard  ratio  parameters  are  given  by  default 
for  the  Weibull  distribution.  If  you  want  the  parameteriza¬ 
tion  for  an  AFT  model,  then  use  the  time  option. 

The  code  and  output  for  a  Weibull  AFT  model  follow: 

streg  prison  dose  clinic,  dist(weibull)  time 


Weibull  regression  --  accelerated  failure-time  form 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  60.89 

Log  likelihood 

-260,98467 

Prob  >  chi2 

=  0 . 0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

- .2294669 

.1207889 

-1.90 

0.057 

- .4662088 

.0072749 

dose 

.0244254 

.0045898 

5.32 

0.000 

.0154295 

.0334213 

clinic 

.7090414 

.1572246 

4.51 

0.000 

.4008867 

1.017196 

.cons 

4.104845 

.3280583 

12.51 

0.000 

3.461863 

4.747828 

/ln.p 

.3149526 

.0675583 

4.66 

0.000 

.1825408 

.4473644 

P 

1.370194 

.092568 

1.200263 

1.564184 

1/P 

.7298235 

.0493056 

.6393109 

.8331507 

The  relationship  between  the  hazard  ratio  parameter  pj 
and  the  AFT  parameter  ocj  is  ft  =  -a,jp.  For  example,  using 
the  coefficient  estimates  for  PRISON  in  the  Weibull  PH 
and  AFT  models  yields  the  relationship  0.3144  = 
(-0.2295X1.37). 

The  stcurv  can  again  be  used  following  the  streg  com¬ 
mand  to  obtain  fitted  survival,  hazard,  or  cumulative  haz¬ 
ard  curves.  The  following  code  obtains  the  estimated 
hazard  function  for  PRISON=0,  DOSE=40,  and  CLINIC=1: 


stcurv,  hazard  at  (prison=0  dose=40  clinic=l) 
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analysis  time 
Weibull  regression 


The  plot  of  the  hazard  is  monotonically  increasing.  With  a 
Weibull  distribution,  the  hazard  is  constrained  such  that  it 
cannot  increase  and  then  decrease.  This  is  not  the  case 
with  the  log  logistic  distribution  as  demonstrated  in  the 
next  example.  The  log  logistic  model  is  not  a  PH  model,  so 
the  default  model  for  the  streg  command  is  an  AFT  model. 
The  code  and  output  follow: 

streg  prison  dose  clinic,  dist(loglogistic) 


Log-logistic  regression  --  accelerated  failure-time  form 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  52.18 

Log  likelihood 

-270.42329 

Prob  >  chi2 

=  0 . 0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

prison 

- .2912719 

.1439646 

-2.02 

0,043 

- .5734373 

- .0091065 

dose 

.0316133 

.0055192 

5.73 

0.000 

.0207959 

.0424307 

clinic 

.5805977 

.1715695 

3.38 

0.001 

.2443276 

.9168677 

.cons 

3.563268 

.3894467 

9.15 

0.000 

2.799967 

4.32657 

/ln.gam 

- .5331424 

.0686297 

-7.77 

0.000 

- .6676541 

- .3986306 

gamma 

.5867583 

.040269 

.5129104 

.6712386 

Note  that  Stata  calls  the  shape  parameter  gamma  for  a 
log-logistic  model.  The  code  to  produce  the  graph  of  the 
hazard  function  for  PRISON=0,  DOSE=40,  and  CLINIC=1 
follows: 


stcurv,  hazard  at  (prison=0  dose=40  clinic=l) 


Software:  A.  Stata  559 


The  hazard  function  (in  contrast  to  the  Weibull  hazard 
function)  first  increases  and  then  decreases. 

The  corresponding  survival  curve  for  the  log  logistic  distri¬ 
bution  can  also  be  obtained  with  the  stcurve  command: 

stcurv,  survival  at  (prison=0  dose=40  clinic=  1 ) 


Log-logistic  regression 

If  the  AFT  assumption  holds  for  a  log  logistic  model,  then 
the  proportional  odds  assumption  holds  for  the  survival 
function  (although  the  PH  assumption  would  not  hold). 
The  proportional  odds  assumption  can  be  evaluated  by 
plotting  of  the  log  odds  of  survival  (using  KM  estimates) 
against  the  log  of  survival  time.  If  the  plots  are  straight 
lines  for  each  pattern  of  covariates,  then  the  log-logistic 
distribution  is  reasonable.  If  the  straight  lines  are  also 
parallel,  then  the  proportional  odds  and  AFT  assumptions 
also  hold.  The  following  code  will  plot  the  estimated  log 
odds  of  survival  against  the  log  of  time  by  CLINIC  (output 
omitted): 
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sts  generate  skm=s,  by(clinic) 
generate  logodds=ln(skm/(l-skm)) 
generate  logt=ln(survt) 

graph7  logodds  logt,  twoway  symbol( [clinic]  [clinic]) 

Another  context  for  thinking  about  the  proportional  odds 
assumption  is  that  the  odds  ratio  estimated  by  a  logistic 
regression  does  not  depend  on  the  length  of  the  follow-up. 
For  example,  if  a  follow-up  study  was  extended  from  3  to  5 
years,  then  the  underlying  odds  ratio  comparing  two  pat¬ 
terns  of  covariates  would  not  change.  If  the  proportional 
odds  assumption  is  not  true,  then  the  odds  ratio  is  specific 
to  the  length  of  follow-up. 

Both  the  log-logistic  and  Weibull  models  contain  an  extra 
shape  parameter  that  is  typically  assumed  constant.  This 
assumption  is  necessary  for  the  PH  or  AFT  assumption  to 
hold  for  these  models.  Stata  provides  a  way  of  modeling 
the  shape  parameter  as  a  function  of  predictor  variables  by 
use  of  the  ancillary  option  in  the  streg  command  (see 
Chapter  7  under  the  heading  "Other  Parametric  Models”). 
The  following  code  runs  a  log-logistic  model  in  which  the 
shape  parameter  gamma  is  modeled  as  a  function  of 
CLINIC  while  X  is  modeled  as  a  function  of  PRISON  and 
DOSE: 

streg  prison  dose,  dist(loglogistic)  ancillary(clinic) 

The  output  follows: 


Log-logistic  regression  --  accelerated  failure-time  form 


No.  of  subjects 

238 

Number  of  obs 

=  238 

No.  of  failures 

150 

Time  at  risk 

95812 

LR  chi2(2) 

=  38.87 

Log  likelihood  = 

-272.65273 

Prob  >  chi2 

=  0.0000 

_t 

Coef . 

Std .  Err. 

z 

P>|Z| 

[95%  Conf. 

Interval] 

.t 

prison 

- .3275695 

.1405119 

-2.33 

0.020 

- .6029677 

-.0521713 

dose 

.0328517 

.0054275 

6.05 

0.000 

.022214 

.0434893 

.cons 

4.183173 

.3311064 

12.63 

0.000 

3.534216 

4.83213 

In. gam 

clinic 

.4558089 

.1734819 

2.63 

0.009 

.1157906 

.7958273 

.cons 

-1.094496 

.2212143 

-4.95 

0.000 

-1.528068 

-.6609238 

Software:  A.  Stata  561 


Notice  there  is  a  parameter  estimate  for  CLINIC  as  well  as 
an  intercept  (_cons)  under  the  heading  ln_gam  (the  log  of 
gamma).  With  this  model,  the  estimate  for  gamma 
depends  on  whether  CLINIC=1  or  CLINIC=2.  There  is  no 
easy  interpretation  for  the  predictor  variables  in  this  type 
of  model,  which  is  why  it  is  not  commonly  used.  However, 
for  any  specified  value  of  PRISON,  DOSE,  and  CLINIC,  the 
hazard  and  survival  functions  can  be  estimated  by  substi¬ 
tuting  the  parameter  estimates  into  the  expressions  for  the 
log-logistic  hazard  and  survival  functions. 

Other  distributions  supported  by  streg  are  the  generalized 
gamma,  the  lognormal,  and  the  Gompertz  distributions. 


9.  RUNNING  FRAILTY  MODELS 

Frailty  models  contain  an  extra  random  component 
designed  to  account  for  individual-level  differences  in  the 
hazard  otherwise  unaccounted  for  by  the  model.  The 
frailty,  a,  is  a  multiplicative  effect  on  the  hazard  assumed 
to  follow  some  distribution.  The  hazard  function  condi¬ 
tional  on  the  frailty  can  be  expressed  as  h(t\a)  =  a[h(t)]. 

Stata  offers  two  choices  for  the  distribution  of  the  frailty: 
the  gamma  and  the  inverse-Gaussian,  both  of  mean  1  and 
variance  theta.  The  variance  (theta)  is  a  parameter  esti¬ 
mated  by  the  model.  If  theta  =  0,  then  there  is  no  frailty. 

For  the  first  example,  a  Weibull  PH  model  is  run  with 
PRISON,  DOSE,  and  CLINIC  as  predictors.  A  gamma  dis¬ 
tribution  is  assumed  for  the  frailty  component.  The  models 
in  this  section  were  run  using  Stata  8.0.  The  code  follows: 

streg  dose  prison  clinic,  dist(  weibull)  frailty  (gamma)  nohr 

The  frailty()  option  requests  that  a  frailty  model  be  run. 
The  output  follows: 
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Weibull  regression  -■ 


log  relative-hazard  form 
Gamma  frailty 


No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of 

failures  = 

15G 

Time  at 

risk 

95812 

LR  chi2(4) 

=  60.89 

Log  likelihood 

-26G. 98467 

Prob 

>  chi2 

=  0.0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

dose 

-.0334635 

. GG62553 

-5.35 

0.000 

- .0457237 

- .0212034 

prison 

.3143786 

.165953 

1.89 

0.058 

- .0108833 

.6396405 

clinic 

- .9714998 

. 21229G9 

-4.58 

0.000 

-1.387582 

- .5554173 

.cons 

-5.624342 

.6588994 

-8.54 

0.000 

-6.915761 

-4.332923 

/ln-p 

. 3149G36 

.0675772 

4.66 

0.000 

.1824548 

.4473525 

/In -the 

-15.37947 

722.4246 

-0.02 

0.983 

-1431.306 

1400.547 

P 

1.370127 

.0925893 

1.20016 

1.564166 

1/P 

.7298592 

.0493218 

.6393185 

.8332223 

theta 

2 . 09e-O7 

.0001512 

0 

Likelihood  ratio  test  of  theta  =  G:  chibar2(01)  =  G.00 
Prob>=chibar2  =  l.GGG 


Notice  that  there  is  one  additional  parameter  (theta)  com¬ 
pared  to  the  model  run  in  the  previous  section.  The  esti¬ 
mate  for  theta  is  2.09  times  10~7  or  0.000000209  which  is 
essentially  zero.  A  likelihood  ratio  test  for  the  inclusion  of 
theta  is  provided  at  the  bottom  of  the  output  and  yields  a 
chi-square  value  of  0.00  and  a  p-value  of  1.000.  The  frailty 
has  no  effect  on  the  model  and  need  not  be  included. 


The  next  model  will  be  the  same  as  the  previous  except  that 
CLINIC  will  not  be  included.  One  might  expect  a  frailty 
component  to  play  a  larger  role  if  an  important  covariate, 
such  as  CLINIC,  is  not  included  in  the  model.  The  code  and 
output  follow: 
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streg  dose  prison,  dist(weibull)  frailty  (gamma)  nohr 


Weibull  regression 

--  log  relative- 

hazard  form 

Gamma  frailty 

No.  of  subjects  = 

238 

Number  of  obs 

=  238 

No.  of  failures  = 

150 

Time  at  risk 

95812 

LR  chi2(3) 

=  36.00 

Log  likelihood 

-273.42782 

Prob  >  chi2 

=  0.0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

dose 

- .0358231 

.010734 

-3.34 

0.001 

- .0568614 

- .0147849 

prison 

.2234556 

.2141028 

1.04 

0.297 

- .1961783 

.6430894 

-cons 

-6.457393 

.6558594 

-9.85 

0.000 

-7.742854 

-5.171932 

/ln.p 

.2922832 

.1217597 

2.40 

0.016 

.0536385 

.5309278 

/In -the 

-2.849726 

5.880123 

-0.48 

0.628 

-14.37456 

8.675104 

P 

1.339482 

.163095 

1.055103 

1.700509 

1/P 

.7465571 

.0909006 

.5880591 

.9477747 

theta 

.0578602 

.340225 

5 . 72e-07 

5855.31 

Likelihood  ratio  test  of  theta  =  0:  chibar2(01)  =  0.03 
Prob>=chibar2  =0.432 


The  variance  (theta)  of  the  frailty  is  estimated  at 
0.0578602.  Although  this  estimate  is  not  exactly  zero  as  in 
the  previous  example,  the  p-value  for  the  likelihood  ratio 
test  for  theta  is  nonsignificant  at  0.432.  So  the  addition  of 
frailty  did  not  account  for  CLINIC  being  omitted  from  the 
model. 

Next,  the  same  model  is  run  except  that  the  inverse-Gauss- 
ian  distribution  is  used  for  the  frailty  rather  than  the 
gamma  distribution.  The  code  and  output  follow: 
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streg  dose  prison,  dist(weibull)  frailty(invgaussian)  nohr 


Weibull  regression  --  log  relative-hazard  form 

Inverse-Gaussian  frailty 


No.  of  subjects  = 
No.  of  failures  = 
Time  at  risk 

Log  likelihood 

238 

150 

95812 

-273.43201 

Number  of  obs 

LR  chi2(3) 

Prob  >  chi2 

=  238 

=  35.99 

=  0.0000 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

dose 

- .0353908 

.0096247 

-3.68 

0.000 

- .0542549 

- .0165268 

prison 

.2166456 

.1988761 

1.09 

0.276 

- .1731445 

.6064356 

-cons 

-6.448779 

.6494397 

-9.93 

0.000 

-7.721658 

-5.175901 

/ln.p 

.2875567 

.1122988 

2.56 

0.010 

.0674551 

.5076583 

/In -the 

-3.137696 

7.347349 

-0.43 

0.669 

-17.53824 

11.26284 

P 

1/P 

theta 

1.333166 

.7500941 

.0433827 

.1497129 

.0842347 

.3187475 

1.069782 

.6019034 

2.42e-08 

1.661396 

.9347697 

77873.78 

Likelihood  ratio  test  of  theta  =  0:  chibar2(01)  =  0.02 
Prob>=chibar2  =  0.443 


The  p-value  for  the  likelihood  ratio  test  for  theta  is  0.443  (at 
the  bottom  of  the  output).  The  results  in  this  example  are 
very  similar  whether  assuming  the  inverse-Gaussian  or  the 
gamma  distribution  for  the  frailty  component. 


An  example  of  shared  frailty  applied  to  recurrent  event 
data  is  shown  in  the  next  section. 


10.  MODELING  RECURRENT  EVENTS 

The  modeling  of  recurrent  events  is  illustrated  with  the 
bladder  cancer  dataset  (bladder.dta)  described  at  the 
start  of  this  appendix.  Recurrent  events  are  represented 
in  the  data  with  multiple  observations  for  subjects  having 
multiple  events.  The  data  layout  for  the  bladder  cancer 
dataset  is  suitable  for  a  counting  process  approach  with 
time  intervals  defined  for  each  observation  (see  Chapter  8). 
The  following  code  prints  the  12th-20th  observation,  which 
contains  information  for  four  subjects.  The  code  and  out¬ 
put  follow: 
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list  in  12/20 


id 

event 

interval 

start 

stop 

tx 

num 

size 

12. 

10 

1 

1 

0 

12 

0 

1 

1 

13. 

10 

1 

2 

12 

16 

0 

1 

1 

14. 

10 

0 

3 

16 

18 

0 

1 

1 

15. 

11 

0 

1 

0 

23 

0 

3 

3 

16. 

12 

1 

1 

0 

10 

0 

1 

3 

17. 

12 

1 

2 

10 

15 

0 

1 

3 

18. 

12 

0 

3 

15 

23 

0 

1 

3 

19. 

13 

1 

1 

0 

3 

0 

1 

1 

20. 

13 

1 

2 

3 

16 

0 

1 

1 

There  are  three  observations  for  ID=10,  one  observation 
for  ID=1 1,  three  observations  for  ID=12,  and  two  observa¬ 
tions  for  ID=13.  The  variables  START  and  STOP  represent 
the  time  interval  for  the  risk  period  specific  to  that  obser¬ 
vation.  The  variable  EVENT  indicates  whether  an  event 
(coded  1)  occurred.  The  first  three  observations  indicate 
that  the  subject  with  ID=10  had  an  event  at  12  months, 
another  event  at  16  months,  and  was  censored  at  18 
months. 

Before  using  Stata’s  survival  commands,  the  stset  com¬ 
mand  must  be  used  to  define  the  key  survival  variables. 
The  code  follows: 

stset  stop,  failure(event==l)  id(id)  timeO(start)  exit(time.) 

We  have  previously  used  the  stset  command  on  the 
“addicts”  dataset,  but  more  options  from  stset  are  included 
here.  The  id()  option  defines  the  subject  variable  (i.e.,  the 
cluster  variable),  the  time0()  option  defines  the  variable 
that  begins  the  time  interval,  and  the  exit(time  .)  option 
tells  Stata  that  there  is  no  imposed  limit  on  the  length  of 
follow-up  time  for  a  given  subject  (e.g.,  subjects  are  not  out 
of  the  risk  set  after  their  first  event).  With  the  stset  com¬ 
mand,  Stata  creates  the  variables  _t0,  _t,  and  _d,  which 
Stata  automatically  recognizes  as  survival  variables  repre¬ 
senting  the  time  interval  and  event  status.  Actually,  the 
time0()  option  could  have  been  omitted  from  this  stset 
command  and  by  default  Stata  would  have  created  the 
starting  time  variable,  _t0,  in  the  correct  counting  process 
format  as  long  as  the  id()  option  was  used  (otherwise  _t0 
would  default  to  zero).  The  following  code  (and  output) 
lists  the  12th-20th  observation  with  the  newly  created  vari¬ 
ables: 
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list  id  _t0  _t  _d  tx  in  12/20 


id 

.to 

-t 

-d 

tx 

12. 

10 

0 

12 

1 

0 

13. 

10 

12 

16 

1 

0 

14. 

10 

16 

18 

0 

0 

15. 

11 

0 

23 

0 

0 

16. 

12 

0 

10 

1 

0 

17. 

12 

10 

15 

1 

0 

18. 

12 

15 

23 

0 

0 

19. 

13 

0 

3 

1 

0 

20. 

13 

3 

16 

1 

0 

A  Cox  model  with  recurrent  events  using  the  counting 
process  approach  can  now  be  run  with  the  stcox  com¬ 
mand.  The  predictors  are  treatment  status  (TX),  initial 
number  of  tumors  (NUM),  and  the  initial  size  of  tumors 
(SIZE).  The  robust  option  requests  robust  standard  errors 
for  the  coefficient  estimates.  Omit  the  nohr  option  if  you 
want  the  exponentiated  coefficients.  The  code  and  output 
follow: 


stcox  tx  num  size,  nohr  robust 


Cox  regression  - 

-  Breslow  method 

for  ties 

No.  of  subjects 

=  85 

Number  of  obs 

=  190 

No.  of  failures 

=  112 

Time  at  risk 

=  2711 

Wald  chi2(3) 

=  11.25 

Log  likelihood 

=  -460.07958 

Prob  >  chi2 

=  0.0105 

(standard  errors  adjusted  for  clustering  on  id) 


-t 

-d 

Coef . 

Robust 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

tx 

- .4070966 

.2432658 

-1.67 

0.094 

- .8838889 

.0696956 

num 

.1606478 

.0572305 

2.81 

0.005 

.0484781 

.2728174 

size 

- .0400877 

.0726459 

-0.55 

0.581 

- .182471 

.1022957 

The  interpretation  of  these  parameter  estimates  is  dis¬ 
cussed  in  Chapter  8 
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A  stratified  Cox  model  can  also  be  run  using  the  data  in  this 
format  with  the  variable  INTERVAL  as  the  stratified  vari¬ 
able.  The  stratified  variable  indicates  whether  subjects 
were  at  risk  for  their  1st,  2nd,  3rd,  or  4th  event.  This 
approach  is  called  a  Stratified  CP  approach  in  Chap. 
8  and  is  used  if  the  investigator  wants  to  distinguish  the 
order  in  which  recurrent  events  occur.  The  code  and  out¬ 
put  follow: 

stcox  tx  num  size,  nohr  robust  strata(interval) 


stratified  Cox  regr. 

—  Breslow 

method  for  ties 

No.  of  subjects  = 

85 

Number  of  obs 

=  190 

No.  of  failures  = 

112 

Time  at  risk 

2711 

Wald  chi2(3) 

=  7.11 

Log  likelihood 

-319.85912 

Prob  >  chi2 

=  0.0685 

(standard  errors  adjusted  for  clustering  on  id) 


-t 

-d 

Coef . 

Robust 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

tx 

- .3342955 

.1982339 

-1.69 

0.092 

-.7228268 

.0542359 

num 

.1156526 

.0502089 

2.30 

0.021 

.017245 

.2140603 

size 

- .0080508 

.0604807 

-0.13 

0.894 

-.1265908 

.1104892 

Stratified  by  interval 


Interaction  terms  between  the  treatment  variable  (TX)  and 
the  stratified  variable  could  be  created  to  examine  whether 
the  effect  of  treatment  differed  for  the  1st,  2nd,  3rd,  or  4th 
event.  (Note  that  in  this  dataset,  subjects  have  a  maximum 
of  4  events). 

Another  stratified  approach  (called  Gap  Time)  is  a  slight 
variation  of  the  Stratified  CP  approach.  The  difference  is  in 
the  way  the  time  intervals  for  the  recurrent  events  are 
defined.  There  is  no  difference  in  the  time  intervals  when 
subjects  are  at  risk  for  their  first  event.  However,  with  the 
Gap  Time  approach,  the  starting  time  at  risk  gets  reset  to 
zero  for  each  subsequent  event.  The  following  code  creates 
data  suitable  for  running  a  Gap  Time  recurrent  event 
model. 


generate  stop2  =_t  -  _t0 

stset  stop2,  failure(event==l)  exit(time  .) 
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The  generate  command  defines  a  new  variable  called 
STOP2  representing  the  length  of  the  time  interval  for 
each  observation.  The  stset  command  is  used  with 
STOP2  as  the  outcome  variable  (_t).  By  default,  Stata  sets 
the  variable  _t0  to  zero.  The  following  code  (and  output) 
lists  the  12th  through  20th  observations  for  selected  vari¬ 
ables. 

list  id  _t0  _t  _d  tx  in  12/20 


id 

.to 

-t 

-d 

tx 

12. 

10 

0 

12 

1 

0 

13. 

10 

0 

4 

1 

0 

14. 

10 

0 

2 

0 

0 

15. 

11 

0 

23 

0 

0 

16. 

12 

0 

10 

1 

0 

17. 

12 

0 

5 

1 

0 

18. 

12 

0 

8 

0 

0 

19. 

13 

0 

3 

1 

0 

20. 

13 

0 

13 

1 

0 

Notice  that  the  id()  option  was  not  used  with  the  stset 
command  for  the  Gap  Time  approach.  This  means  that 
Stata  does  not  know  that  multiple  observations  correspond 
to  the  same  subject.  However,  the  cluster()  option  can  be 
used  directly  in  the  stcox  command  to  request  that  the 
analysis  be  clustered  by  ID  (i.e.,  by  subject).  The  following 
code  runs  a  stratified  Cox  model  using  the  Gap  Time 
approach  with  the  cluster()  and  robust  options.  The 
code  and  output  follow: 

stcox  tx  num  size,  nohr  robust  strata(interval)  cluster(id) 


Stratified  Cox  regr. 

—  Breslow 

method  for  ties 

No.  of  subjects  = 

190 

Number  of  obs 

=  190 

No.  of  failures  = 

112 

Time  at  risk 

2711 

Wald  chi2(3) 

=  11.99 

Log  likelihood 

-363.16022 

Prob  >  chi2 

=  0.0074 

(standard  errors  adjusted  for  clustering  on  id) 


-t 

-d 

Coef . 

Robust 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

tx 

-  .2695213 

.2093108 

-1.29 

0.198 

- .6797628 

.1407203 

num 

.1535334 

.0491803 

3.12 

0.002 

.0571418 

.2499249 

size 

.0068402 

.0625862 

0.11 

0.913 

- .1158265 

.129507 

Stratified  by  interval 


Software:  A.  Stata  569 


The  results  using  the  Gap  Time  approach  vary  slightly 
from  that  obtained  using  the  Stratified  CP  approach. 

Next,  we  demonstrate  how  a  shared  frailty  model  can  be 
applied  to  recurrent  event  data.  Frailty  is  included  in  recur¬ 
rent  event  analyses  to  account  for  variability  due  to  unob¬ 
served  subject-specific  factors  that  may  lead  to  within- 
subject  correlation. 

Before  running  the  model,  we  rerun  the  stset  command 
shown  earlier  in  this  section  to  get  the  data  back  to  the 
form  suitable  for  a  counting  process  approach.  The  code 
follows: 

stset  stop,  failure(event==l)  id(id)  timeO(start)  exitftime  .) 

Next  a  parametric  Weibull  model  is  run  with  a  gamma- 
distributed  shared  frailty  component  using  the  streg  com¬ 
mand.  We  use  the  same  three  predictors  for  comparability 
with  the  other  models  presented  in  this  section.  The  code 
follows: 

streg  tx  num  size,  dist(weibull)  frailty(gamma)  shared(id)  nohr 

The  dist()  option  requests  the  distribution  for  the  para¬ 
metric  model.  The  frailtyf)  option  requests  the  distribu¬ 
tion  for  the  frailty  and  the  sharedf)  option  defines  the 
cluster  variable,  ID.  For  this  model,  observations  from  the 
same  subject  share  the  same  frailty.  The  output  follows: 


Weibull  regression  —  log  relative-hazard  form 

Gamma  shared  frailty 


No.  of  subjects 

85 

Number  of  obs 

=  190 

No.  of  failures 

112 

Time  at  risk  = 

2711 

LR  chi2(3) 

=  8.04 

Log  likelihood  = 

-184.73658 

Prob  >  chi2 

=  0.0453 

-t 

Coef . 

Std .  Err. 

z 

P>|z| 

[95%  Conf. 

Interval] 

tx 

- .4583219 

.2677275 

-1.71 

0.087 

- .9830582 

.0664143 

num 

.1847305 

.0724134 

2.55 

0.011 

.0428028 

.3266581 

size 

-.0314314 

.0911134 

-0.34 

0.730 

- .2100104 

.1471476 

.cons 

-2.952397 

.4174276 

-7.07 

0.000 

-3.77054 

-2.134254 

/ln-p 

-.1193215 

.0898301 

-1.33 

0.184 

- .2953852 

.0567421 

/In  -the 

-.7252604 

.5163027 

-1.40 

0.160 

-1.737195 

.2866742 

P 

.8875224 

.0797262 

.7442449 

1.058383 

1/P 

1.126732 

.1012144 

.9448377 

1.343644 

theta 

.4841985 

.249993 

.1760134 

1.33199 

Likelihood  ratio  test  of  theta=0:  chibar2(01)  =7.34 
Prob>=chibar2  =  0.003 
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The  model  output  is  discussed  in  Chapter  8. 

The  counting  process  data  layout  with  multiple  observa¬ 
tions  per  subject  need  not  only  apply  to  recurrent  event 
data,  but  can  also  be  used  for  a  more  conventional  survival 
analyses  in  which  each  subject  is  limited  to  one  event.  A 
subject  with  four  observations  may  be  censored  for  the 
first  three  observations  before  getting  the  event  in  the 
time  interval  represented  by  the  fourth  observation.  This 
data  layout  is  particularly  suitable  for  representing  time- 
varying  exposures,  which  may  change  values  over  different 
intervals  of  time  (see  the  stsplit  command  in  Section  7  of 
this  appendix). 


B.  SAS 

Analyses  are  carried  out  in  SAS  by  using  the  appropriate 
SAS  procedure  on  a  SAS  dataset.  The  key  SAS  procedures 
for  performing  survival  analyses  are: 

PROC  LIFETEST  -  This  procedure  is  used  to  obtain 
Kaplan-Meier  survival  estimates  and  plots.  It  can  also 
be  used  to  output  life  table  estimates  and  plots.  It  will 
generate  output  for  the  log  rank  and  Wilcoxon  test  sta¬ 
tistics  if  stratifying  by  a  covariate.  A  new  SAS  dataset 
containing  survival  estimates  can  be  requested. 

PROC  PHREG  -  This  procedure  is  used  to  run  the  Cox 
proportional  hazards  model,  a  stratified  Cox  model,  and 
an  extended  Cox  model  with  time-varying  covariates.  It 
can  also  be  used  to  create  a  SAS  dataset  containing 
adjusted  survival  estimates.  These  adjusted  survival  esti¬ 
mates  can  then  be  plotted  using  PROC  GPLOT. 

PROC  LIFEREG  -  This  procedure  is  used  to  run  para¬ 
metric  accelerated  failure  time  AFT  models. 

Analyses  on  the  "addicts”  dataset  will  be  used  to  illustrate 
these  procedures.  The  “addicts”  dataset  was  obtained  from 
a  1991  Australian  study  by  Caplehorn  et  al.  and  contains 
information  on  238  heroin  addicts.  The  study  compared 
two  methadone  treatment  clinics  to  assess  patient  time 
remaining  under  methadone  treatment.  The  two  clinics 
differed  according  to  its  live-in  policies  for  patients. 
A  patient’s  survival  time  was  determined  as  the  time  (in 
days)  until  the  person  dropped  out  of  the  clinic  or  was 
censored.  The  variables  are  defined  at  the  start  of  this 
appendix. 


Software:  B.  Sas  571 


All  of  the  SAS  programming  code  will  be  written  in  capital 
letters  for  readability.  However,  SAS  is  not  case  sensitive. 
If  a  program  is  written  with  lower-case  letters,  SAS  reads 
them  as  upper  case.  The  number  of  spaces  between 
words  (if  more  than  one)  has  no  effect  on  the  program. 
Each  SAS  programming  statement  ends  with  a  semicolon. 

The  addicts  dataset  is  stored  as  a  permanent  SAS  dataset 
called  addicts.sas7bdat.  A  LIBNAME  statement  is  needed 
to  indicate  the  path  to  the  location  of  the  SAS  dataset.  In 
our  examples,  we  assume  the  file  is  located  on  the  C  drive. 
The  LIBNAME  statement  includes  a  reference  name  as 
well  as  the  path.  We  call  the  reference  name  REF.  The 
code  is  as  follows: 

LIBNAME  REF  '  C  :  \ ’ ; 

The  user  is  free  to  define  his/her  own  reference  name. 
The  path  to  the  location  of  the  file  is  given  between  the 
quotation  marks.  The  general  form  of  the  code  is: 


LIBNAME  Your  reference  name  'Your  path  to  file  location'; 

PROC  CONTENTS,  PROC  PRINT,  PROC  UNIVARIATE, 
PROC  FREQ,  and  PROC  MEANS  can  be  used  to  list  or 
describe  the  data.  SAS  code  can  be  run  in  one  batch  or 
highlighted  and  submitted  one  procedure  at  a  time.  Code 
can  be  submitted  by  clicking  on  the  submit  button  on  the 
toolbar  in  the  Editor  window.  The  code  for  using  these 
procedures  follows  (output  omitted): 

PROC  CONTENTS  DATA=REF.ADDICTS;RUN; 

PROC  PRINT  DATA=REF.ADDICTS;RUN; 

PROC  UNIVARIATE  DATA=REF.ADDICTS;VAR  SURVT;RUN; 

PROC  FREQ  DATA=REF.ADDICTS;TABLES  CLINIC  PRISON;RUN; 

PROC  MEANS  DATA=REF.ADDICTS;VAR  SURVT;CLAS  CLINIC;RUN; 

Notice  that  each  SAS  statement  ends  with  a  semicolon. 
If  each  procedure  is  submitted  one  at  a  time,  then  each 
procedure  must  end  with  a  RUN  statement.  Otherwise  one 
RUN  statement  at  the  end  of  the  last  procedure  is  suffi¬ 
cient.  With  the  LIBNAME  statement,  SAS  recognizes  a 
two-level  file  name:  the  reference  name  and  the  file  name 
without  an  extension.  For  our  example,  the  SAS  file  name 
is  REF.ADDISTS.  Alternatively,  a  temporary  SAS  dataset 
could  be  created  and  used  for  these  procedures. 
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Text  that  you  do  not  wish  SAS  to  process  can  be  written  as 
a  comment: 

/*  A  comment  begins  with  a  forward  slash  followed  by  a 
star  and  ends  with  a  star  followed  by  a  forward  slash.  */ 
*  A  comment  can  also  be  created  by  beginning  with  a  star 
and  ending  with  a  semicolon; 

The  survival  analyses  demonstrated  in  SAS  are  as  follows: 

1 .  Demonstrating  PROC  LIFETEST  to  obtain  Kaplan- 
Meier  and  life  table  survival  estimates  (and  plots). 

2.  Running  a  Cox  PH  model  with  PROC  PHREG. 

3.  Running  a  stratified  Cox  model. 

4.  Assessing  the  PH  assumption  with  a  statistical  test. 

5.  Obtaining  Cox  adjusted  survival  curves. 

6.  Running  an  extended  Cox  model  (i.e.,  containing  time 
varying  covariates). 

7.  Running  parametric  models  with  PROC  LIFEREG. 

8.  Modeling  recurrent  events 

1 .  DEMONSTRATING  PROC  LIFETEST  TO  OBTAIN 
KM  AND  LIFE  TABLE  SURVIVAL  ESTIMATES 
(AND  PLOTS) 

PROC  LIFETEST  produces  Kaplan-Meier  survival  esti¬ 
mates  with  the  METHOD=KM  option.  The  PLOTS=(S) 
option  plots  the  estimated  survival  function.  The  TIME 
statement  defines  the  time-to-event  variable  (SURVT)  and 
the  value  for  censorship  (STATUS=0).  The  code  follows 
(output  omitted): 

PROC  LIFETEST  DATA=REF . ADDICTS  METH0D=KM  PL0TS=(S); 
TIME  SURVT*STATUS(0); 

RUN; 


Use  a  STRATA  statement  in  PROC  LIFETEST  to  compare 
survival  estimates  for  different  groups  (e.g.,  strata  clinic). 
The  PLOTS=(S,  LLS)  option  produces  log-log  curves  as 
well  as  survival  curves.  If  the  PH  assumption  is  met,  the 
log-log  survival  curves  will  be  parallel.  The  STRATA  state¬ 
ment  also  provides  the  log  rank  test  and  Wilcoxon  test 
statistics.  The  code  follows: 


Software:  B.  Sas  573 


PROC  LIFETEST  DATA=REF . ADDICTS  METHOD=KM  PL0TS=(S , LLS) ; 
TIME  SURVT*STATUS(0) ; 

STRATA  CLINIC; 

RUN; 

PROC  LIFETEST  yields  the  following  edited  output: 


The  LIFETEST  Procedure  (stratified) 
Stratum  1:  CLINIC  =  1 


Product- Limit  Survival  Estimates 


Survival 

Standard 

Number 

Number 

SURVT 

Survival 

Failure 

Error 

Failed 

Left 

0.00 

1 . 0000 

0 

0 

0 

163 

2.00* 

0 

162 

7.00 

0.9938 

0.00617 

0.00615 

1 

161 

17.00 

0.9877 

0.0123 

0.00868 

2 

160 

836.00 

0.0869 

0.9131 

0.0295 

118 

6 

837.00 

0.0725 

0.9275 

0.0279 

119 

5 

Stratum  2:  CLINIC  =  2 
Product- Limit  Survival  Estimates 


SURVT 

Survival 

Failure 

0.00 

1 . 0000 

0 

2.00* 

13.00 

0.9865 

0.0135 

26.00 

0.9730 

0.0270 

Survival 

Standard 

Number 

Number 

Error 

Failed 

Left 

0 

0 

75 

0 

74 

0.0134 

1 

73 

0.0189 

2 

72 

Test  of  Equality  over  Strata 


Test 


Chi-Square 


Pr  > 

DF  Chi-Square 


Log-Rank  27.8927  1 
Wilcoxon  11.6268  1 
-2Log(LR)  26.G236  1 


<.0001 

0.0007 

<.0001 
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Both  the  log  rank  and  Wilcoxon  test  yield  highly  significant 
chi-square  test  statistics.  The  Wilcoxon  test  is  a  variation  of 
the  log  rank  test  weighting  the  observed  minus  expected 
score  of  the  jth  failure  time  by  nj  (the  number  still  at  risk  at 
the  jth  failure  time). 

The  requested  log-log  plots  from  PROC  LIFETEST  follow: 


Log  of  survival  time  in  days 
STRATA:  CLINIC=1  CLINIC=2 

SAS  (as  well  as  Stata  and  R)  plots  log(survival  time)  rather 
than  survival  time  on  the  horizontal  axis  by  default  for  log- 
log  curves.  As  far  as  checking  the  parallel  assumption,  it 
does  not  matter  if  log(survival  time)  or  survival  time  is  on 
the  horizontal  axis.  However,  if  the  log-log  survival  curves 
look  like  straight  lines  with  log(survival  time)  on  the  hori¬ 
zontal  axis,  then  there  is  evidence  that  the  “time-to-event” 
variable  follows  a  Weibull  distribution.  If  the  slope  of  the 
line  equals  one,  then  there  is  evidence  that  the  survival 
time  variable  follows  an  exponential  distribution  -  a  spe¬ 
cial  case  of  the  Weibull  distribution.  For  these  situations,  a 
parametric  survival  model  can  be  used. 

You  can  gain  more  control  over  how  variables  are  plotted, 
by  creating  a  dataset  that  contains  the  survival  estimates. 
Use  the  OUTSURV=  option  in  the  PROC  LIFETEST  state¬ 
ment  to  create  a  SAS  data  containing  the  KM  survival 
estimates.  The  option  OUTSURV=DOG  creates  a  dataset 
called  dog  (make  up  your  own  name)  containing  the  sur¬ 
vival  estimates  in  a  variable  called  SURVIVAL.  The  code 
follows: 

PROC  LIFETEST  DATA=REF . ADDICTS  METH0D=KM  0UTSURV=D0G; 
TIME  SURVT*STATUS(0) ; 

STRATA  CLINIC; 

RUN; 
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Data  dog  contains  the  survival  estimates  but  not  the 
log(-(log))  of  the  survival  estimates.  Data  cat  is  created  in 
the  following  code  from  data  dog  (using  the  statement  SET 
DOG)  and  defines  a  new  log-log  variable  called  LLS. 

DATA  CAT; 

SET  DOC; 

L LS = LOG (- LOG (SURVIVAL)) ; 

RUN; 

In  SAS,  the  LOG  function  returns  the  natural  log,  not  the 
log  base  10. 

PROC  PRINT  prints  the  data  in  the  output  window. 

PROC  PRINT  DATA=CAT ;  RUN; 

The  first  10  observations  from  PROC  PRINT  are  listed 
below: 


Obs 

CLINIC 

SURVT 

-CENS0R- 

SURVIVAL 

LLS 

1 

1 

0 

0 

1.00000 

2 

1 

2 

1 

1.00000 

3 

1 

7 

0 

0.99383 

-5.08450 

4 

1 

17 

0 

0.98765 

-4.38824 

5 

1 

19 

0 

0.98148 

-3.97965 

6 

1 

28 

1 

0.98148 

-3.97965 

7 

1 

28 

1 

0.98148 

-3.97965 

8 

1 

29 

0 

0.97523 

-3.68561 

9 

1 

30 

0 

0.96898 

-3.45736 

10 

1 

33 

0 

0.96273 

-3.27056 

The  PLOT  LLS*SURVT=CLINIC  statement  puts  the  vari¬ 
able  LLS  (the  log-log  survival  variables)  on  the  vertical  axis 
and  SURVT  on  the  horizontal  axis,  stratified  by  CLINIC. 
The  SYMBOL  option  can  be  used  to  choose  plotting  colors 
for  each  level  of  clinic.  The  code  and  output  for  plotting  the 
log  log  curves  by  CLINIC  follow: 

SYMBOL  C0L0R=BLUE ; 

SYMB0L2  C0L0R=RED ; 

PROC  GPLOT  DATA=CAT ; 

PLOT  LLS*SURVT= CLINIC ; 

RUN; 
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I  Is 


survival  time  in  days 
Coded  1  or  2  a  a  a  1  •  a  a2 

The  plot  has  survival  time  (in  days)  rather  than  the  default 
log(survival  time).  The  log-log  survival  plots  look  parallel 
for  CLINIC  the  first  365  days  but  then  seem  to  diverge.  This 
information  can  be  utilized  when  developing  an  approach 
for  modeling  CLINIC  with  a  time  dependent  variable  in  an 
extended  Cox  model. 

You  can  also  obtain  survival  estimates  using  life  tables. 
This  method  is  useful  if  you  do  not  have  individual  level 
survival  information  but  rather  have  group  survival  infor¬ 
mation  for  specified  time  intervals.  The  user  determines 
the  time  intervals  using  the  INTERVALS=  option.  The  code 
follows  (output  omitted): 

PROC  LIFETEST  DATA=REF . ADDICTS 
METH0D=LT  INTERVALS=  50  100  150 
200  TO  1000  BY  100  PL0TS=(S); 

TIME  SURVT*STATUS(0); 

RUN; 


2.  RUNNING  A  COX  PROPORTIONAL  HAZARD 
MODEL  WITH  PROC  PHREG 

PROC  PHREG  is  used  to  request  a  Cox  proportional 
hazards  model.  The  code  follows: 


PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)=  PRISON  DOSE  CLINIC; 
RUN; 


Software:  B.  Sas  577 


The  code  SURVT*STATUS(0),  in  the  MODEL  statement 
specifies  the  time-to-event  variable  (SURVT)  and  the 
value  for  censorship  (STATUS=0).  Three  predictors  are 
included  in  the  model:  PRISON,  DOSE,  and  CLINIC.  The 
option  RL  in  the  MODEL  statement  of  PROC  PHREG 
provides  95%  confidence  intervals  for  the  hazard  ratio 
estimates.  The  PH  assumption  is  assumed  to  follow  for 
each  of  these  predictors  (perhaps  incorrectly).  The  output 
produced  by  PROC  PHREG  follows: 


The  PHREG  Procedure 
Model  Information 

Data  Set 

Dependent  Variable 
Censoring  Variable 
Censoring  Value(s) 

Ties  Handling 


REF. ADDICTS 

SURVT  survival  time  in  days 

STATUS  status  (0=censoredj  l=endpoint) 

0 

BRESLOW 


Summary  of  the  Number  of  Event  and  Censored  Values 

Percent 

Total  Event  Censored  Censored 

238  150  88  36.97 

Model  Fit  Statistics 

Without  With 

Criterion  Covariates  Covariates 


-2  LOG  L 

AIC 

SBC 


1411.324 

1411.324 

1411.324 


1346.805 

1352.805 
1361.837 


Analysis  of  Maximum  Likelihood  Estimates 


Parameter 

DF 

Parameter 

Estimate 

Standard 

Error 

Chi-Square 

Pr  >  ChiSq 

Hazard 

Ratio 

95%  Hazard 
Confidence 

Ratio 

Limits 

PRISON 

1 

0.32650 

0.16722 

3.8123 

0.0509 

1.386 

0.999 

1.924 

DOSE 

1 

-0.03540 

0.00638 

30.7844 

<.0001 

0.965 

0.953 

0.977 

CLINIC 

1 

-1.00876 

0.21486 

22.0419 

<•0001 

0.365 

0.239 

0.556 

The  table  above  lists  the  parameter  estimates  for  the 
regression  coefficients,  their  standard  errors,  a  Wald  chi- 
square  test  statistic  for  each  predictor,  and  corresponding 
p-value.  The  column  labeled  HAZARD  RATIO  gives  the 
estimated  hazard  ratio  per  one-unit  change  in  each  predic¬ 
tor  by  exponentiating  the  estimated  regression  coeffi¬ 
cients.  The  final  two  columns  give  the  95%  confidence 
limits  for  this  hazard  ratio. 


You  can  use  the  TIES=EXACT  option  in  the  model 
statement  rather  than  run  the  default  TIES=BRESLOW 
option  that  was  used  in  the  previous  model.  The  TIES= 
EXACT  option  is  a  computationally  intensive  method  to 
handle  events  that  occur  at  the  same  time.  If  many  events 


578  Computer  Appendix:  Survival  Analysis  on  the  Computer 


occur  simultaneously  in  the  data,  then  the  TIES=EXACT 
option  is  preferred.  Otherwise,  the  difference  between  this 
option  and  the  default  is  slight.  The  TIES=EFRON  option 
is  another  tie-handling  approach  that  SAS  offers.  The 
TIES=EFRON  is  the  default  method  used  in  R. 


PROC  PHREC  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)=  PRISON  DOSE  CLINIC/TIES=EXACT  RL; 
RUN; 

The  output  follows: 


The  PHREC  Procedure 
Model  Information 


Data  Set 

Dependent  Variable 
Censoring  Variable 
Censoring  Value(s) 
Ties  Handling 


REF. ADDICTS 
SURVT 
STATUS 
0 

EXACT 


survival  time  in  days 

status  (0=censored,  l=endpoint) 


Analysis  of  Maximum  Likelihood  Estimates 


Variable 

DF 

Parameter 

Estimate 

Standard 

Error 

Chi-Square 

Pr  >  ChiSq 

Hazard 

Ratio 

95%  Hazard 
Confidence 

Ratio 

Limits 

PRISON 

1 

0.32657 

0.16723 

3.8135 

0.0508 

1.386 

0.999 

1.924 

DOSE 

1 

-0.03537 

0.00638 

30.7432 

< .  0001 

0.965 

0.953 

0.977 

CLINIC 

1 

-1.00980 

0.21488 

22.0832 

<.0001 

0.364 

0.239 

0.555 

The  parameter  estimates  and  their  standard  errors  vary 
only  slightly  from  the  previous  model  without  the  TIE- 
S=EXACT  option.  Notice  that  the  type  of  ties-handling 
approach  is  listed  in  the  table  called  MODEL  INFORMA¬ 
TION  in  the  output. 


Suppose  we  wish  to  assess  interaction  between  PRISON 
and  CLINIC  and  between  PRISON  and  DOSE.  We  can 
define  two  interaction  terms  in  a  new  temporary  SAS  data¬ 
set  (called  addicts2)  and  then  run  a  model  containing  those 
terms.  Product  terms  for  CLINIC  times  PRISON  (called 
CLIN_PR)  and  CLINIC  time  DOSE  (called  CLIN_DO)  are 
defined  in  the  following  data  step: 


DATA  ADDICTS2; 

SET  REF. ADDICTS; 
CLIN-PR=CLINIC*  PRISON; 
CLIN-DO=CLINIC*DOSE ; 
RUN; 


Software:  B.  Sas  579 


The  interaction  terms  (called  CLIN_PR  and  CLIN_DO)  are 
then  added  to  the  model.  The  CONTRAST  statement  can 
be  used  to  test  the  two  interaction  terms  simultaneously 
with  a  generalized  Wald  test.  After  the  word  CONTRAST  is 
a  user-supplied  label  in  quotes  (i.e. ,  the  user’s  option  what 
to  put  in  quotes).  Then  the  tested  covariates  (the  product 
terms)  are  listed  followed  by  a  1  and  separated  by  a  comma 
(see  code  below): 


PROC  PHREG  DATA=ADDICTS2; 

MODEL  SURVT  *  ST  ATUS ( 0 ) =  PRISON  DOSE  CLINIC  CLIN_PR  CLINJ30; 
CONTRAST  "TEST  INTERACTION"  CLIN_PR  1,  CLIN_DO  1; 

RUN; 


The  PROC  PHREG  output  follows: 


Data  Set 

Dependent  Variable 
Censoring  Variable 
Censoring  Value(s) 
Ties  Handling 


The  PHREG  Procedure 
Model  Information 
WORK .ADDICTS2 

SURVT  survival  time  in  days 

STATUS  status  (0=censored,  l=endpoint) 

0 

BRESLOW 


Model  Fit  Statistics 
Without 

Criterion  Covariates 


With 

Covariates 


-2  LOG  L  1411 
AIC  1411 
SBC  1411 


324 

1343.199 

324 

1353.199 

324 

1368.253 

Analysis  of  Maximum  Likelihood  Estimates 


Variable 

DF 

Parameter 

Estimate 

Standard 

Error 

Chi-Square  Pr 

>  ChiSq 

Hazard 

Ratio 

PRISON 

1 

1.19200 

0.54137 

4.8480 

0.0277 

3.294 

DOSE 

1 

-0.01932 

0.01935 

0.9967 

0.3181 

0.981 

CLINIC 

1 

0.17469 

0.89312 

0.0383 

0.8449 

1.191 

CLIN_PR 

1 

-0.73799 

0.43149 

2.9253 

0.0872 

0.478 

CLIN  DO 

1 

-0.01386 

0.01433 

0.9359 

0.3333 

0.986 

Contrast  Test  Results 


Contrast 

DF 

Wald 

Chi-Square 

Pr  >  ChiSq 

TEST  INTERACTION 

2 

3.5803 

0.1669 
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The  estimates  of  the  hazard  ratios  (left  column)  maybe  decep¬ 
tive  when  product  terms  are  in  the  model.  For  example,  by 
exponentiating  the  estimated  coefficient  for  PRISON  at  exp 
(1.19200)  =  3.284,  we  obtain  the  estimated  hazard  ratio  for 
PRISON=l  versus  PRISON=0,  where  DOSE=0  and 
CLINIC=0.  This  is  a  meaningless  hazard  ratio  since  CLINIC 
is  coded  1  or  2  and  DOSE  is  always  greater  than  zero  (all 
patients  are  on  methadone).  In  the  next  section  (on  stratified 
Cox  models),  we  illustrate  how  a  CONTRAST  statement  can  be 
used  to  obtain  more  meaningful  hazard  ratio  estimates  for 
models  with  interaction  terms.  The  CONTRAST  statement 
can  be  used  to  obtain  a  linear  combination  of  parameter  esti¬ 
mates  in  addition  to  the  generalized  Wald  test  shown  above. 

The  Wald  chi-square  p-values  for  the  two  product  terms  are 
0.0872  for  CLIN_PR  and  0.3333  for  CLIN_DO.  The 
generalized  Wald  chi-square  p-values  for  testing  both  prod¬ 
uct  terms  simultaneously  is  0. 1 669.  Alternatively,  a  likelihood 
ratio  test  can  simultaneously  test  both  product  terms  by 
subtracting  the  -2  log-likelihood  statistic  for  the  full  model 
(with  the  two  product  terms)  from  the  reduced  model  (with¬ 
out  the  product  terms).  The  -2  log  likelihood  statistic  can  be 
found  on  the  output  in  the  table  called  MODEL  FIT  STATIS¬ 
TICS  and  under  the  column  called  WITH  COVARIATES.  The 
-2  log  likelihood  statistic  is  1,343.199  for  the  full  model  and 
1,346.805  for  the  reduced  model.  The  test  is  a  two  degree  of 
freedom  test  since  2  product  terms  are  simultaneously  tested. 

The  PROBCHI  function  in  SAS  can  be  used  to  obtain  p- 
values  for  chi-square  tests.  The  code  follows: 

DATA  TEST; 

REDUCED  =  1346.805; 

FULL  =  1343.199; 

DF  =  2; 

P-VALUE  =  1  -  PR0BCHI(REDUCED-FULL,DF); 

RUN; 

PR0C  PRINT  DATA=TEST ; RUN ; 

Note  that  you  must  write  1  minus  the  PROBCHI  function 
to  obtain  the  area  under  the  right  side  of  the  chi-square 
probability  density  function.  The  output  from  the  PROC 
PRINT  follows: 

Obs  REDUCED  FULL  DF  P-VALUE 

1  1346.81  1343.20  2  0.16480 

The  p-value  for  the  likelihood  ratio  test  for  both  product 
terms  is  0.16480,  a  similar  result  to  the  p-value  that  was 
obtained  from  the  generalized  Wald  test  (0.1669).  Both  of 
these  tests  are  two  degree  of  freedom  tests  since  the  two 
interaction  terms  are  simultaneously  tested. 


Software:  B.  Sas  581 


3.  RUNNING  A  STRATIFIED  COX  MODEL 

Suppose  we  believe  that  the  variable  CLINIC  violates  the 
proportional  hazards  assumption  but  the  variables 
PRISON  and  DOSE  follow  the  PH  assumption  within 
each  level  of  CLINIC.  A  stratified  Cox  model  on  the  variable 
CLINIC  can  be  run  with  PROC  PHREG  using  the  STRATA 
CLINIC  statement.  The  code  follows: 


PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT*STATUS(0)=  PRISON  DOSE/RL; 
STRATA  CLINIC; 

RUN; 

The  output  of  the  parameter  estimates  follows: 


The  PHREG  Procedure 

Analysis  of  Maximum  Likelihood  Estimates 


Parameter 

Standard 

Hazard 

95%  Hazard  Ratio 

Estimate 

Error  Chi-Square 

Pr  >  ChiSq 

Ratio 

Confidence  Limits 

PRISON 

1 

0.38877 

0.16892  5.2974 

0.0214 

1.475 

1.059 

2.054 

DOSE 

1 

-0.03514 

0.00647  29.5471 

< . 0001 

0.965 

0.953 

0.978 

Notice  there  is  no  parameter  estimate  for  CLINIC  since 
CLINIC  is  the  stratified  variable.  The  hazard  ratio  for 
PRISON=l  vs.  PRISON=0  is  estimated  at  1.475.  This  haz¬ 
ard  ratio  is  assumed  not  to  depend  on  CLINIC  since  an 
interaction  term  between  PRISON  and  CLINIC  was  not 
included  in  the  model. 

Suppose  we  wish  to  assess  interaction  between  PRISON 
and  CLINIC  as  well  as  DOSE  and  CLINIC  in  a  Cox  model 
stratified  by  CLINIC.  We  can  define  interaction  terms  in  a 
new  SAS  dataset  (called  addicts2)  and  then  run  a  model 
containing  these  terms. 


DATA  ADDICTS2; 

SET  REF. ADDICTS; 
CLIN-PR=CLINIC* PRISON; 
CLIN-DO=CLINIC*DOSE; 
RUN; 


Note  with  the  interaction  model  that  the  hazard  ratio  for 
PRISON=l  versus  PRISON=0  for  CLINIC=1  controlling  for 
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DOSE  is  exp(p,  +  p3),  and  the  hazard  ratio  for  PRISON=l 
versus  PRISON=0  for  CLINIC=2  controlling  for  DOSE  is 
exp(p]  +  2p3).  This  latter  calculation  is  obtained  by  sub¬ 
stituting  the  appropriate  values  into  the  hazard  in  both  the 
numerator  (for  PRISON=l)  and  denominator  (for 
PRISON=0)  (see  below): 

hQ(t)  exp [lft  +  poDOSE  +  (2)(l)/?3  +  pACLINJ)0]  = 

h0(t)  exp [0/1,  +  fi2DOSE  +  (2)(0)j83  +  p4CLIN_DO]  PlPl  P3>' 


A  CONTRAST  statement  with  the  ESTIMTES=  option  can 
be  used  with  PROC  PHREG  when  we  wish  to  obtain  esti¬ 
mates  of  a  linear  combination  of  parameter  estimates. 
We  can  also  use  the  CONTRAST  statement  to  test  the  two 
interaction  terms  simultaneously  with  a  generalized  Wald 
test  as  we  illustrated  in  the  previous  section. 

The  code  below  runs  a  stratified  Cox  model  (STRATA 
CLINIC)  including  two  interaction  terms  in  the  model. 
Three  CONTRAST  statements  are  used:  the  first  to  esti¬ 
mate  the  hazard  ratio  for  PRISON  among  those  with 
CLINIC=1,  exp(p,  +  p3);  the  second  to  estimate  the  hazard 
ratio  for  PRISON  among  those  with  CLINIC=2,  exp(l  + 
2p3);  and  the  third  to  test  the  two  interaction  terms  with  a 
two  degree  of  freedom  generalized  Wald  test.  The  ESTI- 
MATE=EXP  option  in  the  first  two  CONTRAST  statements 
requests  that  the  parameter  estimates  be  exponentiated. 
The  code  in  the  second  CONTRAST  statement  PRISON  1 
CLIN_PR  2/ESTIMATE=EXP;  requests  the  estimate  for 
exp(p,  +  2p3).  The  p,  corresponds  to  PRISON  and  the 
beta3  (P)  corresponds  to  the  third  variable  in  the  model, 
CLIN_PR.  The  code  follows: 

PROC  PHREG  DATA=ADDICTS2; 

MODEL  SURVT*STATUS(0)=  PRISON  DOSE  CLIN_PR  CLIN_D0; 

STRATA  CLINIC; 

CONTRAST  'HR  FOR  PRISON  AMONG  CLINIC=1'  PRISON  1  CLIN_PR  1/ESTIMATE=EXP; 

CONTRAST  'HR  FOR  PRISON  AMONG  CLINIC=2'  PRISON  1  CLIN_PR  2/ESTIMATE=EXP; 

CONTRAST  "TEST  INTERACTION"  CLIN_PR  1,  CLIN_D0  1; 

RUN; 


Notice  that  when  we  stratify  by  CLINIC,  we  do  not  put 
the  variable  CLINIC  in  the  model  statement.  However, 
the  interaction  terms  CLIN_PR  and  CLIN_DO  are  put 
in  the  model  statement  while  CLINIC  is  put  in  the  strata 
statement.  The  output  follows: 


Software:  B.  Sas  583 


The  PHREG  Procedure 

Analysis  of  Maximum  Likelihood  Estimates 


Variable 

DF 

Parameter 

Estimate 

Standard 

Error 

Chi-Square  Pr 

>  ChiSq 

Hazard 

Ratio 

PRISON 

1 

1.08716 

0.53861 

4.0741 

0.0435 

2.966 

DOSE 

1 

-0.03482 

0.01980 

3.0930 

0.0786 

0.966 

CLIN_PR 

1 

-0.58467 

0.42813 

1.8650 

0.1721 

0.557 

CLIN  DO 

1 

-0.00105 

0.01457 

0.0052 

0.9427 

0.999 

Contrast  Rows  Estimation  and  Testing  Results 


Contrast 

HR  FOR  PRISON  AMONG  CLINIC=1 
HR  FOR  PRISON  AMONG  CLINIC=2 


Type 

Estimate 

Standard 

Error 

Confidence 

Limits 

EXP 

1.6528 

0.3119 

1.1419 

2.3925 

EXP 

0.9211 

0.3540 

0.4337 

1.9563 

Contrast 


Contrast  Test  Results 

Wald 

DF  Chi-Square 


Pr  >  ChiSq 


HR  FOR  PRISON  AMONG  CLINIC=1  1 
HR  FOR  PRISON  AMONG  CLINIC=2  1 
TEST  INTERACTION  2 


7.0918 

0.0457 

1.8650 


0.0077 

0.8307 

0.3936 


The  hazard  ratio  (PRISON=l  vs  PRISON=0)  is  estimated  at 
1.6528  among  CLINIC=1  and  0.9211  among  CLINIC=2. 
The  generalized  Wald  test  for  testing  both  interaction 
terms  simultaneously  (a  2  df  test:  1  p3  =  0,  1  p4  =  0)  yields 
a  p-value  of  0.3936. 


An  alternative  approach  allowing  for  interaction  with 
CLINIC  and  the  other  covariates  is  obtained  by  running 
two  models:  one  subsetting  on  the  observations  where 
CLINIC=1  and  the  other  subsetting  on  the  observations 
where  CLINIC=2.  The  code  and  output  follow: 

PROC  PHREG  DATA=ADDICTS2 ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE; 

WHERE  CLINIC=1 ; 

TITLE  COX  MODEL  RUN  ONLY  ON  DATA  WHERE  CLINIC=1; 

RUN; 


A  WHERE  statement  in  a  SAS  procedure  subsets  the  num¬ 
ber  of  observations  for  analyses.  A  TITLE  statement  can 
also  be  added  to  the  procedure.  The  output  containing  the 
parameter  estimates  subsetting  on  the  observations  where 
CLINIC=  1  follows: 


584  Computer  Appendix:  Survival  Analysis  on  the  Computer 


COX  MODEL  RUN  ONLY  ON  DATA  WHERE  CLINIC=1 
Analysis  of  Maximum  Likelihood  Estimates 


Variable 

DF 

Parameter 

Estimate 

Standard 

Error 

Chi-Square 

Pr  >  ChiSq 

Hazard 

Ratio 

PRISON 

1 

0.50249 

0.18869 

7.0918 

0.0077 

1.653 

DOSE 

1 

-0.03586 

0.00774 

21.4761 

<.0001 

0.965 

Similarly,  the  code  and  output  containing  the  parameter 
estimates  subsetting  on  the  observations  where  CLINIC=2: 


PROC  PHREC  DATA=ADDICTS2 ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE; 

WHERE  CLINIC=2 ; 

TITLE  COX  MODEL  RUN  ONLY  ON  DATA  WHERE  CLINIC=2; 
RUN; 


COX  MODEL  RUN  ONLY  ON  DATA  WHERE  CLINIC=2 
Analysis  of  Maximum  Likelihood  Estimates 


Parameter 

Standard 

Hazard 

Variable 

DF 

Estimate 

Error 

Chi-Square 

Pr  >  ChiSq 

Ratio 

Variable  Label 

PRISON 

1 

-0.08226 

0.38430 

0.0458 

0.8305 

0.921 

0=none,  l=prison 
record 

DOSE 

1 

-0.03693 

0.01234 

8.9500 

0.0028 

0.964 

methadone  dose 
(mg/day) 

The  estimated  hazard  ratio  for  PRISON=l  versus 
PRISON=0  is  0.921  among  CLINIC=2  controlling  for 
DOSE.  This  result  is  consistent  with  the  stratified  Cox 
model  previously  run  in  which  all  the  product  terms  with 
CLINIC  were  included  in  the  model. 


Software:  B.  Sas  585 


4.  ASSESSING  THE  PH  ASSUMPTION  WITH 
A  STATISTICAL  TEST 

The  following  SAS  program  makes  use  of  the  addicts  data¬ 
set  to  demonstrate  how  a  statistical  test  of  the  PH  assump¬ 
tion  is  performed  for  a  given  co variate  (Harrel  and  Lee 
1986).  This  is  accomplished  by  finding  the  correlation 
between  the  Schoenfeld  residuals  for  a  particular  covariate 
and  the  ranking  of  individual  failure  times.  If  the  PH 
assumption  is  met,  then  the  correlation  should  be  near 
zero.  The  p-value  for  testing  this  correlation  can  be 
obtained  from  PROC  CORR  (or  PROC  REG).  The  Schoen¬ 
feld  residuals  for  a  given  model  can  be  saved  in  a  SAS 
dataset  using  PROC  PHREG.  The  ranking  of  events  by 
failure  time  can  be  saved  in  a  SAS  dataset  using  PROC 
RANKED.  The  null  hypothesis  is  that  the  PH  assumption 
is  not  violated. 

First,  we  run  a  model  containing  CLINIC,  PRISON,  and 
DOSE.  The  output  statement  creates  a  SAS  dataset,  the 
OUT=  option  defines  an  output  dataset,  and  the  RESSCH= 
statement  is  followed  by  user-defined  variable  names,  so 
that  the  output  dataset  contains  the  Schoenfeld  residuals. 
The  order  of  the  names  corresponds  to  the  order  of  the 
independent  variables  in  the  model  statement.  The  actual 
variable  names  are  arbitrary.  The  name  we  chose  for  the 
dataset  is  RESID  and  the  names  we  chose  for  the  variables 
containing  the  Schoenfeld  residuals  for  CLINIC,  PRISON, 
and  DOSE  are  RCLINIC,  RPRISON,  and  RDOSE.  The  code 
follows: 

The  code  follows: 

PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)  =CLINIC  PRISON  DOSE; 

OUTPUT  OUT =RESID  RESSCH=RCLINIC  RPRISON  RDOSE; 

RUN; 


PROC  PRINT  DATA=RESID ; RUN ; 
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The  first  10  observations  of  the  PROC  PRINT  are  printed 
below.  The  three  columns  on  the  right  are  the  variables 
containing  the  Schoenfeld  residuals. 


Obs 

SURVT 

STATUS 

CLINIC 

PRISON 

DOSE 

RCLINIC 

RPRIS0N 

RD0SE 

1 

428 

1 

1 

0 

50 

-0.18715 

-0.40641 

-8.2100 

2 

275 

1 

1 

1 

55 

-0.15841 

0.55485 

-2.6277 

3 

262 

1 

1 

0 

55 

-0.16453 

-0.45197 

-2.5635 

4 

183 

1 

1 

0 

30 

-0.14577 

-0.48727 

-26.0823 

5 

259 

1 

1 

1 

65 

-0.16306 

0.54313 

7.3701 

6 

714 

1 

1 

0 

55 

-0.25853 

-0.50074 

-8.5347 

7 

438 

1 

1 

1 

65 

-0.19292 

0.58106 

6.6072 

8 

796 

0 

1 

1 

60 

9 

892 

1 

1 

0 

50 

-0.34478 

-0.22372 

-15.9088 

10 

393 

1 

1 

1 

65 

-0.17712 

0.57376 

6.6886 

Next,  create  a  SAS  dataset  that  deletes  censored  observa¬ 
tions  (i.e.,  only  contains  observations  that  fail). 


DATA  EVENTS; 

SET  RESID; 

IF  STATUS=1 ; 

RUN; 

Use  PROC  RANK  to  create  a  dataset  containing  a  variable 
that  ranks  the  order  of  failure  times.  The  user  supplies  the 
name  of  the  output  dataset  using  the  OUT=  option.  The 
variable  to  be  ranked  is  SURVT.  The  RANKS  statement 
precedes  a  user-defined  variable  name  for  the  rankings  of 
failure  times.  The  user-defined  names  are  arbitrary.  The 
name  we  chose  for  this  variable  is  TIMERANK.  The  code 
follows: 

PROC  RANK  DATA=EVENTS  0UT=RANKED  TIES=MEAN ; 
VAR  SURVT; 

RANKS  TIMERANK; 

RUN; 


PROC  PRINT  DATA=RANKED ; RUN ; 
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PROC  CORR  is  used  to  get  the  correlations  between  the 
ranked  failure  time  variable  (called  TIMERANK  in  this 
example)  and  the  variables  containing  the  Schoenfeld  resi¬ 
duals  of  CLINIC,  PRISON,  and  DOSE  (called  RCLINIC, 
RPRISON,  and  RDOSE,  respectively,  in  this  example). 
The  NOSIMPLE  option  suppresses  the  printing  of  sum¬ 
mary  statistics.  If  the  PH  assumption  is  met  for  a  particular 
covariate,  then  the  correlation  should  be  near  zero.  The  p- 
value  obtained  from  PROC  CORR  which  tests  whether  this 
correlation  is  zero  is  the  same  p-value  we  use  for  testing  the 
PH  assumption.  The  code  follows: 

PROC  CORR  DATA=RANKED  NOSIMPLE; 

VAR  RCLINIC  RPRISON  RDOSE; 

WITH  TIMERANK; 

RUN; 

The  PROC  CORR  output  follows: 

The  CORR  Procedure 

Pearson  Correlation  Coefficients,  N  =  150 
Prob  >  | r |  under  HO:  Rho=0 

RCLINIC  RPRISON  RDOSE 

TIMERANK  -0.26153  -0.07970  0.07733 

Rank  for  Variable  SURVT  0.0012  0.3323  0.3469 

The  sample  correlations  with  their  corresponding  p-values 
printed  underneath  are  shown  above.  The  p-values  for 
CLINIC,  PRISON,  and  DOSE  are  0.0012,  0.3323,  and 
0.3469,  respectively,  suggesting  that  the  PH  assumption  is 
violated  for  CLINIC,  but  reasonable  for  PRISON  and 
DOSE. 

The  same  p-values  can  be  obtained  by  running  linear 
regressions  with  each  predictor  (one  at  a  time)  using 
PROC  REG  and  examining  the  p-values  for  the  regression 
coefficients.  The  code  below  will  produce  output  contain¬ 
ing  the  p-value  for  CLINIC: 

PROC  REG  DATA=RANKED ; 

MODEL  TIMERANK=RCLINIC; 

RUN; 
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The  output  produced  by  PROC  REG  follows: 


The  REG  Procedure 
Parameter  Estimates 


Variable 


Parameter  Standard 

Estimate  Error  t  Value  Pr  >  |t| 


Intercept  75.49955  3.43535 
RCLINIC  -28.38848  8.61194 


21.98  < . 0001 

-3.3G  0.GO12 


The  p-value  for  CLINIC  (0.0012)  is  printed  in  the  column 
on  the  right  and  matches  the  p-value  that  was  obtained 
using  PROC  CORR. 


5.  OBTAINING  COX  ADJUSTED  SURVIVAL  CURVES 

We  use  the  BASELINE  statement  in  PROC  PHREG  to 
create  an  output  dataset  containing  Cox  adjusted  survival 
estimates  for  a  specified  pattern  of  covariates.  The  partic¬ 
ular  pattern  of  covariates  of  interest  must  first  be  created 
in  a  SAS  dataset  that  is  subsequently  used  as  the  input 
dataset  for  the  COVARIATES=  option  in  the  BASELINE 
statement  of  PROC  PHREG.  Each  pattern  of  covariates 
yields  a  different  survival  curve  (assuming  nonzero 
effects).  Adjusted  log(-log)  survival  plots  can  also  be 
obtained  for  assessing  the  PH  assumption.  This  will  be 
illustrated  with  three  examples: 

Exl  -  Run  a  PH  model  using  PRISON,  DOSE,  and  CLINIC 
and  obtain  adjusted  survival  curves  where  PRISON=0, 
DOSE=70,  and  CLINIC=2. 

Ex2  -  Run  a  stratified  Cox  model  (by  CLINIC).  Obtain  two 
adjusted  survival  curves  using  the  mean  value  of 
PRISON  and  DOSE  for  CLINIC=1  and  CLINIC=2.  Use 
the  log  log  curves  to  assess  the  PH  assumption  for 
CLINIC  adjusted  for  PRISON  and  DOSE. 

Ex3  -  Run  a  stratified  Cox  model  (by  CLINIC)  and  obtain 
adjusted  survival  curves  for  PRISON=0,  DOSE=70  and 
for  PRISON=l,  DOSE=70.  This  yields  four  survival 
curves  in  all,  two  for  CLINIC=1  and  two  for  CLINIC=2. 
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Basically,  there  are  three  steps: 

1 )  Create  the  input  dataset  containing  the  pattern  (values) 
of  covariates  used  for  the  adjusted  survival  curves. 

2)  Run  a  Cox  model  with  PROC  PHREG  using  the 
BASELINE  statement  to  input  the  dataset  from  step  ( 1 ) 
and  output  a  dataset  containing  the  adjusted  survival 
estimates. 

3)  Plot  the  adjusted  survival  estimates  from  the  output 
dataset  created  in  step  (2). 

For  Exl,  we  create  an  input  dataset  (called  INI)  with  one 
observation  where  PRISON=0,  DOSE=70,  and  CLINIC=2. 
We  then  run  a  model  and  create  an  output  dataset  (called 
OUT1)  containing  a  variable  with  the  adjusted  survival 
estimates  (called  SI).  Finally,  the  adjusted  survival  curve 
is  plotted  using  PROC  GPLOT.  The  code  follows: 


DATA  INI; 

INPUT  PRISON  DOSE  CLINIC; 
CARDS; 

0  70  2 


PROC  PHREC  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE  CLINIC; 

BASELINE  C0VARIATES=IN1  0UT=0UT1  SURVIVAL=S1/N0MEAN ; 

RUN; 

PROC  GPLOT  DATA=0UT1; 

PLOT  SI* SURVT ; 

TITLE  Adjusted  survival  for  prison=0,  dose=70,  clinic=2; 
RUN; 


The  BASELINE  statement  in  PROC  PHREG  specifies  the 
input  dataset,  the  output  dataset,  and  the  name  of 
the  variable  containing  the  adjusted  survival  estimates. 
The  NOMEAN  option  suppresses  the  survival  estimates 
using  the  mean  values  of  PRISON,  DOSE,  and  CLINIC. 
The  next  example  (Ex2)  will  not  use  the  NOMEAN  option. 
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The  output  for  PROC  GPLOT  follows: 


S(t)  Adjusted  survival  for  prison  =  0,  dose  =  70,  clinic  =  2 


For  Ex2,  we  wish  to  create  and  output  dataset  (called 
OUT2)  that  contains  the  adjusted  survival  estimates  from 
a  Cox  model  stratified  by  CLINIC  using  the  mean  values  of 
PRISON  and  DOSE.  An  input  dataset  need  not  be  specified 
since  by  default  the  mean  values  of  PRISON  and  DOSE  will 
be  used  if  the  NOMEAN  option  is  not  used  in  the  BASE¬ 
LINE  statement.  The  code  follows: 


PROC  PHREC  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)  =  PRISON  DOSE  CLINIC; 

BASELINE  0UT=0UT2  SURVIVAL=S2  L0GL0GS=LS2; 

RUN; 

PROC  GPLOT  DATA=0UT2 ; 

PLOT  S2*SURVT=CLINIC ; 

TITLE  adjusted  survival  stratified  by  clinic; 

RUN; 

PROC  GPLOT  DATA=0UT2; 

PLOT  LS2*SURVT=CLINIC ; 

TITLE  log-log  curves  stratified  by  clinic,  adjusted  for 
prison,  dose; 

RUN; 


Log  of  Negative  Log  of  SURVIVAL 
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The  code,  PLOT  LS2*SURVT=CLINIC,  in  the  2nd  PROC 
GPLOT  will  plot  LS2  on  the  vertical  axis,  SURVT  on  the 
horizontal  axis,  stratified  by  CLINIC  on  the  same  graph. 
The  variable  LS2  was  created  in  the  BASELINE  statement 
of  PROC  PHREG  and  contains  the  adjusted  log-log  sur¬ 
vival  estimates.  The  PROC  GPLOT  output  for  the  log-log 
survival  curves  stratified  by  CLINIC  adjusted  for  PRISON 
and  DOSE  follows: 


2 
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The  adjusted  log-log  plots  look  similar  to  the  unadjusted 
log-log  Kaplan-Meier  plots  shown  earlier,  in  that  the  plots 
look  reasonably  parallel  before  365  days  but  then  diverge, 
suggesting  that  the  PH  assumption  is  violated  after  1  year. 

For  Ex3,  a  stratified  Cox  (by  CLINIC)  is  run  and  adjusted 
curves  are  obtained  for  PRISON=l  and  PRISON=0  holding 
DOSE=70.  An  input  dataset  (called  IN3)  is  created  with  two 
observations  for  both  levels  of  PRISON  with  DOSE=70.  An 
output  dataset  (called  OUT3)  is  created  with  the  BASE¬ 
LINE  statement  that  contains  a  variable  (called  S3)  of 
survival  estimates  for  all  four  curves  (two  for  each  stratum 
of  CLINIC).  The  code  follows: 


log-log  curves  stratified  by  clinic,  adjusted  for  prison  dose 
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DATA  IN3 ; 

INPUT  PRISON  DOSE; 
CARDS; 

1  70 
0  70 


PROC  PHREC  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)=  PRISON  DOSE; 

STRATA  CLINIC; 

BASELINE  C0VARIATES=IN3  0UT=0UT4  SURVIVAL=S3/N0MEAN ; 

RUN; 

PROC  GPLOT  DATA=0UT3 ; 

PLOT  S3*SURVT=CLINIC ; 

TITLE  adjusted  survival  stratified  by  clinic  for  both  levels 
of  prison; 

RUN; 


adjusted  survival  stratified  by  clinic  for  both  levels  of  prison 


For  the  graph  above,  the  PH  assumption  is  not  assumed  for 
CLINIC  since  that  is  the  stratified  variable.  However,  the 
PH  assumption  is  assumed  for  PRISON  within  each  stra¬ 
tum  of  CLINIC  (i.e.,  CLINIC=1  and  CLINIC=2). 
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6.  RUNNING  AN  EXTENDED  COX  MODEL 

Models  containing  time-dependent  variables  are  run  using 
PROC  PHREG.  Time  dependent  variables  are  created  with 
programming  statements  within  the  PROC  PHREG  proce¬ 
dure.  Sometimes,  users  incorrectly  define  time-dependent 
variables  in  the  data  step.  This  leads  to  wrong  estimates 
because  the  time  variable  used  in  the  data  step  (SURVT)  is 
actually  time-independent  and  therefore  different  than  the 
time  variable  (also  called  SURVT)  used  to  define  time- 
dependent  variables  in  the  PROC  PHREG  statement.  See 
the  discussion  on  the  extended  Cox  likelihood  in  Chapter  6 
for  further  clarification  of  this  issue. 

We  have  evaluated  the  PH  assumption  for  the  variable 
CLINIC  by  plotting  KM  log-log  curves  and  Cox-adjusted 
log  log  curves  stratified  by  CLINIC  and  checking  whether 
the  curves  were  parallel.  We  could  do  similar  analyses  with 
the  variables  PRISON  and  DOSE  although  with  DOSE  we 
would  need  to  categorize  the  continuous  variable  before 
comparing  plots  for  different  strata  of  DOSE. 

If  it  is  expected  that  the  hazard  ratio  for  the  effect  of  DOSE 
increases  (or  decreases)  monotonically  with  time,  we  could 
add  a  continuous  time-varying  product  term  with  DOSE 
and  some  function  of  time.  The  model  defined  below  con¬ 
tains  a  time-varying  variable  (LOGTDOSE)  defined  as  the 
product  of  DOSE  and  the  natural  log  of  time  (SURVT).  In 
some  sense,  a  violation  of  the  PH  assumption  for  a  partic¬ 
ular  variable  means  that  there  is  an  interaction  between 
that  variable  and  time.  Note  that  the  variable  LOGTDOSE 
is  defined  within  the  PHREG  procedure  and  not  in  the  data 
step.  The  code  follows: 

PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)= PRISON  CLINIC  DOSE  LOGTDOSE; 
L0GTD0SE=D0SE*  LOG(SURVT) ; 

RUN; 
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The  output  produced  by  PROC  PHREG  follows: 


The  PHREG  Procedure 


Analysis 

of  Maximum 

Likelihood  Estimates 

Parameter 

Standard 

Hazard 

Variable 

DF 

Estimate 

Error 

Chi-Square 

Pr  >  ChiSq 

Ratio 

PRISON 

1 

0.34047 

0.16747 

4.1333 

0.0420 

1.406 

CLINIC 

1 

-1.01857 

0.21538 

22.3655 

<.0001 

0.361 

DOSE 

1 

-0.08243 

0.03599 

5.2468 

0.0220 

0.921 

LOGTDOSE 

1 

0.00858 

0.00646 

1.7646 

0.1841 

1.009 

The  Wald  test  for  the  time-dependent  variable  LOGTDOSE 
yields  a  p-value  of  0.1841.  A  nonsignificant  p-value  does 
not  necessarily  mean  that  the  PH  assumption  is  reasonable 
for  DOSE.  Perhaps,  a  different  defined  time-dependent 
variable  would  have  been  significant  (e.g.,  DOSE  x 
(TIME  -  100)).  Also,  the  sample-size  of  the  study  is  a  key 
determinant  of  the  power  to  reject  the  null,  which  in  this 
case  means  rejection  of  the  PH  assumption. 


Next,  we  consider  time-dependent  variables  for  CLINIC. 
The  next  two  models  use  heaviside  functions  that  allow  a 
different  hazard  ratio  to  be  estimated  for  CLINIC  before 
and  after  365  days.  The  first  model  uses  two  heaviside 
functions  in  the  model  (HV1  and  HV2)  but  not  CLINIC. 
The  second  model  uses  one  heaviside  function  (HV)  but 
also  includes  CLINIC  in  the  model.  These  two  models  yield 
the  same  hazard  ratio  estimates  for  CLINIC  but  are  coded 
differently.  The  code  and  output  for  the  model  with  two 
heaviside  functions  follows: 

PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT*STATUS(0)=PRISON  DOSE  HV1  HV2; 

IF  SURVT  <  365  THEN  HV1  =  CLINIC;  ELSE  HV1  =  0; 

IF  SURVT  >=  365  THEN  HV2  =  CLINIC;  ELSE  HV2  =  0; 

CONTRAST  'TEST  EQUALITY  OF  HEAVISIDES'  HV1  1  HV2  -1; 

RUN; 


Analysis 

of  Maximum 

Likelihood 

Estimates 

Parameter 

Standard 

Hazard 

Parameter 

Estimate 

Error 

Chi-Square 

Pr  >  ChiSq 

Ratio 

PRISON 

0.37770 

0.16840 

5.0304 

0.0249 

1.459 

DOSE 

-0.03551 

0 . 00644 

30.4503 

< . 0001 

0.965 

HVl 

-0.45956 

0.25529 

3.2405 

0.0718 

0.632 

HV2 

-1.82823 

0.38595 

22.4392 

< . 0001 

0.161 

Contrast  Test  Results 

Wald 

Contrast 

DF  Chi-Square 

Pr  >  ChiSq 

TEST 

EQUALITY  OF 

HEAVISIDES 

1 

8.7993 

0 . 0030 
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The  parameter  estimates  for  HV1  and  HV2  can  be  used 
directly  to  obtain  the  estimated  hazard  ratio  for  CLINIC=2 
vs  CLINIC=1  before  and  after  365  days.  The  estimated 
hazard  ratio  for  CLINIC  at  100  days  is  exp(-0. 45956)  = 
0.632  and  the  estimated  hazard  ratio  for  CLINIC  at  400 
days  is  exp(-l. 82823)  =  0.161.  The  CONTRAST  statement 
provides  a  Wald  test  on  the  equality  of  two  heaviside  coef¬ 
ficients  (p3  =  p4  or  p3  -  p4  =  0).  If  the  two  heaviside  coeffi¬ 
cients  were  equal,  then  the  hazard  ratios  for  CLINIC  would 
not  depend  on  time.  So  the  test  could  be  viewed  as  a  test  of 
one  form  of  PH  violation.  The  p-value  for  the  test  is  highly 
significant  at  0.0030,  suggesting  that  the  PH  assumption  is 
violated  for  CLINIC. 

The  code  and  output  for  an  equivalent  model  with  one 
heaviside  function  are  shown  below: 


PROC  PHREG  DATA=REF. ADDICTS; 

MODEL  SURVT*STATUS(0)=CLINIC  PRISON  DOSE  HV; 

IF  SURVT  >=  365  THEN  HV  =  CLINIC;  ELSE  HV  =  0; 

CONTRAST  'HR  FOR  CLINIC  <365  days'  CLINIC  1/ESTIMATE=EXP; 
CONTRAST  'HR  FOR  CLINIC  >=365  days'  CLINIC  1  HV  1/ESTIMATE=EXP; 
RUN; 


Analysis  of  Maximum  Likelihood  Estimates 


Parameter 

Standard 

Hazard 

Parameter  Estimate 

Error 

Chi-Square 

Pr  >  ChiSq 

Ratio 

CLINIC 

-0.45956 

0.25529 

3.2405 

0.0718 

0.632 

PRISON 

0.37770 

0.16840 

5.0304 

0.0249 

1.459 

DOSE 

-0.03551 

0.00644 

30.4503 

< .0001 

0.965 

HV 

-1.36866 

0.46139 

8.7993 

0.0030 

0.254 

Contrast  Rows  Estimation  and  Testing  Results 

Standard 

Contrast 

Type 

Estimate 

Error 

Confidence 

Limits 

HR  FOR 

CLINIC  <365  days 

EXP 

0.6316 

0.1612 

0.3829 

1.0416 

HR  FOR 

CLINIC  >=365  days 

EXP 

0 . 1607 

0.0620 

0.0754 

0.3424 

Notice  that  the  variable  CLINIC  is  included  in  this  model 
and  the  coefficient  for  the  time-dependent  heaviside  func¬ 
tion,  HV,  does  not  contribute  to  the  estimated  hazard  ratio 
until  day  365.  The  estimated  hazard  ratio  for  CLINIC  at 
100  days  is  exp(-0. 45956)  =  0.6316  while  the  estimated 
hazard  ratio  for  CLINIC  at  400  days  is  exp((-0. 45956)  + 
(-1.36866))  =  0.1607  as  calculated  using  the  ESTIMA- 
TE=EXP  option  in  the  CONTRAST  statement.  These 
results  are  consistent  with  the  estimates  obtained  from 
the  model  with  two  heaviside  functions.  A  Wald  test  for 
the  variable  HV  shows  a  statistically  significant  p-value  of 
0.003  suggesting  a  violation  of  the  PH  assumption  for 
CLINIC.  This  is  the  same  test  as  that  obtained  with  the 
CONTRAST  statement  using  the  model  with  two  heaviside 
functions. 
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Suppose  it  is  believed  that  the  hazard  ratio  for  CLINIC=2 
versus  CLINIC=1  is  constant  over  the  first  year  but  then 
monotonically  increases  (or  decreases)  after  the  first  year. 
The  following  code  defines  a  model  allowing  for  a  time- 
varying  covariate  called  CLINTIME  (defined  in  the  code) 
which  contributes  to  the  hazard  ratio  for  CLINIC  after  365 
days  (output  omitted): 

PROC  PHREC  DATA=REF. ADDICTS; 

MODEL  SURVT* STATUS (0)=CLINIC  PRISON  DOSE  CLINTIME; 

IF  SURVT  <  365  THEN  CLINTIME=0; 

ELSE  IF  SURVT  >=  365  THEN  CLINTIME  =  CLINIC* (SURVT-365) ; 

RUN; 


SAS  is  flexible  in  the  way  it  can  accommodate  the  model¬ 
ing  of  time-varying  covariates  from  different  data  formats. 
To  illustrate  this  point,  consider  an  example  that  was  dis¬ 
cussed  in  Chapter  6.  The  data  below  (Data  Dl)  contain  one 
observation  for  Jane  who  had  an  event  at  49  months 
(MONTHS=49  and  STATUS=1).  Her  dose  of  medication 
at  the  beginning  of  follow-up  was  60  mg  (DOSE  1=60  and 
TIME1=0).  At  the  12th  month  of  follow-up,  her  dose  was 
changed  to  120  mg  (DOSE2=120  and  TIME2=12).  At  the 
30th  month  of  follow-up,  her  dose  was  changed  to  150  mg 
(DOSE3=120  and  TIME3=30). 

(Data  Dl)  DOSE  changes  at  three  time  points  for  Jane 


(Data  Dl)  DOSE  changes  at  3  time  points  for  Jane 
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Jane  49  1  60  0  120  12  150  30 

If  dosage  is  measured  at  multiple  time  points,  then  we 
would  want  to  treat  dose  as  a  time  varying  covariate.  We 
are  assuming  that  Jane’s  observation  is  one  out  of  many 
individuals  A.  The  following  code  would  run  an  extended 
Cox  model  for  data  formatted  as  above: 
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PROC  PHREG  DATA=D1; 

MODEL  MONTHS*STATUS(0)=T_DOSE ; 

IF  MONTHS<=TIME2  THEN  T_DOSE=DOSEl; 

ELSE  IF  MONTHS<=TIME3  THEN  T_DOSE=DOSE2; 

ELSE  T_DOSE=DOSE3; 

RUN; 

The  time  dependent  variable  T_DOSE  is  defined  below  the 
MODEL  statement  and  is  defined  in  terms  of  DOSE1, 
DOSE2,  and  DOSE3  at  specified  points  in  time. 

Alternatively,  the  data  can  be  transposed  in  a  counting 
process  format  such  that  Jane  would  have  three  observa¬ 
tions  to  accommodate  her  three  values  of  dosage  over  her 
risk  period. 

The  following  code  transposes  the  data  (Dl)  into  a  count¬ 
ing  process  format  (D2): 

DATA  D2 ; 

SET  Dl; 

START=TIME1 ; STOP=TIME2 ; EVENT=0 ; DOSE=DOSEl ; OUTPUT ; 
START=TIME2 ; ST0P=TIME3 ; EVENT=0 ; DOSE=DOSE2 ; OUTPUT ; 
START=TIME3 ; STOP=MONTHS ; EVENT=STATUS ; DOSE=DOSE3 ; OUTPUT ; 
DROP  MONTHS  D0SE1  DOSE2  DOSE3  TIME1  TIME2  TIME3  STATUS; 
RUN; 


Now  the  data  (D2)  is  transposed  to  contain  three  observa¬ 
tions  for  Jane,  allowing  DOSE  to  be  represented  as  a  time- 
dependent  variable.  For  the  first  time  interval  (START=0, 
STOP=12),  Jane’s  dose  was  60  mg.  For  the  second  time 
interval  (12-30  months),  Jane’s  dose  was  120  mg.  For  the 
third  time  interval  (30-49  months),  Jane’s  dose  was  150 
mg.  The  data  indicate  that  Jane  had  an  event  at  49  months 
(STOP=49  and  STATUS=1).  Jane’s  three  observations  are 
printed  below: 
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PROC  PRINT  DATA=D2  ;  RUN  ; 


ID 

START 

STOP 

EVENT 

DOSE 

JANE 

0 

12 

0 

60 

JANE 

12 

30 

0 

120 

JANE 

30 

49 

1 

150 

The  code  to  run  the  model  with  the  data  in  counting  pro¬ 
cess  format  is  shown  below: 

PROC  PHREG  DATA=D2 ; 

MODEL  (START, STOP) * EVENT (0)=DOSE; 

RUN; 


Using  PROC  PHREG  on  data  in  the  counting  process  for¬ 
mat  is  discussed  in  more  detail  when  we  discuss  the  mod¬ 
eling  recurrent  events  in  SAS  (Section  8). 


7.  RUNNING  PARAMETRIC  MODELS 
WITH  PROC  LIFEREG 

PROC  LIFEREG  runs  parametric  AFT  models  rather  than 
PH  models.  Whereas  the  key  assumption  of  a  PH  model  is 
that  hazard  ratios  are  constant  over  time,  the  key  assump¬ 
tion  for  an  AFT  model  is  that  survival  time  accelerates  (or 
decelerates)  by  a  constant  factor  when  comparing  different 
levels  of  covariates. 

The  most  common  distribution  for  parametric  modeling  of 
survival  data  is  the  Weibull  distribution.  The  hazard  func¬ 
tion  for  a  Weibull  distribution  is  lptp~1 .  If  p  =  1,  then  the 
Weibull  distribution  is  also  an  exponential  distribution. 
The  Weibull  distribution  has  a  desirable  property,  in  that 
if  the  AFT  assumption  holds  then  the  PH  assumption  also 
holds.  The  exponential  distribution  is  a  special  case  of 
the  Weibull  distribution.  The  key  property  for  the  expo¬ 
nential  distribution  is  that  the  hazard  is  constant  over  time 
(h(t)  =/.)•  In  SAS,  the  Weibull  and  exponential  model  are 
run  only  as  AFT  models. 

The  Weibull  distribution  has  the  property  that  the  log-log 
of  the  survival  function  is  linear  with  the  log  of  time.  PROC 
LIFETEST  can  be  used  to  plot  Kaplan-Meier  log-log  curves 
against  the  log  of  time.  If  the  curves  are  approximately 
straight  lines  (and  parallel),  then  the  assumption  is  reason¬ 
able.  Furthermore,  if  the  straight  lines  have  a  slope  of  1, 
then  the  exponential  distribution  is  appropriate.  The  code 
below  produces  log-log  curves  stratified  by  CLINIC  and 
PRISON  that  can  be  used  to  check  the  validity  of  the 
Weibull  assumption  for  those  variables: 
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PROC  LIFETEST  DATA=REF . ADDICTS  METH0D=KM  PLOTS=(LLS); 
TIME  SURVT*STATUS(0) ; 

STRATA  CLINIC  PRISON; 

RUN; 


h — i — i-  CLINIC=1  PRISON=0  -i— i— i-  CLINIC=1  PRISON=1 
h — i — i-  CLINIC=2  PRISON=0  i  CLINIC=2  PRISON=1 


The  log-log  curves  do  not  look  straight  but  for  illustration 
we  shall  proceed  as  if  the  Weibull  assumption  were  appro¬ 
priate.  First,  an  exponential  model  will  be  run  with  PROC 
LIFEREG.  In  this  model,  the  Weibull  shape  parameter  (p) 
is  forced  to  equal  1,  which  forces  the  hazard  to  be  constant. 


PROC  LIFEREG  DATA=REF . ADDICTS ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE  CLINIC/DIST=EXPONENTIAL ; 
RUN; 

The  DIST=EXPONENTIAL  option  in  the  MODEL  statement 
requests  the  Weibull  distribution.  The  output  of  parameter 
estimates  obtained  from  PROC  LIFEREG  follows: 


Analysis  of  Parameter  Estimates 


Standard 

95%  Confidence 

Chi- 

Parameter 

DF 

Estimate 

Error 

Limits 

Square 

Pr  >  ChiSq 

Intercept 

1 

3.6843 

0.4307 

2.8402 

4.5285 

73.17 

<.0001 

PRISON 

1 

-0.2526 

0.1649 

-0.5758 

0.0705 

2.35 

0.1255 

DOSE 

1 

0.0289 

0.0061 

0.0169 

0.0410 

22.15 

<.0001 

CLINIC 

1 

0.8806 

0.2106 

0.4678 

1.2934 

17.48 

<.0001 

Scale 

0 

1.0000 

0.0000 

1 . 0000 

1.0000 

Weibull  Shape 

0 

1.0000 

0.0000 

1 . 0000 

1.0000 
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The  exponential  model  assumes  a  constant  hazard.  This  is 
indicated  in  the  output  by  the  value  of  the  Weibull  shape 
parameter  (1.0000).  The  output  can  be  used  to  calculate 
the  estimated  hazard  for  any  subject  given  a  pattern  of 
covariates.  For  example,  a  subject  with  PRISON=0, 
DOSE=50,  and  CLINIC=2  has  an  estimated  hazard  of 
exp{  — (3.6843  +  50(0.0289))  +  2(0.8806)}  =  0.001.  Note  that 
SAS  gives  the  parameter  estimates  for  the  AFT  form  of 
the  exponential  model.  Multiply  the  estimated  coefficients 
by  negative  one  to  get  estimates  consistent  with  the  PH 
parameterization  of  the  model  (see  Chapter  7). 

Next,  a  Weibull  AFT  model  is  ran  with  PROC  LIFEREG. 

PR0C  LIFEREG  DATA=REF . ADDICTS ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE  CLINIC/DIST=WEIBULL ; 

RUN; 

The  DIST= WEIBULL  option  in  the  MODEL  statement 
requests  the  Weibull  distribution.  The  output  for  the 
parameter  estimates  follows: 
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Standard 

95%  Confidence 

Chi- 

Parameter 

DF 

Estimate 

Error 

Limits 

Square 

Pr  >  ChiSq 

Intercept 

1 

4.1048 

0.3281 

3.4619 

4.7478 

156.56 

<.0001 

PRISON 

1 

-0.2295 

0.1208 

-0.4662 

0.0073 

3.61 

0.0575 

DOSE 

1 

0.0244 

0.0046 

0.0154 

0.0334 

28.32 

<.0001 

CLINIC 

1 

0.7090 

0.1572 

0.4009 

1.0172 

20.34 

<.0001 

Scale 

1 

0.7298 

0.0493 

0.6393 

0.8332 

Weibull  Shape 

1 

1.3702 

0.0926 

1.2003 

1.5642 

The  Weibull  shape  parameter  is  estimated  at  1.3702.  SAS 
calls  the  reciprocal  of  the  Weibull  shape  parameter,  the 
Scale  parameter,  estimated  at  0.7298.  The  acceleration 
factor  comparing  CLINIC=2  to  CLINIC=1  is  estimated  at 
exp(0.7090)  =  2.03.  So,  the  estimated  median  survival  time 
(time  off  heroin)  is  double  for  patients  enrolled  in 
CLINIC=2  compared  to  CLINIC=1. 

To  obtain  the  hazard  ratio  parameters  from  the  Weibull 
AFT  model,  multiply  the  Weibull  shape  parameter  by  the 
negative  of  the  AFT  parameter  (see  Chapter  7).  For  exam¬ 
ple,  the  HR  estimate  for  CLINIC=2  vs  CLINIC=  1  controlling 
for  the  other  covariates  is  exp(1.3702(-0.7090))  =  0.38. 
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Next,  a  log-logistic  AFT  model  is  run  with  PROC  LIFEREG. 


PROC  LIFEREG  DATA=REF . ADDICTS ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE  CLINIC/DIST=LLOGISTIC ; 
RUN; 

The  output  of  the  log-logistic  parameter  estimates  follows: 


Analysis  of  Parameter  Estimates 


Parameter 

DF 

Estimate 

Standard 

Error 

95%  Confidence 
Limits 

Chi- 

Square 

Pr  >ChiSq 

Intercept 

1 

3.5633 

0.3894 

2.8000 

4.3266 

83.71 

<.0001 

PRISON 

1 

-0.2913 

0.1440 

-0.5734 

-0.0091 

4.09 

0.0431 

DOSE 

1 

0.0316 

0.0055 

0.0208 

0.0424 

32.81 

<.0001 

CLINIC 

1 

0.5806 

0.1716 

0.2443 

0.9169 

11.45 

0.0007 

Scale 

1 

0.5868 

0.0403 

0.5129 

0.6712 

From  this  output,  the  acceleration  factor  comparing 
CLINIC=2  to  CLINIC=1  is  estimated  at  exp(0.5806)  = 
1.79.  If  the  AFT  assumption  holds  for  a  log-logistic 
model,  then  the  proportional  odds  assumption  holds  for 
the  survival  function  (although  the  PH  assumption  will  not 
hold).  The  proportional  odds  assumption  can  be  evaluated 
by  plotting  the  log  odds  of  survival  (using  KM  estimates) 
against  the  log  of  survival  time.  If  the  plots  are  straight 
lines  for  each  pattern  of  covariates,  then  the  log-logistic 
distribution  is  reasonable.  If  the  straight  lines  are  also 
parallel,  then  the  proportional  odds  and  AFT  assumptions 
also  hold. 

A  SAS  dataset  containing  the  KM  survival  estimates  can  be 
created  using  PROC  LIFETEST  (see  Section  1  of  this 
appendix).  Once  this  variable  is  created,  a  dataset  contain¬ 
ing  variables  for  the  estimated  log  odds  of  survival  and  the 
log  of  survival  time  can  also  be  created.  PROC  GPLOT  can 
then  be  used  to  plot  the  log  odds  of  survival  against  survival 
time. 

Another  context  for  thinking  about  the  proportional  odds 
assumption  is  that  the  odds  ratio  estimated  by  a  logistic 
regression  does  not  depend  on  the  length  of  the  follow-up. 
For  example,  if  a  follow-up  study  was  extended  from  3  to  5 
years,  then  the  underlying  odds  ratio  comparing  two  pat¬ 
terns  of  covariates  would  not  change.  If  the  proportional 
odds  assumption  is  not  true,  then  the  odds  ratio  is  specific 
to  the  length  of  follow-up. 
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An  AFT  model  is  a  multiplicative  model  with  respect  to 
survival  time  or  equivalently  an  additive  model  with 
respect  to  the  log  of  time.  In  the  previous  example,  the 
median  survival  time  was  estimated  as  1.79  times  longer 
for  CLINIC=2  compared  to  CLINIC=1.  In  that  example, 
survival  time  was  assumed  to  follow  a  log-logistic  distribu¬ 
tion  or  equivalently  the  log  of  survival  time  was  assumed  to 
follow  a  logistic  distribution. 

SAS  allows  additive  failure  time  models  to  be  run  (see 
chapter  7  under  the  heading  “Other  Parametric  Models”). 
The  NOLOG  option  in  the  MODEL  statement  of  PROC 
LIFEREG  suppresses  the  default  log  link  function  which 
means  that  time,  rather  than  log(time),  is  modeled  as  a 
linear  function  of  the  regression  parameters.  The  following 
code  requests  an  additive  failure  time  model  in  which  time 
follows  a  logistic  (not  log-logistic)  distribution: 


PROC  LIFEREG  DATA=REF . ADDICTS ; 

MODEL  SURVT* STATUS (0)= PRISON  DOSE  CLINIC/DIST=LLOGISTIC  NOLOG; 
RUN; 

Even  though  the  option  DIST=LLOGISTIC  appears  to 
request  that  survival  time  follows  a  log-logistic  distribution. 
The  NOLOG  option  actually  means  that  survival  time  is 
assumed  to  follow  a  logistic  distribution.  (Note  that  the 
NOLOG  option  in  Stata  means  -  something  completely 
different  using  the  streg  command  -  that  the  iteration  log 
file  not  be  shown  in  the  output.)  The  output  from  the 
additive  failure  time  model  follows: 


Analysis  of  Parameter  Estimates 


Parameter 

DF 

Estimate 

Standard 

Error 

95%  Confidence 
Limits 

Chi- 

Square 

Pr  >  ChiSq 

Intercept 

1 

-358.482 

114.0161 

-581.949 

-135.014 

9.89 

0.0017 

PRISON 

1 

-89.7816 

42.9645 

-173.990 

-5.5727 

4.37 

0.0366 

DOSE 

1 

10.3893 

1.6244 

7.2055 

13.5731 

40.91 

<.0001 

CLINIC 

1 

214.2525 

53.1204 

110.1385 

318.3665 

16.27 

<.0001 

Scale 

1 

172.4039 

11.3817 

151.4792 

196.2191 

Software:  B.  Sas  603 


The  parameter  estimate  for  CLINIC  is  214.2525.  The  inter¬ 
pretation  for  this  estimate  is  that  the  median  survival  time 
(or  time  to  any  fixed  value  of  S(t))  is  estimated  at  214  days 
more  for  CLINIC=2  compared  to  CLINIC=1.  In  other 
words,  you  add  214  days  to  the  estimated  median  survival 
time  for  CLINIC=1  to  get  the  estimated  median  survival 
time  for  CLINIC=2.  This  contrasts  with  the  previous  AFT 
model  in  which  you  multiply  estimated  median  survival 
time  for  CLINIC  =1  by  1.79  to  get  the  estimated  median 
survival  time  for  CLINIC=2.  The  additive  model  can  be 
viewed  as  a  shifting  of  survival  time  while  the  AFT  model 
can  be  viewed  as  a  scaling  of  survival  time. 

If  survival  time  follows  a  logistic  distribution  and  the  addi¬ 
tive  failure  time  assumption  holds,  then  the  proportional 
odds  assumption  also  holds.  The  logistic  assumption  can 
be  evaluated  by  plotting  the  log  odds  of  survival  (using  KM 
estimates)  against  time  (rather  than  against  the  log  of  time 
as  analogously  used  for  the  evaluation  of  the  log-logistic 
assumption).  If  the  plots  are  straight  lines  for  each  pattern 
of  covariates,  then  the  logistic  distribution  is  reasonable.  If 
the  straight  lines  are  also  parallel,  then  the  proportional 
odds  and  additive  failure  time  assumptions  hold. 

Other  distributions  supported  by  PROC  LIFEREG  are 
the  generalized  gamma  (DIST= GAMMA)  and  lognormal 
(DIST=LNORMAL)  distributions.  If  the  NOLOG  option  is 
specified  with  the  DIST=LNORMAL  option  in  the  model 
statement,  then  survival  time  is  assumed  to  follow  a  nor¬ 
mal  distribution. 


8.  MODELING  RECURRENT  EVENTS 

The  modeling  of  recurrent  events  is  illustrated  with  the 
bladder  cancer  dataset  (bladder.sas7bdat)  described  at 
the  start  of  this  appendix.  Recurrent  events  are  represented 
in  the  data  with  multiple  observations  for  subjects  having 
multiple  events.  The  data  layout  for  the  bladder  cancer 
dataset  is  suitable  for  a  counting  process  approach  with 
time  intervals  defined  for  each  observation  (see  Chapter  8). 
The  following  code  prints  the  12th-20th  observation,  which 
contains  information  for  four  subjects.  The  code  follows: 


PROC  PRINT  DATA=REF. BLADDER  (FIRST0BS=  12  0BS=20) ; 
RUN; 
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The  output  follows: 


OBS 

ID 

EVENT 

INTERVAL 

START 

STOP 

TX 

NUM 

SIZE 

12 

10 

1 

1 

0 

12 

0 

1 

1 

13 

10 

1 

2 

12 

16 

0 

1 

1 

14 

10 

0 

3 

16 

18 

0 

1 

1 

15 

11 

0 

1 

0 

23 

0 

3 

3 

16 

12 

1 

1 

0 

10 

0 

1 

3 

17 

12 

1 

2 

10 

15 

0 

1 

3 

18 

12 

0 

3 

15 

23 

0 

1 

3 

19 

13 

1 

1 

0 

3 

0 

1 

1 

20 

13 

1 

2 

3 

16 

0 

1 

1 

There  are  three  observations  for  ID=10,  one  observation 
for  ID=1 1,  three  observations  for  ID=12,  and  two  observa¬ 
tions  for  ID=13.  The  variables  START  and  STOP  represent 
the  time  interval  for  the  risk  period  specific  to  that  obser¬ 
vation.  The  variable  EVENT  indicates  whether  an  event 
(coded  1)  occurred.  The  first  three  observations  indicate 
that  the  subject  with  ID=10  had  an  event  at  12  months, 
another  event  at  16  months,  and  was  censored  at  18 
months. 

PROC  PHREG  can  be  used  for  survival  data  using  a  count¬ 
ing  process  data  layout.  The  following  code  runs  a  model 
with  three  predictors  -  treatment  status  (TX),  initial  num¬ 
ber  of  tumors  (NUM),  and  the  initial  size  of  tumors  (SIZE) 
-  included  in  the  model: 

PROC  PHREG  DATA=BLADDER  COVS(AGGREGATE) ; 

MODEL  (START, STOP) * EVENT (0)=TX  NUM  SIZE; 

ID  ID; 

RUN; 

The  code  (START, STOP)*EVENT(0)  in  the  MODEL  state¬ 
ment  indicates  that  the  time  intervals  for  each  observation 
are  defined  by  the  variables  START  and  STOP  and  that 
EVENT=0  denotes  a  censored  observation.  The  ID  statement 
defines  ID  as  the  variable  representing  each  subject.  The 
COVS(AGGREGATE)  option  in  the  PROC  PHREG  statement 
requests  robust  standard  errors  for  the  parameter  estimates. 
The  output  generated  by  PROC  PHREG  follows: 
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The  PHREC  Procedure 
Model  Information 


Data  Set 

Dependent  Variable 
Dependent  Variable 
Censoring  Variable 
Censoring  Value(s) 
Ties  Handling 


WORK. BLADDER 

START 

STOP 

EVENT 

0 

BRESLOW 


Analysis  of  Maximum  Likelihood  Estimates 
Parameter  Standard  StdErr  Hazard 


Variable 

Estimate 

Error 

Ratio 

Chi-Square 

Pr  >  ChiSq 

Ratio 

TX 

-0.40710 

0.24183 

1.209 

2.8338 

0.0923 

0.666 

NUM 

0.16065 

0.05689 

1.185 

7.9735 

0.0047 

1.174 

SIZE 

-0.04009 

0.07222 

1.028 

0.3081 

0.5788 

0.961 

Coefficient  estimates  are  provided  with  robust  standard 
errors.  The  column  under  the  heading  StdErrRatio  pro¬ 
vides  the  ratio  of  the  robust  to  the  non-robust  standard 
errors.  For  example,  the  standard  error  for  the  coefficient 
for  TX  (0.24183)  is  1.209  greater  than  the  standard  error 
would  be  if  we  had  not  requested  robust  standard  errors 
(i.e.,  omit  the  COVS(AGGREGATE)  option).  The  robust 
standard  errors  are  estimated  slightly  different  compared 
to  the  corresponding  model  in  Stata  or  R. 


A  stratified  Cox  model  can  also  be  run  using  the  data  in  this 
format  with  the  variable  INTERVAL  as  the  stratified  vari¬ 
able.  The  stratified  variable  indicates  whether  the  subject 
was  at  risk  for  their  1st,  2nd,  3rd,  or  4th  event.  This  approach 
is  called  a  Stratified  CP  approach  in  Chapter  8  and  is  used 
if  the  investigator  wants  to  distinguish  the  order  in  which 
recurrent  events  occur.  The  code  for  a  stratified  Cox  fol¬ 
lows: 


PROC  PHREG  DATA=B LADDER  COVS (AGGREGATE)  ; 
MODEL  (START, STOP) * EVENT (0)=TX  NUM  SIZE; 
ID  ID; 

RUN; 
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The  only  additional  code  from  the  previous  model  is  the 
STRATA  statement,  indicating  that  the  variable  INTER¬ 
VAL  is  the  stratified  variable.  The  output  containing  the 
parameter  estimates  follows: 

Analysis  of  Maximum  Likelihood  Estimates 


Variable 

Parameter 

Estimate 

Standard 

Error 

StdErr 

Ratio 

Chi-Square 

Pr  >  ChiSq 

Hazard 

Ratio 

TX 

-0.33430 

0.19706 

0.912 

2.8777 

0.0898 

0.716 

NUM 

0.11565 

0.04991 

0.930 

5.3690 

0.0205 

1.123 

SIZE 

-0.00805 

0.06012 

0.827 

0.0179 

0.8935 

0.992 

Interaction  terms  between  the  treatment  variable  (TX)  and 
the  stratified  variable  could  be  created  to  examine  whether 
the  effect  of  treatment  differed  for  the  1st,  2nd,  3rd,  or  4th 
event. 

Another  stratified  approach  (called  Gap  Time)  is  a  slight 
variation  of  the  stratified  counting  process  approach.  The 
difference  is  in  the  way  the  time  intervals  for  the  recurrent 
events  are  defined.  There  is  no  difference  in  the  time  inter¬ 
vals  when  subjects  are  at  risk  for  their  first  event.  However, 
with  the  Gap  Time  approach,  the  starting  time  at  risk  gets 
reset  to  zero  for  each  subsequent  event.  The  following  code 
creates  data  suitable  for  using  the  gap-time  approach: 

DATA  BLADDER2; 

SET  REF. BLADDER; 

START2=0 ; 

ST0P2=ST0P-START ; 

RUN; 

The  new  dataset  (bladder2)  copies  the  data  from  re.bladder 
and  creates  two  new  variables  for  the  time  interval: 
START2,  which  is  always  set  to  zero  and  STOP2,  which  is 
the  length  of  the  time  interval  (i.e.,  STOP-START).  The 
following  code  uses  these  newly  created  variables  to  run  a 
Gap  Time  model  with  PROC  PHREG: 

PROC  PHREG  DATA=BLADDER2  COVS(AGGREGATE) ; 

MODEL  (START2 ,  ST0P2)  "'EVENT  (0)=TX  NUM  SIZE; 

ID  ID; 

STRATA  INTERVAL; 

RUN; 
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The  output  follows: 

Parameter  Standard  StdErr  Hazard 


Variable 

Estimate 

Error 

Ratio 

Chi-Square 

Pr  >  ChiSq 

Ratio 

TX 

-0.26952 

0.20808 

1.002 

1.6778 

0.1952 

0.764 

NUM 

0.15353 

0.04889 

0.938 

9.8620 

0.0017 

1.166 

SIZE 

0.00684 

0.06222 

0.889 

0.0121 

0.9125 

1.007 

The  results  using  the  Gap  Time  approach  vary  slightly 
from  that  obtained  using  the  Stratified  CP  approach. 


The  counting  process  data  layout  with  multiple  observa¬ 
tions  per  subject  need  not  only  apply  to  recurrent  event 
data,  but  can  also  be  used  for  a  more  conventional  survival 
analyses  in  which  each  subject  is  limited  to  one  event. 
A  subject  with  four  observations  may  be  censored  for  the 
first  three  observations  before  getting  the  event  in  the  time 
interval  represented  by  the  fourth  observation.  This  data 
layout  is  particularly  suitable  for  representing  time-vary¬ 
ing  exposures  (i.e.,  exposures  which  change  values  over 
different  intervals  of  time). 


C.  SPSS 

Analyses  are  carried  out  in  SPSS  by  using  the  appropriate 
SPSS  procedure  on  an  SPSS  dataset.  Most  users  select 
procedures  by  pointing  and  clicking  the  mouse  through 
a  series  of  menus  and  dialog  boxes.  The  code,  or  command 
syntax,  generated  by  these  steps  can  be  viewed  and  edited. 

Analyses  on  the  "addicts”  dataset  will  be  used  to  illustrate 
these  procedures.  The  addicts  dataset  was  obtained  from  a 
1991  Australian  study  by  Caplehorn  et  al.  and  contains 
information  on  238  heroin  addicts.  The  study  compared 
two  methadone  treatment  clinics  to  assess  patient  time 
remaining  under  methadone  treatment.  The  two  clinics 
differed  according  to  its  live-in  policies  for  patients. 
A  patient’s  survival  time  was  determined  as  the  time  (in 
days)  until  the  person  dropped  out  of  the  clinic  or  was 
censored.  The  variables  are  defined  at  the  start  of  this 
appendix. 
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After  getting  into  SPSS,  open  the  dataset  addicts.sav.  The 
data  should  appear  on  your  screen.  This  is  now  your  work¬ 
ing  dataset.  To  obtain  a  basic  descriptive  analysis  of  the 
outcome  variable  (SURVT),  click  on  Analyze  — »  Descriptive 
Statistics  — >  Descriptive  from  the  drop-down  menus  to 
reach  the  dialog  box  to  specify  the  analytic  variables.  Select 
the  SURVT  from  the  list  of  variables  and  enter  it  into  the 
variable  box.  Click  on  OK  to  view  the  output.  Alternatively, 
you  can  click  on  Paste  (rather  than  OK)  to  obtain  the 
corresponding  SPSS  syntax.  The  syntax  can  then  be  sub¬ 
mitted  (by  clicking  the  button  under  Run),  edited,  or  saved 
for  another  session.  The  syntax  created  is  as  follows  (out¬ 
put  omitted): 

DESCRIPTIVES 

VARIABLES=survt 

/STATISTICS=MEAN  STDDEV  MIN  MAX. 

There  are  some  analyses  that  SPSS  only  performs  by  sub¬ 
mitting  syntax  rather  than  using  the  point  and  click 
approach  (e.g.,  running  an  extended  Cox  model  with  two 
time-varying  covariates).  Each  time  the  point  and  click 
approach  is  presented,  the  corresponding  syntax  will  also 
be  presented. 

To  obtain  more  detailed  descriptive  statistics  on  survival 
time  stratified  by  CLINIC,  click  on  Analyze  — »  Descriptive 
Statistics  — >  Explore  from  the  drop-down  menus.  Select 
SURVT  from  the  list  of  variables  and  enter  it  into  the 
Dependent  List  and  then  select  CLINIC  and  enter  it  into 
the  Factor  List.  Click  on  OK  to  see  the  output.  The  syntax 
created  from  clicking  on  Paste  (rather  than  OK)  is  as  fol¬ 
lows  (output  omitted): 

EXAMINE 

VARIABLES=survt  BY  clinic 
/PLOT  BOXPLOT  STEMLEAF 
/COMPARE  GROUP 
/STATISTICS  DESCRIPTIVES 
/CINTERVAL  95 
/MISSING  LISTWISE 
/NOTOTAL. 

Survival  analyses  can  be  performed  in  SPSS  by  selecting 
Analyze  — >  Survival.  There  are  then  four  choices  for  selec¬ 
tion:  Life  Tables,  Kaplan-Meier,  Cox  Regression,  and  Cox 
w/  Time-Dep  Cov.  The  key  SPSS  procedures  for  survival 
analysis  are  the  KM  and  COXREG  procedures. 


Software:  C.  SPSS  609 


The  survival  analyses  demonstrated  in  SPSS  are  as  follows: 

1 .  Estimating  survival  functions  (unadjusted)  and 
comparing  them  across  strata 

2.  Assessing  the  PH  assumption  using  Kaplan-Meier  log- 
log  survival  curves 

3.  Running  a  Cox  PH  model 

4.  Running  a  stratified  Cox  model  and  obtaining  Cox 
adjusted  log-log  curves 

5.  Assessing  the  PH  assumption  with  a  statistical  test 

6.  Running  an  extended  Cox  model 

SPSS  (version  PASW  18)  does  not  provide  commands  to 

run  parametric  survival  models,  frailty  models,  or  models 

using  a  counting  process  data  layout  for  recurrent  events. 


1 .  ESTIMATING  SURVIVAL  FUNCTIONS 
(UNADJUSTED)  AND  COMPARING  THEM 
ACROSS  STRATA 

To  obtain  Kaplan-Meier  survival  estimates,  select  Analyze 
— +  Survival  — >  Kaplan-Meier.  Select  the  SURVT  from  the 
variable  list  and  enter  it  into  the  Time  box,  then  select  the 
variable  STATUS  and  enter  it  into  the  Status  box.  You  will 
then  see  a  question  mark  in  parentheses  after  the  status 
variable,  indicating  that  the  value  of  the  event  needs  to  be 
entered.  Click  the  Define  Event  button  and  insert  the  value 
1  in  the  box  since  the  variable  STATUS  is  coded  1  for  events 
and  0  for  censorships.  Click  on  Continue  and  then  OK  to 
view  the  output.  The  syntax,  obtained  from  clicking  on 
Paste  (rather  than  OK),  is  as  follows  (output  omitted): 

KM 

survt  /STATUS=status(l) 

/PRINT  TABLE  MEAN. 

The  stream  of  output  of  these  KM  estimates  is  quite  long. 
If  you  wish  to  edit  the  output,  try  right  clicking  inside  the 
output  and  then  select  Edit  Content.  You  then  have  a 
choice  to  select  In  Viewer  or  In  Separate  Window.  Click 
on  one  of  these  depending  on  if  you  want  to  open  a  sepa¬ 
rate  window  for  your  edited  output. 
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To  obtain  KM  survival  estimates  and  plots  by  CLINIC  as 
well  as  log  rank  (and  other)  test  statistics,  select  Analyze  — > 
Survival  — >  Kaplan-Meier  and  then  select  SURVT  as  the 
time-to-event  variable  and  STAUS  as  the  status  variable  as 
described  above.  Enter  CLINIC  into  the  Factor  box  and 
click  the  Compare  Factor  button.  You  have  a  choice  of 
three  test  statistics  for  testing  the  equality  of  survival  func¬ 
tions  across  CLINIC.  Select  all  three  (log  rank,  Breslow, 
and  Tarone-Ware)  for  comparison  and  click  Continue. 
Select  the  Options  button  to  request  plots.  There  are  four 
choices  (unfortunately,  log-log  survival  plots  are  not 
included).  Select  Survival  to  obtain  KM  plots  by  clinic. 
Click  Continue  and  then  OK  to  view  the  output. 

The  syntax  follows: 

KM 

survt  BY  clinic  /STATUS=status(l) 

/PRINT  TABLE  MEAN 
/PLOT  SURVIVAL 

/TEST  LOGRANK  BRESLOW  TARONE 
/COMPARE  OVERALL  POOLED. 

The  output  containing  the  KM  estimates  for  the  first  five 
events  or  censorship  times  from  CLINIC=1  and  CLINIC=2 
as  well  for  the  log  rank,  Breslow,  and  Tarone-Ware  tests 
follow: 


Survival  Analysis  for  SURVT  Survival  time  (days) 


Survival  Table 


clinic 

Time 

Status 

Cumulative  Proportion 

Surviving  at  the  Time 

N  of 

Cumulative 

Events 

N  of 

Remaining 

Cases 

Estimate 

Std.  Error 

i 

7.000 

endpoint 

.  994 

.006 

i 

161 

2 

17.000 

endpoint 

.  988 

.009 

2 

160 

3 

19.000 

endpoint 

.  981 

.011 

3 

159 

4 

28.000 

censored 

3 

158 

5 

28.000 

censored 

3 

157 

Software:  C.  SPSS  611 


Factor  CLINIC  =2.00 


2.00  1 

13.000 

endpoint 

.986 

.013 

1 

73 

2 

26.000 

endpoint 

.973 

.019 

2 

72 

3 

35.000 

endpoint 

.959 

.023 

3 

71 

4 

41.000 

endpoint 

.946 

.026 

4 

70 

5 

53.000 

censored 

4 

69 

Test  Statistics  for  Equality  of  Survival  Distributions  for  CLINIC 
Statistic  df  Significance 

Note  that  what  SPSS  calls  the  Breslow  test  statistic  is 
equivalent  to  what  Stata  (and  SAS)  call  the  Wilcoxon  test 
statistic. 

Life  Table  estimates  can  be  obtained  by  selecting  Analyze 
-4  Survival  — >  Life  Tables.  The  time-to-event  and  status 
variables  are  defined  similarly  as  described  above  for  KM 
estimates.  However  with  life  tables,  SPSS  presents  a  Dis¬ 
play  Time  Intervals  box.  This  allows  the  user  to  define  the 
time  intervals  used  for  the  life  table  analysis.  For  example, 
0  to  1,000/100  would  define  10  time  intervals  of  equal 
length.  Life  table  plots  can  similarly  be  requested  as 
described  above  for  the  KM  plots. 


2.  ASSESSING  THE  PH  ASSUMPTION  USING 
KAPLAN-MEIER  LOG-LOG  SURVIVAL  CURVES 

SPSS  does  not  provide  unadjusted  KM  log-log  curves  by 
directly  using  the  point  and  click  approach  with  the  KM 
command.  SPSS  does  provide  adjusted  log  log  curves  from 
running  a  stratified  Cox  model  (described  later  in  the  stra¬ 
tified  Cox  section).  A  log-log  curve  equivalent  to  the  unad¬ 
justed  KM  log-log  curve  can  be  obtained  in  SPSS  by 
running  a  stratified  Cox  without  including  any  covariates 
in  the  model.  In  this  section,  however,  we  illustrate  how 
new  variables  can  be  defined  in  the  working  dataset  and 
then  used  to  plot  unadjusted  log-log  KM  plots. 
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First,  a  variable  will  be  created  containing  the  KM  survival 
estimates.  Then,  another  new  variable  will  be  created  con¬ 
taining  the  log-log  of  the  survival  estimates.  Finally,  the 
log-log  survival  estimates  will  be  plotted  against  survival 
time  to  see  if  the  curves  for  CLINIC=1  and  CLINIC=2  are 
parallel.  Each  step  can  be  done  with  the  point  and  click 
approach  or  by  typing  in  the  code  directly. 

A  variable  containing  the  survival  estimates  can  be  created 
by  selecting  Analyze  — >  Survival  — >  Kaplan-Meier  and  then 
selecting  SURVT  as  the  time-to-event  variable,  STAUS  as 
the  status  variable,  and  CLINIC  as  the  factor  variable  as 
described  above.  Then  click  the  Save  button.  This  opens  a 
dialogue  box  called  Kaplan-Meier  Save  New  Variables. 
Check  Survival  and  click  on  Continue  and  then  on  Paste. 
The  code  that  is  created  is  as  follows: 

KM 

survt  BY  clinic  /STATUS=status(l) 

/PRINT  TABLE  MEAN 
/SAVE  SURVIVAL. 

By  submitting  this  code,  a  new  variable  containing  the  KM 
estimates  called  SUR_1  is  created.  To  create  a  new  variable 
called  11s  containing  the  log(-log)  of  SUR_1,  submit  the 
following  code: 

COMPUTE  11s  =  LN(-LN  (SUR  1 )). 

EXECUTE. 

The  above  code  could  also  be  generated  by  selecting  Trans¬ 
form  — +  Compute  Variable  and  defining  the  new  variable  in 
the  dialogue  box.  To  plot  11s  against  survival  time,  submit 
the  code: 


GRAPH 

/SCATTERPLOT(BIVAR)=survt  WITH  11s  BY  clinic 
/MISSING=LISTWISE. 

This  final  piece  of  code  could  also  be  run  by  selecting 
Graphs  — >  Legacy  Dialogue  — >  Scatter/Dot  — >  and  then 
clicking  on  Simple  Scatter  and  then  Define  in  the  Scatter/ 
Dot  dialogue  box.  Select  LLS  for  the  Y-axis,  SURVT  for  the 
X-axis,  and  CLINIC  in  the  Set  Marker  By  box.  Clicking  on 
paste  creates  the  code  or  clicking  OK  submits  the  program. 
A  plot  of  LLS  against  log(SURVT)  could  similarly  be  cre¬ 
ated.  Parallel  curves  support  the  PH  assumption  for 
CLINIC. 


Software:  C.  SPSS  61  3 


3.  RUNNING  A  COX  PH  MODEL 

A  Cox  PH  model  can  be  run  by  selecting  Analyze  — >  Sur¬ 
vival  — >  Cox  Regression.  Select  the  SURVT  from  the  vari¬ 
able  list  and  enter  it  into  the  Time  box,  then  select  the 
variable  STATUS  and  enter  it  into  the  Status  box.  You 
will  then  see  a  question  mark  in  parentheses  after  the 
status  variable,  indicating  that  the  value  of  the  event 
needs  to  be  entered.  Click  the  Define  Event  button  and 
insert  the  value  1  in  the  box  since  the  variable  STATUS  is 
coded  1  for  events  and  0  for  censorships.  Click  on  Continue 
and  select  PRISON,  DOSE,  and  CLINIC  from  the  variable 
list  and  enter  them  into  the  Covariates  box.  You  can  click 
on  Plots  or  Options  to  explore  some  of  the  options  (e.g., 
95%  Cl  for  exp(p)).  Click  OK  to  view  the  output  or  click  on 
Paste  to  see  the  code.  The  code  follows: 

COXREG 

survt  /STATUS=status(l) 

/METHOD=ENTER  prison  dose  clinic 
/CRITERIA=PIN(.05)  POUT(.IO)  ITERATE(20). 

Note  that  the  PH  assumption  is  assumed  to  hold  for  all 
three  covariates  using  this  Cox  model  (the  output  follows). 


Omnibus  Tests  of  Model  Coefficients3’ b 


-2Log 

Likelihood 

Overall  (score) 

Change  From 
Previous  Step 

Change  From 
Previous  Block 

Chi-square  df  Sig. 

Chi-square  df  Sig. 

Chi-square  df  Sig. 

1346.805 

56.273  3  .000 

64.519  3  .000 

64.519  3  .000 

a  Beginning  Block  Number  0,  initial  Log  Likelihood  function:  —2  Log  likelihood:  141 1.324 
b  Beginning  Block  Number  1 .  Method  =  Enter 


Variables  in  the  Equation 


B 

SE 

Wald 

df 

Sig. 

Exp(B) 

PRISON 

.327 

.167 

3.813 

1 

.051 

1.386 

DOSE 

—.035 

.006 

30.785 

1 

.000 

.965 

CLINIC 

-1.009 

.215 

22.045 

1 

.000 

.365 
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4.  RUNNING  A  STRATIFIED  COX  MODEL  AND 
OBTAINING  COX  ADJUSTED  LOG-LOG  CURVES 

A  stratified  Cox  model  is  run  by  selecting  Analyze  — > 
Survival  — >  Cox  Regression.  Select  the  SURVT  from  the 
variable  list  and  enter  it  into  the  Time  box.  Select  the 
variable  STATUS  and  enter  it  into  the  Status  box  and 
then  define  the  value  of  the  event  as  1.  Put  the  variables 
PRISON  and  DOSE  in  the  Covariates  box  and  the  variable 
CLINIC  in  the  Strata  box.  The  Cox  model  will  be  stratified 
by  CLINIC.  Click  the  Plots  button  and  check  Log  minus  log 
as  the  plot  type  and  then  click  on  Continue.  Click  on  OK  to 
view  the  output  or  click  on  Paste  to  see  the  code.  The  code 
follows: 


COXREG 

survt  /STATUS=status(l) 

/STRATA=clinic 

/METHOD =ENTER  prison  dose 
/PLOT  LML 

/CRITERIA=PIN(.05)  POUT(.IO)  ITERATE(20). 

The  output  containing  the  parameter  estimates  and  the 
adjusted  log  log  plots  follows: 


Variables  in  the  Equation 


B 

SE 

Wald 

df 

Sig. 

Exp(B) 

PRISON 

.389 

.169 

5.298 

1 

.021 

1.475 

DOSE 

-.035 

.006 

29.552 

1 

.000 

.965 

LML  Function  at  mean  of  covariates 


Survival  time  (days) 


Software:  C.  SPSS  615 


Notice  that  there  are  parameter  estimates  for  PRISON  and 
DOSE  but  not  CLINIC  since  CLINIC  is  the  stratified  vari¬ 
able.  The  Cox-adjusted  log  log  plots  are  fitted  using  the 
mean  values  of  PRISON  and  DOSE  and  are  used  to  evalu¬ 
ate  the  PH  assumption  for  CLINIC. 

Suppose  rather  than  using  the  mean  value  of  DOSE  for  the 
adjusted  log  log  plots,  you  wish  to  obtain  adjusted  plots  in 
which  DOSE=70.  Run  the  same  code  as  before,  up  to  click¬ 
ing  on  the  Plots  button  and  checking  Log  minus  log  as  the 
plot  type.  Instead,  click  on  DOSE(Mean)  in  the  window 
called  Covariate  Values  Plotted  at.  Underneath  the  heading 
called  Change  Value,  click  on  the  word  Value,  type  in  the 
value  70,  and  then  click  on  the  button  called  Change.  Now, 
the  variable  in  the  window  should  be  called  DOSE(70) 
rather  than  DOSE(Mean).  Click  on  Continue  and  then 
OK  to  view  the  output. 


5.  ASSESSING  THE  PH  ASSUMPTION  WITH 
A  STATISTICAL  TEST 

SPSS  does  not  easily  accommodate  a  statistical  test  on  the 
PH  assumption  using  the  Schoenfeld  residuals.  However, 
it  can  be  programmed  using  several  steps.  The  steps  are  as 
follows: 

1 .  Run  a  Cox  PH  model  to  obtain  the  Schoenfeld  residuals 
for  all  the  covariates.  These  residuals  are  saved  as  new 
variables  in  the  working  dataset. 

2.  Delete  observations  that  were  censored. 

3.  Create  a  variable  that  contains  the  ranked  order  of 
survival  time.  For  example,  the  subject  who  had  the 
fourth  event  gets  a  value  of  4  for  this  variable. 

3.  Run  correlations  on  the  survival  rankings  with  the 
Schoenfeld  residuals. 

4 .  The  p-value  for  testing  whether  the  correlation  between 
the  ranked  survival  time  and  the  covariate’s  Schoenfeld 
residuals  is  zero  is  the  samep-value  used  to  test  the  PH 
assumption.  The  null  hypothesis  is  that  the  PH 
assumption  is  not  violated. 

First,  run  a  Cox  PH  model  with  CLINIC,  PRISON,  and 
DOSE.  Click  on  the  Save  button  before  submitting  the 
model.  A  dialogue  box  appears  that  is  called  Cox  Regres¬ 
sion:  Save  Model  Variables.  Check  Partial  Residuals  and 
click  on  Continue.  This  creates  three  new  variables  in  the 
working  dataset  called  PR1_1,  PR2_1,  and  PR3_1,  which 
are  the  partial  residuals  (Schoenfeld  residuals)  for  CLINIC, 
PRISON,  and  DOSE,  respectively.  Click  OK  to  run  the 
model  (or  Paste  to  generate  the  code). 
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Next,  delete  all  censored  observations  (i.e.,  only  keep 
observations  in  which  STATUS=1).  To  do  this,  select  Data 
—>  Select  Cases.  Then  check  If  condition  is  satisfied,  and 
then  click  on  If.  Type  status=l  in  the  dialogue  box  and  click 
on  Continue.  Check  Delete  unselected  cases  in  the  box 
called  Output.  Click  OK  and  only  observations  with  events 
will  be  kept  in  the  dataset.  (Remember  to  go  back  to  the 
addicts  dataset  that  contains  the  censored  observations 
when  you  continue  work  through  the  other  sections  that 
use  the  addicts  data.) 

Create  the  variable  that  contains  the  ranking  of  survival 
times  by  selecting  Transform  — >  Ranked  Cases.  Select  the 
SURVT  into  the  Variables  box.  Click  on  Rank  Types,  check 
Ranks,  and  click  on  Continue  and  then  click  on  Ties,  check 
Mean,  and  click  Continue.  Click  OK  and  a  new  variable 
(called  Rsurvt)  will  be  created  containing  the  ranked  sur¬ 
vival  time. 

Finally,  obtain  correlations  (and  their  p-values)  between 
the  ranked  survival  and  the  Schoenfeld  residuals.  Select 
Analyze  — >  Correlate  — >  Bivariate.  Move  the  ranked  survival 
time  variable  as  well  as  the  three  partial  residual  variables 
into  the  variable  box.  Check  Pearson  (for  Pearson  correla¬ 
tions)  and  Two-tailed  for  a  two-tail  test  of  significance  and 
click  OK  to  see  the  output.  The  code  that  is  generated  from 
these  steps  follows: 

COXREG 

survt  /STATUS=status(l) 

/METHOD=ENTER  clinic  prison  dose 

/SAVE=  PRESID 

/CRITERIA=PIN(.05)  POUT(.IO)  ITERATE(20)  . 

FILTER  OFF. 

USE  ALL. 

SELECT  IF(status=l). 

EXECUTE 

RANK 

VARIABLES=survt  (A)  /RANK  /PRINT=YES 

/TIES=MEAN. 

CORRELATIONS 

/VARIABLES=Rsurvt  PR1_1  PR2  J  PR.3.  1 

/PRINT=TWOTAIL  NOSIG 

/MISSING=PAIRWISE. 
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The  output  containing  the  correlations  follows: 


Correlations 


RANK  of 
SURVT 

Partial 
residual  for 
CLINIC 

Partial 
residual  for 
PRISON 

Partial 
residual 
for  DOSE 

RANK  of  SURVT 

Pearson  Correlation 

1 

-.080 

.077 

Sig.  (2-tailed) 

.332 

.347 

N 

150 

150 

150 

150 

Partial  residual  for 

Pearson  Correlation 

1 

.010 

.023 

CLINIC 

Sig.  (2-tailed) 

.904 

.776 

N 

150 

150 

150 

150 

Partial  residual  for 

Pearson  Correlation 

-.080 

.010 

1 

.171* 

PRISON 

Sig.  (2-tailed) 

.332 

.904 

.037 

N 

150 

150 

150 

150 

Partial  residual  for  DOSE 

Pearson  Correlation 

.077 

.023 

.171* 

1 

Sig.  (2-tailed) 

.347 

.776 

.037 

N 

150 

150 

150 

150 

**•  Correlation  is  significant  at  the  0.01  level  (2-tailed). 
‘•Correlation  is  significant  at  the  0.05  level  (2-tailed). 


The  p-values  for  the  correlations  are  the  p-values  for 
the  PH  test.  In  the  output,  examine  the  row  labeled 
RANK  of  SURVT  Sig(2-tailed).  Notice  that  the  null  hypoth¬ 
esis  is  rejected  for  CLINIC  (p  =  0.001)  but  not  for  PRISON 
(p  =  0.332)  or  DOSE  (p  =  0.347). 


6.  RUNNING  AN  EXTENDED  COX  MODEL 

An  extended  Cox  model  with  exactly  one  time-dependent 
covariate  can  be  run  using  the  point  and  click  approach. 
Suppose  we  want  to  include  a  time-dependent  covariate 
DOSE  times  the  log  of  survival  time.  This  product  term 
could  be  appropriate  if  the  hazard  ratio  comparing  any  two 
levels  of  DOSE  monotonically  increases  (or  decreases)  over 
time.  Select  Analyze  — >  Survival  — >  Cox  w/  Time-Dep  Cov. 
This  opens  a  dialogue  called  Expression  for  T_COV_.  The 
user  defines  a  time-dependent  variable  (called  T_COV_)  in 
this  box.  A  variable  T_  is  included  in  the  variable  list.  This 
is  the  variable  that  represents  time-varying  survival  (as 
opposed  to  SURVT  which  is  an  individual’s  fixed  time  of 
event).  We  wish  to  define  T_COV_  to  be  the  log  of  T_  x 
DOSE.  Enter  the  expression  LN(T_)*dose  into  the  dialogue 
box  and  click  on  the  Model  button.  Now,  run  a  Cox  model 
that  includes  the  covariates:  PRISON,  CLINIC,  DOSE,  and 
T_COV_.  The  code  generated  is  as  follows: 
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TIME  PROGRAM. 

COMPUTE  T_COV_  =  LN(T_)  *  dose. 

COXREG 

survt  /STATUS=status(l) 

/METHOD=ENTER  prison  clinic  dose  T_COV_ 
/CRITERIA=PIN(,05)  POUT(.IO)  ITERATE(20). 

The  output  containing  the  parameter  estimates  follows: 


Variables  in  the  Equation 


B 

SE 

Wald 

df 

Sig. 

Exp(B) 

PRISON 

.340 

.167 

4.134 

1 

.042 

1.406 

CLINIC 

-1.019 

.215 

22.369 

1 

.000 

.361 

DOSE 

-.082 

.036 

5.247 

1 

.022 

.921 

T-COV. 

.009 

.006 

1.765 

1 

.184 

1.009 

The  variable  T_COV_  represents  the  time-dependent  vari¬ 
able  included  in  the  model,  which  in  this  example  is  DOSE 
times  the  log  of  survival  time. 


A  heaviside  function  for  CLINIC  can  similarly  be  created. 
We  can  define  a  time  dependent  variable  equal  to  CLINIC  if 
time  is  greater  than  or  equal  to  365  days  and  0  otherwise. 
Select  Analyze  — >  Survival  — >  Cox  w /  Time-Dep  Cov.  Define 
T_COV  to  be  (T_  x  365)  x  clinic.  After  clicking  on  the 
Model  button,  run  a  Cox  model  that  includes  PRISON, 
DOSE,  CLINIC,  and  T_COV_.  The  code  generated  is  as 
follows: 


TIME  PROGRAM. 

COMPUTE  T_COV_  =  (T_  >  =  365)*  clinic. 

COXREG 

survt  /STATUS=status(l) 

/METHOD=ENTER  prison  clinic  dose  T_COV_ 
/CRITERIA=PIN(,05)  POUT(.IO)  ITERATE(20). 

Note  that  SPSS  recognizes  the  expression  (T_  x  365)  as 
taking  the  value  1  if  survival  time  is  >365  days  and  0  other¬ 


wise. 
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The  output  follows: 


Variables  in  the  Equation 


B 

SE 

Wald 

df 

Sig. 

Exp(B) 

PRISON 

.378 

.168 

5.030 

1 

.025 

1.459 

CLINIC 

-.460 

.255 

3.241 

1 

.072 

.632 

DOSE 

-.036 

.006 

30.450 

1 

.000 

.965 

T-COV. 

-1.369 

.461 

8.799 

1 

.003 

.254 

Notice  the  variable  CLINIC  is  included  in  this  model  and 
the  time-dependent  heaviside  function,  T_COV_,  does  not 
contribute  to  the  estimated  hazard  ratio  until  day  365.  The 
estimated  hazard  ratio  for  CLINIC  at  100  days  is  exp(- 
0.460)  =  0.632  while  the  estimated  hazard  ratio  for  CLINIC 
at  400  days  is  exp((-0.460)  +  (-1.369))  =  0.161. 

It  may  be  of  interest  to  define  two  heaviside  functions  (with 
CLINIC)  and  not  include  CLINIC  in  the  model.  This  is 
essentially  the  same  model  as  the  one  described  above 
with  one  heaviside  function.  However,  the  coding  of  two 
heaviside  functions  makes  it  somewhat  computationally 
more  convenient  for  estimating  the  two  hazard  ratios  for 
CLINIC  (HR  for  <365  days  and  HR  for  >365  days).  Unfor¬ 
tunately,  SPSS  allows  just  one  time-dependent  variable  (i. 
e.,  T_COV_)  using  the  point  and  click  approach.  However, 
by  examining  the  code  created  for  the  single  heaviside 
function,  there  is  only  a  slight  adjustment  needed  to  create 
code  for  two  heaviside  functions.  The  following  code  cre¬ 
ates  two  heaviside  functions  (called  HV1  and  HV2)  and 
runs  a  model  containing  PRISON,  DOSE,  HV1,  and  HV2: 

TIME  PROGRAM. 

COMPUTE  hvl=  (T_  <  365)*  clinic. 

COMPUTE  hv2=  (T_  >=  365)*  clinic. 

COXREG 

survt  /STATUS=status(l) 

/METHOD=ENTER  prison  dose  hvl  hv2 
/CRITERIA=PIN(,05)  POUT(.IO)  ITERATE(20). 
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The  output  follows: 


Variables  in  the  Equation 


B 

SE 

Wald 

df 

Sig. 

Exp(B) 

PRISON 

.378 

.168 

5.030 

1 

.025 

1.459 

DOSE 

-.036 

.006 

30.450 

1 

.000 

.965 

HV1 

-.460 

.255 

3.241 

1 

.072 

.632 

HV2 

-1.828 

.386 

22.439 

1 

.000 

.161 

The  parameter  estimates  for  HV1  and  HV2  can  be  used 
directly  to  obtain  the  estimated  hazard  ratio  for  CLINIC=2 
vs  CLINIC=1  before  and  after  365  days.  The  estimated 
hazard  ratio  for  CLINIC  at  100  days  is  exp(-0.460)  = 
0.632  and  the  estimated  hazard  ratio  for  CLINIC  at  400 
days  is  expexp(-1.828)  =  0.161.  These  results  are  consistent 
with  the  estimates  obtained  from  the  previous  model  with 
one  heaviside  function. 


D.  R  Software 

R  is  available  for  free  and  can  be  downloaded  from  the 
Comprehensive  R  Archive  Network  (CRAN)  at  its  home 
site  at  http://www.r-project.org/.  Analyses  are  carried  out 
in  R  by  applying  functions  on  R  data  (stored  as  R  objects). 
R  functions  are  stored  in  packages.  Only  when  a  package  is 
loaded,  its  contents  are  available.  The  base  packages  are 
installed  when  you  download  R.  Packages  that  are  not  base 
packages  need  to  be  installed  separately. 

Once  you  open  R,  you'll  see  a  prompt:  Type  1  +  1  and  press 
enter.  You’ll  (hopefully)  see  the  answer  2  returned  at  the 
line  below.  Alternatively,  you  can  type  commands  in  a 
script  by  clicking  on  File  — >  New  script.  A  new  script  win¬ 
dow  will  open  up.  By  typing  commands  in  this  window, 
you  can  submit  batches  of  code  at  one  time  by  highlighting 
the  code  and  clicking  on  Edit  — >  Run  line  or  by  clicking  on 
Edit  — >  selection.  Programming  in  a  script  window  serves  a 
similar  function  as  the  program  editor  in  SAS  or  the  Do-file 
Editor  in  Stata,  in  that  code  can  be  submitted  as  a  block 
rather  than  one  line  at  a  time. 

To  see  which  packages  are  installed  at  your  site,  type  and 
enter  library(  ).  To  run  many  of  the  functions  needed  to 
perform  survival  analyses,  you  will  need  to  install  the  sur¬ 
vival  package  (not  generally  a  base  package). 
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To  install  the  survival  package,  click  on  Packages  — >  Install 
package(s).  You  will  see  a  heading  called  CRAN  mirror 
with  a  listing  of  many  different  countries  under  that  head¬ 
ing.  Click  on  one  of  these  (e.g.,  USA  (AZ))  and  then  scroll 
down  and  click  on  survival  and  then  click  OK.  The  survival 
package  (with  its  many  survival  functions)  should  now  be 
installed.  Type  library( survival)  and  press  enter,  and  the 
survival  package  will  be  ready  to  use.  As  a  check,  type  the 
word  kidney  and  hit  enter.  A  dataset  called  kidney  (which 
is  part  of  the  survival  package)  should  print  on  your  screen. 
Once  the  survival  package  is  installed,  you  do  not  have  to 
reinstall  it  each  session.  However,  you  will  need  to  type 
library(survival)  each  session  before  you  run  the  survival 
functions  contained  in  the  package. 

Before  discussing  survival  analyses  in  R,  it  may  be  useful  to 
give  a  brief  overview  on  some  of  the  ways  data  are  stored  in 
R.  In  particular,  we  describe  four  classes  of  data  storage: 
vectors,  matrices,  dataframes,  and  lists.  If  you  type  and 
enter  the  code  below,  R  will  create  a  numerical  vector  with 
five  elements: 

c(l,7,12,6,3) 

The  c  function  combines  its  arguments  to  form  a  vector. 
We  can  store  this  vector  as  an  object  under  the  name 
(identifier),  xl: 


xl=c(l, 7,12,6, 3) 


Type  xl  and  press  enter,  and  you  will  see  the  vector  xl 
printed  as  output.  The  code  and  output  are  shown  below: 

xl 

1  7  12  6  3 

We  can  identify  elements  from  the  vector  xl  by  placing 
brackets  [  ]  after  xl.  For  example,  xl[2]  will  identify  the 
2nd  element  of  xl.  The  code  xl[l:3]  will  identify  the  first 
three  elements  of  xl  and  the  code  xl[xl>6]  will  identify 
the  elements  in  xl  greater  than  6.  The  code  and  output  for 
these  three  examples  follow: 

xl[2] 

7 

xl[l:3] 

1  7  12 
xl[xl>6] 

7  12 
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The  :  operator  (used  in  the  2nd  example)  creates  a  sequence 
of  integers  incremented  by  1.  Next,  we  create  four  more 
vectors  called  x2,  x3,  x4,  and  x5: 

x2=2*(l:5) 
x3=2*xl +  x2 
x4=xl>z6 

x5=c(“blue”, “green”, “red”, “green”, “purple”) 

The  code  x2=2*(l:5)  creates  the  vector  2,  4,  6,  8,  and  10 
which  we  named  x2.  The  vector  x3  results  from  arithmetic 
operations  of  xl  and  x2  (2  x  xl  +  x2).  The  vectors  xl,  x2, 
and  x3  are  each  numeric  vectors.  If  you  apply  the  mode 
function  on  xl  (i.e.,  type  mode(xl)  and  press  enter),  it  will 
return  the  word  "numeric”  as  output.  The  vector  x4  is  a 
logical  vector.  The  mode  function  will  return  the  word 
“logical”  if  you  submit  the  code  mode(x4).  The  elements 
of  a  vector  of  mode  logical  are  either  "TRUE”  or  “FALSE.” 
Enter  the  code  x4  (output  below): 

x4 

FALSE  TRUE  TRUE  FALSE  FALSE 

The  2nd  and  3rd  elements  of  x4  are  TRUE  because  the  2nd 
and  3rd  elements  of  xl  are  greater  than  6.  The  vector  x5  is  a 
character  vector.  R  is  case  sensitive,  so  naming  the  vector 
x5  is  not  the  same  as  naming  it  X5. 

We  can  create  a  numeric  matrix  (called  y)  using  the  vec¬ 
tors  xl,  x2,  and  x3  as  columns  of  the  matrix  by  applying 
the  cbind  function: 

y=cbind(x  1  ,x2,x3) 

Enter  the  code  class(y)  and  the  word  “matrix”  will  be 
returned  as  output.  Enter  the  code  mode(y)  and  the  word 
"numeric”  will  be  returned  since  y  is  a  numeric  matrix.  You 
cannot  mix  numeric  and  character  vectors  in  a  matrix. 

Type  y  and  press  enter,  and  the  matrix  will  be  printed 
(shown  below): 

y 

xl  x2  x3 
[1,]  1  2  4 

[2,]  7  4  18 

[3,  ]  12  6  30 

[4,  ]  6  8  20 

[5,]  3  10  16 
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A  dataframe  provides  a  more  general  class  of  data  storage 
than  a  matrix  in  R  because  a  dataframe  can  contain  a  mix  of 
numeric,  character,  and  logical  variables.  A  dataframe  in  R 
is  similar  to  a  dataset  in  Stata,  SAS,  or  SPSS,  in  that  it  can 
store  different  types  of  variables.  The  data. frame  function 
can  be  used  to  combine  vectors  and  matrices  as  follows: 

data.frame(xl,x2,x3,x4,x5)  or,  equivalently,  z=data.frame(y,x4,x5) 

Type  z  and  press  enter,  and  the  dataframe  will  be  printed 
(shown  below): 

z 

xl  x2  x3  x4  x5 

1  124  FALSE  blue 

2  7  4  18  TRUE  green 

3  12  6  30  TRUE  red 

4  6  8  20  FALSE  green 

5  3  10  16  FALSE  purple 

Brackets  []  can  be  used  to  access  particular  rows  and/or 
columns  of  a  dataframe  or  matrix.  Enter  the  code:  z[2,5] 
and  the  2nd  row,  5th  column  will  be  printed  from  the  matrix 
z  (the  element  "green”  in  this  example).  If  you  want  to 
access  the  first  three  rows  (observations)  of  the  fifth  col¬ 
umn,  type  the  code  z[l:3,5]  or  equivalently  z[c(l,2,3),5]. 
If  you  want  to  access  the  entire  5th  column,  enter  z[,5]. 
Alternatively,  since  the  5th  column  (or  variable)  is  named 
x5,  you  can  access  the  entire  5th  column  by  entering  the 
code  z$x5.  The  $  in  this  example  points  to  the  variable 
named  x5  from  the  dataframe  named  z. 

A  list  offers  a  more  general  type  of  data  storage  than  the 
vector,  matrix,  or  dataframe,  and  can  include  any  of  those 
data  objects  as  part  of  the  list.  The  following  code  creates  a 
list  called  w  that  contains  a  character  vector  of  length  2  as 
its  first  element,  the  vector  xl  as  its  second  element,  the 
matrix  y  as  its  third  element,  and  the  dataframe  z  as  its 
fourth  element: 

w=list(c(“hello”,”good-bye”),xl,y,z) 

Double  brackets  [[  ]]  can  be  used  to  access  particular  ele¬ 
ments  of  a  list.  If  you  want  to  access  the  dataframe  z  from 
the  list  w,  enter  the  code  w[[4]]  since  z  is  the  fourth  element 
of  w.  If  you  want  to  access  the  first  row  third  column  of  the 
fourth  element  of  w  from  the  list,  enter  the  following  code: 

w[[4]]  [1,3] 

The  1 st  row  3rd  column  of  the  4th  element  of  w  has  the  value  4. 
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Survival  Functions  in  R 

Once  the  survival  package  has  been  installed  you  will  have 
access  to  the  survival  functions  needed  to  perform  the 
survival  analyses  in  this  appendix.  Enter  the  code  library 
(survival)  each  session  to  access  these  functions.  Some  of 
the  key  survival  functions  are  listed  below: 

Surv  -  Used  to  define  the  "time-to-event”  and  “status” 
outcome  variables.  This  function  creates  a  survival 
object  that  can  be  used  as  the  outcome  variable  for 
other  survival  functions  in  R 

survfit  -  Produces  KM  or  Cox-adjusted  survival  estimates 
or  survival  estimates  from  a  previously  fitted  parametric 
model 

survdiff  -  Used  to  perform  statistical  tests  for  the  equality 
of  survival  functions  across  strata 

coxph  -  Used  to  run  a  Cox  PH  model,  a  stratified  Cox 
model,  or  an  extended  Cox  model 

cox.zph  -  Performs  statistical  tests  on  the  PH  assumption 
based  on  Schoenfeld  residuals 

survSplit  -  Creates  a  new  dataset  in  the  counting  process 
format,  with  a  start  time,  stop  time,  and  event  status  for 
each  record.  Splits  single  observations  into  multiple 
observations  given  survival  data  and  specified  cut  times 

survreg  -  Used  to  run  parametric  survival  models 

Generic  functions  in  R  such  as  the  summary  function  and 
the  plot  function  are  often  used  in  conjunction  with  these 
survival  functions  in  order  to  produce  survival  estimates 
and  plots. 

R  documentation  (online  help)  for  these  functions  can  be 
obtained  by  typing  and  submitting  a  question  mark  and 
then  the  name  of  the  function  as  one  word.  For  example,  to 
access  R  documentation  on  the  coxph  function,  submit  the 
code  Pcoxph. 

The  survival  analyses  demonstrated  in  R  are  as  follows: 

1.  Estimating  survival  functions  (unadjusted)  and 
comparing  them  across  strata. 

2.  Assessing  the  PH  assumption  using  graphical 
approaches. 

3.  Running  a  Cox  PH  model. 

4.  Running  a  stratified  Cox  model. 

5.  Assessing  the  PH  assumption  with  a  statistical  test. 

6.  Obtaining  Cox-adjusted  survival  curves. 


Software:  D.  R  Software  625 


7.  Running  an  extended  Cox  model. 

8.  Running  parametric  models. 

9.  Running  frailty  models. 

10.  Modeling  recurrent  events. 

We  use  the  addicts  dataset  for  illustration.  The  load  func¬ 
tion  is  used  to  access  an  R  dataframe  that  has  been  saved 
as  a  file.  Suppose  the  addicts  dataset  has  been  saved  on 
your  C  drive  as  C:\craddicts.rda.  The  following  code  will 
load  the  addicts  data: 

load(  “C  :\craddicts.rda” ) 

To  print  the  addicts  dataset,  enter  the  code: 

addicts 

To  print  the  first  five  observations,  enter  the  code: 

addicts[l:5,  ] 

All  6  variables  (columns)  are  printed  because  there  was  no 
entry  after  the  comma.  Equivalently,  we  could  have 
entered  the  code  addicts[l:5,l:6].  The  output  follows: 


id 

clinic 

status 

survt 

prison 

dose 

1 

1 

1 

1 

428 

0 

50 

2 

2 

1 

1 

275 

1 

55 

3 

3 

1 

1 

262 

0 

55 

4 

4 

1 

1 

183 

0 

30 

5 

5 

1 

1 

259 

1 

65 

The  time-to-event  variable  in  the  addicts  dataset  is  named 
SURVT  and  the  variable  indicating  whether  a  subject  had 
an  event  or  was  censored  is  named  STATUS.  The  function 
Surv  creates  a  survival  object  in  R  linking  these  two  out¬ 
come  variables  (code  shown  below): 

Surv(addicts$survt,addicts$status== 1 ) 

The  first  argument  is  the  time-to-event  variable  which  is 
accessed  from  the  addicts  dataframe  with  the  $  notation 
(addicts$survt).  The  second  argument  (addicts$sta- 
tus==l)  indicates  an  event  occurs  (as  opposed  to  a  censor¬ 
ship)  when  the  status  variable  equals  1.  Notice  that  two 
equal  signs  are  used  to  express  equality.  A  single  equal  sign 
is  used  to  designate  assignment  in  R.  A  portion  of  the 
output  from  the  Surv  function  is  shown  below: 
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[1]  428  275  262  183  259  714  438  796+  892  393  161+  836 

[13]  523  612  212  399  771  514  512  624  209  341  299  826+ 

[25]  262  566+  368  302  602+  652  293  564+  394  755  591  787+ 

The  output  above  shows  the  survival  times  for  the  first  36 
subjects  in  the  addicts  data  (out  of  238).  A  plus  (+)  sign 
after  their  time  indicates  censorship  rather  than  event. 

This  survival  object  created  by  the  Surv  function  is  often 
used  in  R  as  the  response  variable  for  survival  analyses. 
Next  we  demonstrate  survival  analyses  in  R  by  specific 
topics. 


1 .  ESTIMATING  SURVIVAL  FUNCTIONS 
(UNADJUSTED]  AND  COMPARING  THEM 
ACROSS  STRATA 

Kaplan-Meier  survival  estimates  are  obtained  in  R  with  the 
use  of  three  functions.  The  Surv  function  (described 
above)  is  used  within  the  survfit  function,  which  is  then 
used  within  the  summary  function.  The  code  follows: 

summary(survfit(Surv(addicts$survt,addicts$status==  1 )~  1 )) 

To  better  understand  how  this  code  works  we’ll  break 
down  each  function.  The  code:  Y=Surv(addicts$survt, 
addicts$status==l)  creates  a  survival  object  called  Y  that 
is  used  as  the  response  variable  in  the  analysis.  Now  con¬ 
sider  the  code  Y~1  This  syntax  is  called  a  formula.  For¬ 
mulas  are  used  as  arguments  in  many  functions  in  R, 
particularly  those  that  specify  statistical  models.  Y~1 
requests  an  intercept  only  model.  In  other  words,  we  are 
not  conditioning  on  any  other  variable.  Later  in  this  sec¬ 
tion  we  stratify  on  the  variable  CLINIC  and  use  the  formula 
Y~  addicts$clinic.  A  formula  needs  to  be  supplied  as  the 
argument  of  the  survfit  function  (shown  below): 

kmfitl  =survfit(Y~  1 ) 

An  object,  which  we  named  kmfitl,  was  created  with  the 
survfit  function.  Enter  the  code  kmfitl  and  press  enter 
(output  shown  below): 

kmfitl 

records  n.max  restart  events  median  0.95LCL  0.95UCL 

238  238  238  150  504  399  560 
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The  output  contains  descriptive  information  on  the  num¬ 
ber  of  records,  the  number  at  risk  at  time  0,  the  number 
of  events,  and  the  median  estimated  survival  time  with 
a  95%  confidence  internal.  The  summary  function  can 
then  be  used  to  get  Kaplan-Meier  survival  estimates 
for  all  event  times.  The  code  summary(kmfitl)  is  equiva¬ 
lent  to  the  code  summary(survfit(Surv(addicts$survt, 
addicts$status==  1 )~  1 ))  shown  above.  The  output  follows: 


time  n.risk  n. event  survival  std.err  lower  95%  Cl  upper  95%  Cl 


7 

236 

1 

0.996 

0.00423 

0.9875 

1.000 

13 

235 

1 

0.992 

0.00597 

0.9799 

1.000 

17 

234 

1 

0.987 

0.00729 

0.9731 

1.000 

19 

233 

1 

0.983 

0.00840 

0.9667 

1.000 

26 

232 

1 

0.979 

0.00937 

0.9606 

0.997 

29 

229 

1 

0.975 

0.01026 

0.9546 

0.995 

30 

228 

1 

0.970 

0.01107 

0.9488 

0 . 992 

821 

20 

2 

0.225 

0.03675 

0.1635 

0.310 

836 

17 

1 

0.212 

0.03690 

0.1506 

0.298 

837 

16 

1 

0.199 

0.03689 

0.1380 

0.286 

857 

14 

1 

0.184 

0.03688 

0.1246 

0.273 

878 

13 

1 

0.170 

0.03667 

0.1116 

0.260 

892 

10 

1 

0.153 

0.03675 

0.0958 

0.245 

899 

9 

1 

0.136 

0.03639 

0.0807 

0.230 

The  summary  function  can  also  produce  survival  esti¬ 
mates  for  specified  survival  times  (e.g.,  at  day  365)  with 
the  times=  option.  Code  and  output  follow: 


summaryfkmfit  1  ,times= 365 ) 

time  n.risk  n. event  survival  std.err  lower  95%  Cl  upper  95%  Cl 
365  122  87  0.606  0.0331  0.545  0.675 

If  we  wish  to  stratify  by  the  variable  CLINIC  and  compare 
the  Kaplan-Meier  survival  estimates  at  specified  times,  we 
can  first  create  an  object  (called  kmfit2,  where  the  name  is 
arbitrary)  from  the  survfit  function: 


kmfit2 = survfit(  Y^addicts$  clinic) 


To  get  survival  estimates  at  specified  times  (every  100  days) 
for  each  level  of  CLINIC,  enter  the  code: 


summary(kmfit2,times=c(0, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000)) 
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The  output  follows: 


addict s$clinic=l 


time  n, 

.risk  n , 

.  event 

survival 

std .err 

lower  95%  Cl 

upper 

95%  Cl 

0 

163 

0 

1.0000 

0.0000 

1 . 00000 

1.000 

100 

137 

20 

0.8746 

0.0262 

0.82467 

0.928 

200 

110 

20 

0.7420 

0.0353 

0.67601 

0.814 

300 

87 

20 

0.6046 

0.0399 

0.53120 

0.688 

400 

68 

14 

0.5025 

0.0415 

0.42741 

0.591 

500 

53 

9 

0.4319 

0.0418 

0.35719 

0.522 

600 

30 

16 

0.2951 

0.0403 

0.22570 

0.386 

700 

20 

8 

0.2113 

0.0383 

0.14818 

0.301 

800 

10 

8 

0.1268 

0.0326 

0.07660 

0.210 

900 

1 

7 

0.0181 

0.0172 

0.00283 

0.116 

addict s$ clinic 

=2 

time  n. 

.risk  n , 

.  event 

survival 

std .err 

lower  95%  Cl 

upper 

95%  Cl 

0 

75 

0 

1.000 

0.0000 

1.000 

1.000 

100 

66 

5 

0.932 

0.0294 

0.876 

0 . 991 

200 

58 

7 

0.832 

0.0442 

0.750 

0.924 

300 

50 

7 

0.730 

0.0530 

0.633 

0.842 

400 

43 

3 

0.685 

0.0558 

0.584 

0.804 

500 

39 

2 

0.653 

0.0577 

0.549 

0.776 

600 

27 

1 

0.634 

0.0590 

0.528 

0.761 

700 

19 

1 

0.606 

0.0625 

0.495 

0.742 

800 

11 

1 

0.575 

0.0669 

0.457 

0.722 

900 

7 

1 

0.517 

0.0812 

0.380 

0.703 

1000 

3 

0 

0.517 

0.0812 

0.380 

0.703 

Survival  estimates  are  supplied  for  each  100th  day.  For 
CLINIC=1,  survival  times  stopped  at  900  rather  than  1000 
as  requested  because  no  subject  was  at  risk  on  day  1000. 
The  second  argument  of  the  summary  function  requesting 
a  vector  of  survival  times  could  have  been  equivalently 
written:  summary(kmfit2,times=100*(0:10)).  The  output 
would  be  identical  if  this  alternative  syntax  had  been  used. 


KM  survival  plots  can  be  obtained  using  the  plot  function: 
plot(kmfit2) 


There  are  many  plotting  options  that  can  be  applied  with 
the  plot  function.  The  code  below  requests  different  line 
types  (lty=)  and  different  colors  (col=)  for  CLINIC=1  and 
CLINIC=2  as  well  as  labels  for  the  X  and  Y  axes  (xlab=  and 
ylab=).  If  the  code  col( )  is  submitted,  then  R  returns  a  list 
of  over  600  colors  that  can  be  selected  with  the  col=  option. 
The  legend  function  is  used  to  add  a  legend.  The  first 
argument,  “topright,”  places  the  legend  at  the  top  right 
part  of  the  graph.  The  code  and  output  follow: 
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plot(kmfit2,  lty  =  c(“solid”,  “dashed”),  col=c(“black”,“grey”), 
xlab= “survival  time  in  days”,ylab=“survival  probabilities”) 

legendf “topright” ,  c(“Clinic  1”, “Clinic  2”),  lty=c(“solid”,“dashed”), 
col= c(  “black”,  “grey” ) ) 


The  plot  indicates  that  subjects  from  CLINIC=2  have  a 
higher  rate  of  survival  than  subjects  from  CLINIC=1. 

The  survdiff  function  can  be  used  to  implement  a  log  rank 
test  on  the  variable  CLINIC  (the  code  follows): 

survdiff(Surv(survt,status)~clinic,  data=addicts) 

The  second  argument  of  the  survdiff  function,  data= 
addicts,  indicates  that  the  variables  come  from  the  addicts 
dataset.  Alternatively,  you  could  use  the  code: 

survdiff(Surv(addicts$survt,addicts$status)~addicts$clinic) 

As  a  third  alternative,  the  attach  function  can  be  used  to 
indicate  that  all  subsequent  variable  names  apply  to  the 
addicts  dataset  (R  will  search  the  addicts  dataset  for  vari¬ 
ables).  The  detach  function  can  be  used  to  remove  a  data¬ 
set  from  the  search  path. 

attach(addicts) 

survdiff(Surv(survt,status)~clinic) 
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The  output  follows: 


N  Observed  Expected  (O-E) A2/E  (O-E) A2/V 

clinic=l  163  122  90.9  10.6  27.9 

clinic=2  75  28  59.1  16.4  27.9 

Chisq=  27.9  on  1  degrees  of  freedom,  p=  1.28e-07 

The  log  rank  statistic  is  highly  significant  with  a  p-value  of 
0.000000128  (i.e.,  1.28e-07). 

Variations  of  the  log  rank  test  can  be  obtained  by  using  the 
rho=  option  as  an  argument  in  the  survdiff  function.  The 
contribution  of  the  jth  failure  time  to  the  test  statistic  is 
weighted  by  s(tj)rho,  where  s(tj)  represents  the  KM  survival 
estimates  at  time  tj.  If  rho=0,  then  each  failure  time  is 
equally  weighted  since  s(tj)°  =  1  and  the  resulting  test  is 
the  log  rank  test.  If  rho=  1 ,  then  the  weights  for  each  failure 
time  are  the  KM  survival  estimate  at  that  failure  time  since 
s(tj)1  =  s(tj).  This  test  is  equivalent  to  the  Peto  &  Peto 
modification  of  the  Gehan-Wilcoxon  test.  The  code  and 
output  with  rho=l  follows: 

survdiff(Surv(survt,status)~clinic,data=addicts,rho=  1 ) 


N  Observed  Expected  (O-E) A2/E  (O-E) A2/V 

clinlc=l  163  77.3  61.4  4.08  15.8 

clinic=2  75  19.9  35.7  7.03  15.8 

Chisq=  15.8  on  1  degrees  of  freedom,  p=  7.18e-05 

The  results  of  the  test  in  which  rho=l  yield  a  chi-square 
value  of  15.8  with  a  p-value  of  0.0000718.  This  is  a  some¬ 
what  different  result  than  the  log  rank  test  but  still  shows  a 
highly  significant  effect  of  CLINIC  on  survival. 

A  stratified  log  rank  test  for  CLINIC  (stratified  by  PRISON) 
can  be  ran  with  the  +  strata(prison)  term  included  in  the 
model  formula.  With  the  stratified  approach,  the  observed 
minus  expected  number  of  events  are  summed  over  all 
failure  times  for  each  group  within  each  stratum  and 
then  summed  over  all  strata.  The  code  and  output  follow: 

survdiff(Surv(survt, status)  ~  clinic  +  strata(prison),data=addicts) 


N  Observed  Expected  (0-E)A2/E  (O-E) A2/V 
clinic=l  163  122  91.7  10.0  26.9 
clinic=2  75  28  58.3  15.8  26.9 


Chisq=  26.9  on  1  degrees  of  freedom,  p=  2 . le-07 


Software:  D.  R  Software  631 


The  formula  includes  the  term  +  strata(prison)  in  the 
survdiff  function.  The  result  of  this  test  is  very  similar  to 
that  obtained  from  the  log  rank  test  without  stratifying  on 
PRISON. 


2.  ASSESSING  THE  PH  ASSUMPTION  USING 
GRAPHICAL  APPROACHES 

The  proportional  hazards  assumption  for  CLINIC  can  be 
assessed  by  plotting  log-log  Kaplan  Meier  survival  esti¬ 
mates  against  time  (or  against  the  log  of  time)  and  evalu¬ 
ating  whether  the  curves  are  reasonably  parallel.  Recall 
that  a  survival  object,  called  kmfit2,  was  created  in  the 
previous  section  with  the  survfit.  The  code  plot(survfit2) 
was  used  to  plot  the  survival  estimates  against  time.  The 
fun=“cloglog”  option  in  the  plot  function  requests  that 
log-log  survival  plot  be  plotted  against  time  (on  the  log 
scale).  The  code  follows: 

plot(kmfit2,fun=“cloglog”,xlab=“time  in  days  using  logarithmic 

scale”, ylab= “log-log  survival”,  main= “log-log  curves  by  clinic”) 

The  xlab=  and  ylab=  request  labels  for  the  x-  andy-axes  and 
the  main=  option  requests  a  title.  fun=“cloglog”  requests 
the  complimentary  log  log  function.  The  output  follows: 


log-log  curves  by  clinic 
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The  plot  suggests  that  the  proportional  hazards  assumption 
is  violated  as  the  log-log  survival  curves  are  not  parallel.  The 
fun=  option  (fun  denotes  function)  plots  time  on  a  logarith¬ 
mic  scale.  It  is  not  so  straightforward  if  you  want  the  log-log 
survival  estimates  plotted  against  time  with  time  not  on  a 
logarithmic  scale.  However,  it  may  be  useful  to  program  this 
task  in  order  to  illustrate  how  analytic  output  can  be  saved, 
manipulated,  and  plotted.  To  do  that,  we  first  save  the 
survival  estimates  as  an  object  (which  we  will  call  kmfit3) 
using  the  summary  function: 

kmfit3 = summary  (kmfit2) 

If  we  submit  the  code  names(kmfit3),  then  the  column 
names  of  the  object  kmfit3  are  printed.  We  are  interested 
in  the  columns  that  indicate  each  subject’s  survival  time, 
KM  survival  estimate,  and  level  of  clinic  (1  or  2).  With  the 
names  function,  we  can  see  that  these  columns  are  called 
time,  surv,  and  strata.  We  can  examine  any  of  these  three 
columns  by  submitting  kmfit3$strata,  kmfit3$time,  or 
kmfit3$surv  as  code.  A  dataframe  (called  kmfit4)  consist¬ 
ing  of  these  three  columns  as  variables  can  be  created  with 
the  data.frame  function: 

kmfit4=data.frame(kmfit3$strata,kmfit3$time,kmfit3$surv) 
names(kmfit4)=c(“clinic”, “time”, “survival”) 

The  names  function  is  used  (above)  on  kmfit4  to  overwrite 
the  default  variable  names.  Next,  well  print  the  first  5 
observations  of  kmfit4: 

kmfit4[l:5,  ] 

clinic  time  survival 

1  addicts$clinic=l  7  0.9938272 

2  addicts$clinic=l  17  0.9876543 

3  addicts$clinic=l  19  0.9814815 

4  addicts$clinic=l  29  0.9752300 

5  addicts$clinic=l  30  0.9689785 

We  are  interested  in  separating  out  CLINIC=1  and 
CLINIC=2.  Below,  we  create  two  dataframes  (clinic  1  and 
clinic2)  from  kmfit4: 

clinic  1  =kmf it4 [kmf it4 $clinic==“addicts$ clinic  =  1  ” ,  ] 

clinic2=kmfit4[kmfit4$clinic==“addicts$clinic=2”,  ] 

The  dataframes  clinic  1  and  clinic2  contain  the  survival 
times  and  survival  estimates  for  those  in  CLINIC=1  and 
CLINIC=2,  respectively.  We  can  now  use  the  plot  function 
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to  plot  the  log-log  survival  curves  against  time  (with  time 
not  plotted  on  the  log  scale).  The  code  follows: 

plot(clinicl$time,log(-log(clinicl$survival)),xlab=“survival  time  in  days”,ylab= 
“log-log  survival”  ,xlim=c(0, 800), col=“black”,type=T,lty=“solid”,main=“log-log 
curves  by  clinic”) 

par(new=T) 

plot(clinic2$time,log(-log(clinic2$survival)),axes=F,xlab=“survival  time  in 
days”,ylab=“log-log  survival”, col=“grey50”,type=T,lty=  “dashed”) 

legend(“bottomright”,  c(“Clinic  1”,  “Clinic  2”),  lty  =  c(“solid”,  “dashed”), col=c 
(“black”, “grey50”)) 

par(new=F) 


In  the  first  plot,  time  (clinic  1  $  time)  is  plotted  on  the  x  axis, 
and  the  log(-log)  of  survival  (clincl$survival)  is  plotted  on 
the  y  axis  using  the  dataframe  clinic  1.  The  code  par 
(new=T)  requests  that  the  first  plot  not  get  erased  when 
the  second  plot  is  requested  (i.e.,  the  two  plots  will  be  over- 
layed).  The  par  function  is  used  to  set  or  query  graphical 
parameters.  The  second  plot  function  is  similar  to  the  first 
except  that  the  data  that  is  plotted  are  from  the  dataframe 
clinic2.  A  legend  is  added  with  the  legend  function  and 
finally  par(new=F)  sets  the  graphical  parameter  new  back 
to  its  default  value  of  false  (so  that  these  plots  will  be  erased 
when  the  next  plot  is  requested).  The  output  follows: 


log-log  curves  by  clinic 


The  plot  suggests  that  the  proportional  hazards  assump¬ 
tion  is  violated  for  CLINIC. 
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3.  RUNNING  A  COX  PH  MODEL 

The  coxph  function  is  used  to  run  a  Cox  proportional 
hazards  model.  First,  the  response  variable  is  created 
with  the  Surv  function  and  then  a  Cox  PH  model  contain¬ 
ing  the  variables  CLINIC,  PRISON,  and  DOSE  is  run  with 
the  coxph  function.  The  code  and  the  coxph  output  follow: 

Y = Surv(  addicts  $  survt,  addicts  $  status = =  1 ) 
coxph(Y~  prison  +  dose  +  clinic, data=addicts) 

coef  exp(coef)  se(coef)  z  p 

prison  0.3266  1.386  0.16722  1.95  5.1e-02 

dose  -0.0354  0.965  0.00638  -5.54  2.9e-08 

clinic  -1.0099  0.364  0.21489  -4.70  2.6e-06 

Likelihood  ratio  test=64 . 6  on  3  df,  p=6.23e-14  n=  238 

The  output  contains  the  regression  coefficients,  the  expo¬ 
nentiated  coefficients  (estimated  hazard  ratios),  as  well  as 
the  standard  errors,  z-tests,  and  corresponding  p-values  for 
the  coefficients.  Additional  output  including  95%  confi¬ 
dence  intervals  can  be  obtained  by  applying  the  summary 
function  to  the  coxph  function  (code  and  output  shown 
below): 

summary  (coxph  (Y~  prison  +  dose  +  clinic, data=addicts)) 


coef  exp (coef) 
prison  0.326555  1.386184 
dose  -0.035369  0.965249 
clinic  -1.009896  0.364257 


se (coef)  z  Pr (> [ z  1  ) 

0.167225  1.953  0.0508  . 

0.006379  -5.545  2.94e-08  *** 
0.214889  -4.700  2.61e-06  *** 


Signif.  codes:  0  '***'  0.001  '**'  0.01  0.05  0.1 


exp (coef)  exp (-coef)  lower  .95  upper  .95 
prison  1.3862  0.7214  0.9988  1.9238 
dose  0.9652  1.0360  0.9533  0.9774 
clinic  0.3643  2.7453  0.2391  0.5550 


Rsquare=  0.238  (max  possible^  0.997  ) 
Likelihood  ratio  test=  64.56  on  3  df, 
Wald  test  =54.12  on  3  df. 

Score  (logrank)  test  =  56.32  on  3  df, 


p=6 . 228e-14 
p=l . 056e-ll 
p=3 . 598e-12 


1 


The  second  table  of  the  output  gives  the  estimated  hazard 
ratio,  under  the  column  exp(coef),  for  CLINIC=2  vs 
CLINIC=  1  at  0.3643  with  95%  Cl  (0.2391,  0.5550).  Under 
the  column  exp(-coef),  we  see  that  the  estimated  hazard 
ratio  for  CLINIC=1  vs  CLINIC=2  is  2.7453  (the  reciprocal 
of  0.3643). 
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If  any  of  the  events  in  the  data  occur  at  the  same  time,  there 
are  several  options  for  handling  ties  in  the  Cox  likelihood. 
R  offers  three  approaches  with  the  coxph  function  1)  the 
Efron  method  (the  default),  2)  the  Breslow  method,  and  3) 
the  exact  method.  Generally,  these  methods  have  little 
impact  on  the  estimates  but  model  results  obtained  from 
different  software  packages  may  differ  depending  on  the 
default  tie  handling  method.  R  uses  the  Efron  method  as 
the  default  while  Stata,  SAS  and  SPSS  use  the  Breslow 
method  as  the  default.  The  method=  option  in  the  coxph 
function  is  used  to  specify  the  method  for  handling  ties 
(code  shown  below,  output  omitted): 

coxph(Y~  prison  +  dose  +  clinic, data=addicts,  method=”efron”) 
coxph(Y~  prison  +  dose  +  clinic, data=addicts,  method=”breslow”) 
coxph(Y~  prison  +  dose  +  clinic, data=addicts,  method=”exact”) 

Next  we  include  two  interaction  (product)  terms  with 
PRISON  and  test  the  significance  of  the  interaction  terms 
simultaneously  with  a  likelihood  ratio  test.  The  following 
code  creates  two  objects  (called  modi  and  mod2)  that 
contain  information  obtained  from  the  coxph  function 
for  the  no  interaction  model  (modi  -  the  reduced  model) 
and  the  model  with  the  two  interaction  terms  (mod2  -  the 
full  model) 

modl=coxph(Y  ~  prison  +  dose  +  clinic,data=addicts) 
mod2=coxph(Y  ~  prison  +  dose  +  clinic  +  clinic*prison 
+  clinic*dose,  data=addicts) 


Enter  the  code  mod2  to  see  the  output  with  the  interaction 
terms  (code  and  output  shown  below): 


mod2 

coef 

exp (coef) 

se (coef) 

z 

P 

prison 

1.1920 

3.294 

0 .5414 

2.202 

0.028 

dose 

-0.0193 

0.981 

0.0194 

-0.998 

0.320 

clinic 

0 . 1747 

1.191 

0.8931 

0 .196 

0.840 

prison : clinic 

-0.7380 

0.478 

0.4315 

-1.710 

0.087 

dose : clinic 

-0.0139 

0.986 

0.0143 

-0.967 

0.330 

Likelihood  ratio  test=68.1  on  5  df,  p=2.52e-13  n=  238 

The  rest  of  this  section  gets  a  little  complicated  but  we 
include  it  to  demonstrate  how  analytic  output  in  R  can  be 
accessed  and  then  manipulated. 
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The  objects  modi  and  mod2  contain  information  that  we 
may  wish  to  utilize.  Type  the  code  names(mod2)  to  see  the 
names  of  the  elements  in  mod2  (code  and  output  shown 
below): 


names(mod2) 


[1] 

"coefficients" 

"var" 

"loglik" 

[4] 

"score" 

"iter" 

"linear . predictors 

[7] 

"residuals" 

"means" 

"method" 

[10] 

"n" 

"terms" 

"assign" 

[13] 

[16] 

"wald. test" 

"call" 

"y" 

"formula" 

The  3rd  element  of  mod2  is  named  “loglik.”  We  can  access 
the  data  stored  under  this  name  by  entering  the  code  mod2 
$loglik  or  equivalently  mod2[[3]]  since  loglik  is  the  3rd 
element  in  the  list  (code  and  output  follow): 

mod2$loglik 

-705.6619  -671.5997 

The  second  element  of  mod2$loglik  is  —671.5997,  which 
is  the  log  likelihood  of  the  two  interaction  terms  in  the 
model.  The  first  element  of  -704.6619  is  the  log  likelihood 
of  a  model  that  contains  none  of  the  predictors  (not  of 
interest  right  now). 

Next  we  wish  to  perform  a  likelihood  ratio  test  on  the  two 
interaction  terms.  To  calculate  the  test  statistic,  we  need  to 
subtract  the  log-likelihood  of  the  full  model  (with  the  inter¬ 
action  terms)  from  the  reduced  model  (without  the  inter¬ 
action  terms)  and  multiply  that  difference  by  negative  2. 
We  can  obtain  this  by  entering  the  following  code: 

(-2)*(modl$loglik[2]-mod2$loglik[2]) 

We  get  the  output:  3.605457,  which  is  the  likelihood  ratio 
test  statistic.  Under  the  null,  this  test  statistic  follows  a  chi- 
square  distribution  with  two  degrees  of  freedom.  We  can 
use  the  pchisq  function  to  obtain  a  p-value  for  this  test. 
The  code  1  x  pchisq(3. 605457, 2)  returns  the  p-value  for  a 
two  degree  of  freedom  chi-square  test.  In  summary,  the 
following  code  will  produce  a  p-value  for  the  likelihood 
ratio  test  (output  follows): 

LRT=(-2)*(modl$loglik[2]-mod2$loglik[2]) 
Pvalue  =  1  -  pchisq(LRT,  2) 

Pvalue 

0.1648485 
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Thep-value  of  0.168485  is  not  significant  at  the  0.05  level  of 
significance. 

One  of  the  powerful  features  of  R  is  the  ability  for  users  to 
define  their  own  functions.  We  illustrate  this  feature  by 
defining  our  own  function  that  performs  a  likelihood 
ratio  test  from  two  Cox  models  (a  full  and  reduced 
model).  The  following  code  creates  a  function  which  we 
call  lrt.surv.  This  function  requests  the  user  to  enter  three 
arguments  (1)  the  name  of  the  full  model,  (2)  the  name  of 
the  reduced  model,  and  (3)  the  degrees  of  freedom  for  the 
test.  The  function  will  return  the  p-value  for  the  likelihood 
ratio  test. 

An  R  function  called  function  is  used  to  define  a  new 
function.  The  three  arguments  for  this  function  we  call 
mod.full,  mod.reduced,  and  df.  The  code  that  R  will  use 
to  calculate  the  function  output  is  contained  within  brack¬ 
ets  { }  after  the  arguments  are  listed.  The  argument  in  the  R 
function  called  return  informs  R  of  the  output  that  we 
wish  to  return  from  this  function  (in  this  example,  the  p- 
value  for  the  likelihood  ratio  test).  The  code  follows: 

lrt.surv=function(mod.fu!l, mod.reduced, df)  { 
lrts=(-2)*(mod.full$loglik[2]-  mod.reduced$loglik[2]) 
pvalue=  1  -pchisq(lrts,df ) 
return(pvalue) 

) 

Once  this  code  is  submitted  any  user  can  obtain  a  p-value 
from  a  likelihood  ratio  test  from  two  Cox  models  by  invok¬ 
ing  the  function  lrt.surv.  We  invoke  this  new  function  by 
performing  the  same  likelihood  ratio  test  that  we  previ¬ 
ously  ran  for  the  objects  modi  and  mod2.  The  code  and 
output  follow: 

lrt.surv(modl,  mod2,  2) 

0.1648485 

The  p-value  is  the  same  as  that  which  we  obtained  earlier. 
The  function  lrt.surv  is  more  general  and  now  available  to 
simply  obtain  p-values  for  other  likelihood  ratio  tests  that 
compare  two  (full  and  reduced)  Cox  models. 
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4.  RUNNING  A  STRATIFIED  COX  MODEL 

If  the  proportional  hazards  assumption  is  violated  for  the 
variable  CLINIC  but  met  for  PRISON  and  DOSE,  a  strati¬ 
fied  Cox  model  can  be  performed  with  CLINIC  the  strati¬ 
fied  variable.  The  coxph  function  includes  a  strata() 
option  in  the  model  formula.  First  we  define  the  response 
variable  Y  with  the  Surv  function  and  then  the  coxph 
function  is  used  to  run  a  stratified  Cox  model  (code  and 
output  shown  below): 

Y=Surv(addicts$survt,addicts$status==  1 ) 

coxph (Y-  prison  +  dose  +  strata(clinic),data=addicts) 

coef  exp(coef)  se(coef)  z  p 

prison  0.3896  1.476  0.16893  2.31  2.1e-02 

dose  -0.0351  0.965  0.00646  -5.43  5.6e-08 

Likelihood  ratio  test=33.9  on  2  df,  p=4.32e-08  n=  238 

Interaction  terms  for  CLINIC  can  be  included  directly  in 
the  model  formula  by  including  product  terms  using  the  : 
operator  (clinicrprison  and  clinic:dose)  (code  and  output 
follow): 

coxph(Y  prison  +  dose  +  clinic:prison  +  clinic:dose  + 
strata(clinic),data=addicts) 


prison 

dose 

prison : clinic 
dose : clinic 


coef  exp 
1.08584 
-0.03464 
-0.58299 
-0.00116 


(coef)  se (coef) 
2.962  0.5386 
0.966  0.0198 
0.558  0.4281 
0.999  0.0146 


z  p 

2.0159  0.044 
-1.7495  0.080 
-1.3617  0.170 
-0.0799  0.940 


Likelihood  ratio  test=35.8  on  4  df,  p=3.22e-07  n=  238 

Suppose  we  wish  to  estimate  the  hazard  ratio  for 
PRISON=l  vs.  PRISON=0  for  CLINIC=2.  This  hazard 
ratio  can  be  estimated  by  exponentiating  the  coefficient 
for  prison  plus  2  times  the  coefficient  for  the  CLINIC* 
PRISON  interaction  term.  This  expression  is  obtained  by 
substituting  the  appropriate  values  into  the  hazard  in  both 
the  numerator  (for  PRISON=l)  and  denominator  (for 
PRISON=0)  (see  below): 

h0(t)  expjd/4  +  jhDOSE  +  (2)(l)/?3  +  ^CLINIC  x  DOSE]  = 

h0(t)  exp[10  +  p2DOSE  +  (2)(0)/?3  +  /?4 CLINIC  x  DOSE]  PlPl  ~P2>' 
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The  resulting  hazard  ratio,  expffh  +  2p2),  is  an  exponen¬ 
tiated  linear  combination  of  parameters.  Unfortunately, 
R  does  not  have  a  lincom  command  that  Stata  provides 
or  an  estimate  statement  that  SAS  provides  in  order  to 
calculate  a  linear  combination  of  parameter  estimates. 
However  an  approach  that  can  be  used  in  any  statistical 
software  package  for  such  a  situation  is  to  recode  the 
variable(s)  of  interest  such  that  the  desired  estimate  is  no 
longer  a  linear  combination  of  parameter  estimates. 

In  this  example,  we  are  interested  in  a  hazard  ratio 
PRISON=l  versus  PRISON=0  for  CLINIC=2.  We  can 
define  a  new  variable  CLINIC  x  2  so  when  CLINIC=2, 
CLINIC  x  2=0. 

Addict  s  $  clinic2 = addict  s  $  clinic-  2 

summary ( coxph( Y ~  prison+dose+clinic2  :prison+ 

clinic2:dose+strata(clinic2),data=addicts)) 

The  first  line  of  code  defines  a  new  variable  CLINIC2. 
CLINIC2  is  used  in  the  stratified  Cox  model  rather  than 
CLINIC.  We  are  interested  in  the  hazard  ratio  for 
PRISON=l  vs  PRISON=0  for  CLINIC2=0.  When 
CLINIC2=0,  the  product  terms  cancel  and  the  hazard 
ratio  reduces  to  e x p ( [I , ) . 

The  second  line  of  code  applies  the  summary  function  to 
the  coxph  function.  The  summary  function  applied  in  this 
way  produces  additional  output  including  95%  confidence 
intervals  for  the  hazard  ratios.  The  output  follows: 


n=  238 


coef 

exp (coef) 

se  (coef) 

prison 

-0.080143 

0.922985 

0.384305 

dose 

-0.036964 

0.963711 

0.012346 

prison :clinic2 

-0.582989 

0.558227 

0.428135 

dose : clinic2 

-0.001164 

0.998837 

0 .014570 

Signif.  codes: 

0  '***' 

0.001  '**' 

0.01  '*'  0 

exp (coef) 

exp ( -coef ) 

lower  .95 

prison 

0.9230 

1.083 

0.4346 

dose 

0.9637 

1.038 

0.9407 

prison : clinic2 

0.5582 

1.791 

0.2412 

dose : clinic2 

0.9988 

1.001 

0.9707 

-2.994 

-1.362 


0.05 


Pr (>  |  z  |  ) 
0.83481 
0.00275  ** 
0.17329 
0.93632 

0.1  '  '  1 


1 . 9603 
0.9873 
1.2919 
1.0278 


Rsquare=  0.14  (max  possible=  0.994  ) 
Likelihood  ratio  test=  35.77  on  4  df, 
Wald  test  =34.09  on  4  df. 
Score  (logrank)  test  =34.97  on  4  df. 


p=3 . 222e-07 
p=7 . 138e-07 
p=4 . 706e-07 
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The  estimate  for  exp(Pi)  can  be  found  in  the  second  table, 
exp(coef)  for  prison  =  0.9203.  The  lower  and  upper  confi¬ 
dence  limits  are  0.4346  and  1.9603,  respectively.  If  we  did 
not  recode  the  variable  CLINIC  the  problem  would  have 
been  more  complicated  in  that  we  would  have  had  to  use 
variance-covariance  matrix  (which  can  be  obtained  with 
the  vcov  function)  to  calculate  a  95%  confidence  interval 
for  this  hazard  ratio. 


5.  ASSESSING  THE  PH  ASSUMPTION 
WITH  A  STATISTICAL  TEST 

The  cox.zph  function  is  designed  to  perform  a  statistical 
test  on  the  proportional  hazards  assumption.  This  statisti¬ 
cal  test  is  a  test  of  correlation  between  the  Schoenfeld 
residuals  and  survival  time  (or  ranked  survival  time). 
A  correlation  of  zero  supports  the  proportional  hazards 
assumption  (the  null  hypothesis).  First,  we  define  the 
response  variable  Y  with  the  Surv  function  and  then  the 
coxph  function  is  used  to  run  a  Cox  proportional  hazards 
model  with  the  variables  PRISON,  DOSE,  and  CLINIC: 

Y=Surv(addicts$survt,addicts$status==  1 ) 

mod  1  =coxph(Y~prison  +  dose  +  clinic,  data=addicts) 

The  object  called  modi  is  created  from  the  coxph  func¬ 
tion.  This  object  is  the  first  argument  for  the  cox.zph 
function.  The  code  to  run  the  test  of  the  proportional 
hazards  assumption  follows: 

cox.zph(modl,transform=rank) 

The  second  argument  requests  that  ranked  survival  times 
be  tested  against  the  Schoenfeld  residuals  rather  than  the 
actual  survival  times  (the  default).  The  output  follows: 


prison 

dose 

clinic 

GLOBAL 


rho  chisq  p 

-0.0462  0.322  0.57068 

0.0905  1.096  0.29521 

-0.2498  10.495  0.00120 
NA  12.425  0.00606 


The  output  shows  that  the  correation  between  the  Schoen¬ 
feld  residuals  for  the  variable  CLINIC  (3rd  row)  and  ranked 
survival  time  is  -0.2498  with  a  p-value  of  0.00120.  The 
significant  p-value  offers  evidence  that  the  proportional 
hazards  assumption  is  not  satisfied  for  the  variable 
CLINIC.  The  p-values  for  PRISON  and  DOSE  are  not  sig¬ 
nificant  suggesting  that  there  is  not  enough  evidence  to 
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reject  the  proportional  hazards  assumption  for  PRISON 
and  DOSE.  The  global  test  (4th  row)  tests  the  proportional 
hazards  assumption  for  the  entire  model  (i.e.,  for  all  three 
predictor  variables  simultaneously)  and  is  significant  with 
p  =  0.00606.  The  global  test  offers  evidence  that  the  pro¬ 
portional  hazards  assumption  is  violated  for  this  model. 

We  can  plot  the  Schoenfeld  residuals  against  each  indivi¬ 
dual’s  failure  time  with  the  plot  function  and  an  object 
created  from  the  cox.zph  function  as  the  first  argument. 
The  argument  var=clinic,  specifies  that  the  residuals 
should  pertain  the  variable  CLINIC.  The  argument  se=F, 
suppresses  the  printing  of  confidence  limits  for  the  fitted 
curve.  The  code  and  output  follow: 

plot(cox.zph(modl,transform=rank),se=F,var= ‘clinic’) 


If  the  PH  assumption  is  met  then  the  fitted  curve  should 
look  horizontal  because  the  Schoenfeld  residuals  would  be 
independent  of  survival  time.  However,  the  fitted  curve 
slopes  downward. 


6.  OBTAINING  COX-ADJUSTED  SURVIVAL  CURVES 

Cox  adjusted  survival  estimates  and  plots  can  be  obtained 
by  applying  the  summary  or  plot  function  to  an  object 
created  from  the  function  survfit.  The  first  step  is  to  run 
the  Cox  model  with  the  coxph  function: 

Y=Surv(addicts$survt,addicts$status==  1 ) 
modl=coxph(Y  ~  prison  +  dose  +  clinic,  data=addicts) 

Adjusted  survival  curves  generally  depend  on  the  pattern  of 
covariates.  Suppose  we  are  interested  in  plotting  the  sur¬ 
vival  curve  for  the  pattern  PRISON=0,  DOSE=70,  and 
CLINIC=2.  First,  we  need  to  create  a  dataset  (or  data  frame) 
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with  the  data.frame  function  with  one  observation.  Code 
and  output  follows: 

pattern  l=data.frame(prison=0,dose=70,clinic=2) 

pattern  1 

prison  dose  clinic 
0  70  2 

This  one  observation  dataframe  is  called  patteml.  To 
obtain  Cox  adjusted  survival  estimates  apply  the  survfit 
function  within  the  summary  function  as  shown  below: 

summary  ( survfit(mod  1  ,newdata=pattem  1 ) ) 

The  first  argument  of  the  survfit  function  is  the  object 
called  modi  created  with  coxph  function.  The  second 
argument  supplies  the  dataframe  containing  the  pattern 
of  covariates  of  interest  (called  patteml). 

The  output  follows: 


time  n.risk  n. event  survival  std.err  lower  95%  Cl  upper  95%  Cl 


7 

236 

1 

0. 

.  999 

0. 

.  00105 

0. 

.  997 

1. 

.  000 

13 

235 

1 

0. 

.  998 

0. 

.00154 

0  , 

.  995 

1 . 

.000 

17 

234 

1 

0. 

.997 

0. 

.00193 

0  . 

.  993 

1 . 

.000 

19 

233 

1 

0. 

.  996 

0  . 

.  00229 

0  . 

.  991 

1 . 

.  000 

26 

232 

1 

0. 

.  995 

0. 

.00263 

0  . 

.  990 

1 . 

.  000 

29 

229 

1 

0. 

.  994 

0. 

.00296 

0. 

.  988 

1 . 

.  000 

30 

228 

1 

0. 

.  993 

0. 

.00328 

0. 

.  986 

0  . 

.  999 

33 

227 

1 

0. 

.992 

0. 

.00359 

0  . 

.  985 

0  . 

.  999 

35 

226 

2 

0. 

.  989 

0. 

.00419 

0. 

.981 

0  . 

.  998 

857 

14 

1 

0. 

.549 

0. 

.07953 

0  . 

.414 

0  . 

.730 

878 

13 

1 

0. 

.526 

0. 

.08204 

0  . 

.  387 

0. 

.714 

892 

10 

1 

0. 

.496 

0. 

.08580 

0. 

.  354 

0. 

.697 

899 

9 

1 

0. 

.456 

0. 

.09090 

0  . 

.309 

0. 

.  674 

To  obtain  a  Cox  adjusted  survival  curve  for  the  same  pat¬ 
tern  of  covariates,  apply  the  plot  function  in  the  same 
manner  that  the  summary  function  was  applied  above. 
The  code  follows: 


plot(survfit(modl,newdata=patternl),conf.int=F,main=“Adjusted 
survival  for  prison=0,  dose=70,  clinic=2”) 
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The  conf.int=F  option  suppresses  the  plotting  of  the  con¬ 
fidence  limits.  The  option  conf.int=T  (the  default)  would 
plot  the  95%  confidence  limits.  The  main=  option  requests 
a  title  for  the  plot.  The  output  follows: 


Adjusted  survival  for  prison=0,  dose=70,  clinic=2 


Stratified  Cox  adjusted  survival  curves  can  be  obtained  by 
first  running  a  stratified  Cox  model  (stratified  by  CLINIC): 

mod3=coxph(Y~  prison  +  dose  +  strata(clinic),data=addicts) 

To  obtain  stratified  Cox  adjusted  curves  controlling  for 
PRISON  and  DOSE,  we  create  a  one  observation  data- 
frame  with  the  mean  values  of  0.46  for  PRISON  and  60.4 
for  DOSE: 

pattem2=data.frame(prison=.46,dose=60.40) 

Now  apply  the  plot  function  to  the  survfit  function  as 
shown  in  the  last  example.  The  code  and  output  follow: 

plot(survfit(mod3,newdata=pattem2),  conf.int=F,  lty  =  c(“solid”, 
“dashed”),  col=c(“black”,“grey”),  main=“Survival  curves  for  clinic, 
adjusted  for  prison  and  dose”) 

legend(“topright”,  c(“Clinic  1”, “Clinic  2”),  lty=c(“solid”,“dashed”), 
col= c(  “black”,  “grey” ) ) 
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Survival  curves  for  clinic,  adjusted  for  prison  and  dose 


The  fun=  option  in  the  plot  function  can  be  used  to  plot 
log-log  survival  curves.  The  code  and  output  follow: 

plot(survfit(mod3,newdata=pattem2),fun=“cloglog”,  main= 
“Log-log  curves  for  cinic,  adjusted  for  prison  and  dose”) 


Log-log  curves  for  cinic,  adjusted  for  prison  and  does 


The  fun=  option  plots  time  on  a  logarithmic  scale.  It  is  not 
so  straightforward  if  you  want  the  log-log  plot  against  time 
with  time  not  on  a  logarithmic  scale.  This  was  shown  in 
Sect.  2  for  KM  log  log  curves.  First,  the  adjusted  survival 


log-log  survival 
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estimates  can  be  saved  in  an  object  well  call  sum.mod3 
(shown  below): 

sum.mod3=summary(survfit(mod3,newdata=pattern2)) 

Now,  if  it  desired  to  plot  the  log-log  plot  against  time  with 
time  not  on  a  logarithmic  scale,  similar  code  can  be  used  as 
was  shown  in  Section  2,  except  replace  the  object  we  had 
called  kmfit3  in  Section  2  with  the  object  created  above, 
called  sum.mod3.  The  code  and  plot  follows: 

sum.mod4=data.frame(sum.mod3$strata,sum.mod3 
$time,sum.mod3  $surv) 

colnames(sum.mod4)=c(“clinic”,  “time”,  “survival”) 
clinic  l=sum.mod4[sum.mod4$clinic==“clinic=l”,  ] 
clinic2=sum.mod4[sum.mod4$clinic==“clinic=2”,  ] 

plot(clinic  1  $time,log(-log(clinic  1  $survival)),xlab=“survival 
time  in  days”,ylab=‘log-log  survival”,xlim=c(0,800),col= 
“black”, type=T,lty=“solid”,  main=“log-log  curves  stratified  by 
clinic,  adjusted  for  prison,  dose”) 

par(new=T) 

plot(clinic2$time,log(-log(clinic2$survival)),axes=F,xlab= 
“survival  time  in  days”,ylab=“log-log  survival”,col=“grey50”, 
type= T  ,lty  =  “dashed” ) 

legend(“bottomright”,  c(“Clinic  1”,  “Clinic  2”),  Ity  =  c(“solid”, 
“dashed”), col=c(“black”,“grey50”)) 

par(new=F) 


log-log  curves  stratified  by  clinic,  adjusted  for  prison,  does 
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7.  RUNNING  AN  EXTENDED  COX  MODEL 

In  contrast  to  Stata,  SAS,  and  SPSS,  in  order  to  run  an 
extended  Cox  model  in  R,  the  analytic  dataset  must  be  in 
the  counting  process  (start,  stop)  format.  Unfortunately, 
the  addicts  dataset  is  not  in  that  format,  so  it  needs  to  be 
altered  in  order  to  include  a  time-varying  covariate.  This 
can  be  accomplished  with  the  survSplit  function.  The 
survSplit  function  can  create  a  dataset  that  provides  mul¬ 
tiple  observations  for  the  same  subject  allowing  a  subject’s 
covariate  to  change  values  from  observation  to  observa¬ 
tion.  The  user  supplies  the  time  cutpoint(s). 

The  most  general  choice  for  time  cutpoints  that  can 
accommodate  the  modeling  of  any  time-varying  covariate 
is  a  vector  of  time  cutpoints  that  includes  all  event  times  in 
the  data.  The  variable  SURVT  in  the  addicts  dataset  con¬ 
tains  each  individual’s  time-to-event  or  time-to-censorship. 
The  following  code  creates  a  new  analytic  dataset  (called 
addicts.cp)  which  puts  the  addicts  data  in  the  counting 
process  format  using  the  survSplit  function: 

addicts.cp=survSplit(addicts,cut=addicts$survt[addicts$status==l], 

end=  “survt”,  event= “status”,start= “start”, id=“id” ) 

The  first  argument  of  the  survSplit  function  specifies  the 
dataframe  (addicts)  to  be  manipulated  into  the  counting 
process  format.  The  cut=  addicts$survt[addicts$sta- 
tus==l]  option  specified  that  the  time  cutpoints  are  indi¬ 
cated  by  the  SURVT  variable  subsetted  where  the  STATUS 
variable  equals  1  (i.e.,  keeping  the  event  times  but  omitting 
censorship  times).  The  event=”status”  option  specifies 
STATUS  as  the  variable  indicating  whether  the  individual 
had  an  event  or  was  censored.  The  start=”start”  option 
creates  a  new  variable  called  START.  This  newly  defined 
variable  for  the  starting  times  for  each  observation  is  nec¬ 
essary  for  the  data  to  be  in  counting  process  (start,  stop) 
format.  The  end=”survt”  option  defines  SURVT  as  the 
stop  variable  (i.e.,  the  time-to-event  variable).  The  option 
id=”id”  indicates  that  ID  is  the  variable  that  identifies  each 
individual.  The  survSplit  function  creates  multiple  obser¬ 
vations  for  individuals  at  risk  at  multiple  time  points.  The 
dataset  addicts.cp  created  above  contains  18,708  observa¬ 
tions  from  the  238  observations  in  the  addicts  dataset  (use 
the  nrow  function  and  the  code  nrow( addicts.cp))  to 
return  the  number  of  observations. 

Suppose  the  PH  assumption  was  violated  for  the  variable 
DOSE  and  we  were  interested  in  defining  a  time-varying 
covariate  as  the  product  of  DOSE  and  the  natural  log  of 
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time  (SURVT).  This  variable  can  easily  be  defined  if  the 
dataset  is  in  counting  process  form  with  time  cutpoints  at 
each  event  time  as  shown  below: 

addicts.cp$logtdose=addicts.cp$dose*log(addicts.cp$survt) 

We  now  have  a  new  variable  in  the  dataset  (called  LOGT- 
DOSE=ln(DOSE)*T)  that  varies  over  time.  We  print  the 
dataset  for  one  individual  (id=  1 06)  who  had  an  event  at 
time=35  days.  Rather  than  print  all  the  variables,  we 
request  a  subset  of  them  with  the  c  function: 

addicts.cp[addicts.cp$id==106,c(‘id’, ‘start’, ‘survt’, ‘status’, 
‘dose’,‘logtdose’) 


id 

start 

survt 

status 

dose  logtdos 

106 

0 

7 

0 

40 

77.8364 

106 

7 

13 

0 

40 

102.5980 

106 

13 

17 

0 

40 

113.3285 

106 

17 

19 

0 

40 

117.7776 

106 

19 

26 

0 

40 

130 . 3239 

106 

26 

29 

0 

40 

134.6918 

106 

29 

30 

0 

40 

136.0479 

106 

30 

33 

0 

40 

139.8603 

106 

33 

35 

1 

40 

142.2139 

The  variable  LOGTDOSE  is  time  dependent  as  its  values 
increase  with  time  as  expected.  The  variable  SURVT  lists 
all  the  event  times  in  the  addicts  dataset  up  to  day  35  when 
this  individual  had  an  event.  Notice  STATUS=1  when  the 
event  occurred  and  STATUS=0  prior  to  the  event.  Next  we 
run  an  extended  Cox  model  including  the  predictors 
PRISON,  DOSE,  and  CLINIC  and  the  time-dependent  var¬ 
iable  LOGTDOSE: 

coxph(Surv(addicts.cp$start,addicts.cp$survt,addicts.cp$status)  ~ 

prison  +  dose  +  clinic  +  logtdose  +  cluster(id),data=addicts.cp) 

The  Surv  function  now  takes  three  arguments:  the  start 
variable  (called  START),  the  stop  variable  (called  SURVT), 
and  the  status  variable  (called  STATUS).  The  term  cluster 
(ID)  in  the  model  formula  indicates  that  there  are  multiple 
observations  (clusters)  from  the  same  subject  and  requests 
that  robust  standard  errors  be  produced  for  the  coefficient 
estimates.  These  robust  standard  errors  are  designed  to 
account  for  the  non-independence  of  observations  from 
the  same  subject.  The  model  output  follows: 


648  Computer  Appendix:  Survival  Analysis  on  the  Computer 


se(coef)  robust  se  z  p 

0.16747  0.15972  2.13  3.3e-02 

0.03598  0.02960  -2.79  5.3e-03 

0.21542  0.23637  -4.31  1.6e-05 

0.00645  0.00525  1.64  1.0e-01 


coef  exp(coef) 
prison  0.34063  1.406 
dose  -0.08262  0.921 
clinic  -1.01988  0.361 
logtdose  0.00862  1.009 

Likelihood  ratio  test=66.3 


on  4  df,  p=l . 34e-13  n=  18708 


The  Wald  test  z  statistic  of  1.64  (p  =  1.0e-01  or  p=0.10)  is 
not  significant  for  LOGTDOSE,  providing  no  evidence  that 
the  proportional  hazards  assumption  is  violated  for  DOSE. 


Next  we  run  an  extended  Cox  model  with  heaviside  func¬ 
tions  for  CLINIC  defined  about  the  time  cutpoint  of  365 
days.  We  could  use  the  dataset  that  we  just  created, 
addicts.cp,  but  since  there  is  now  only  one  cutpoint,  we 
illustrate  how  to  create  a  dataset  in  counting  process  for¬ 
mat  with  only  one  cutpoint.  The  new  dataset  (called 
addicts.cp365)  will  have  360  observations  compared  to 
18,708  in  the  dataset  we  previously  had  created  called 
addicts.cp.  The  code  follows: 

addicts.cp365=survSplit(addicts,cut=365,end=“survt”, 

event=“status”,start=“start”,id=“id”) 

The  cut=365  option  in  the  survSplit  function  requests  that 
day  365  be  the  only  cutpoint.  Next  we  create  the  two  time- 
dependent  variables  (HV1  and  HV2).  HV1  is  defined  to 
equal  the  value  of  CLINIC  if  survival  time  is  less  than  365 
days  and  0  otherwise.  HV2  is  defined  to  equal  0  if  survival 
time  is  less  than  365  days  and  equal  the  value  of  CLINIC 
otherwise  (code  follows): 

addicts.cp365$hvl=addicts.cp365$clinic*(addicts.cp365$start<365) 

addicts.cp365$hv2=addicts.cp365$clinic*(addicts.cp365$start>=365) 

The  conditional  statements  in  the  code  (addicts.cp365 
$start<365)  and  (addicts.cp365$start>=365),  take  the 
values  of  1  if  true  and  0  if  false  and  are  then  multiplied  by 
the  variable  CLINIC  to  define  HV1  and  HV2. 


Next  well  sort  the  dataset  by  the  variables  ID  and  START. 
This  is  not  a  necessary  step  but  it  is  easier  to  view  and 
understand  the  data  when  multiple  observations  from  the 
same  subject  are  consecutive.  The  order  function  sorts  the 
dataset: 


addicts.cp365=addicts.cp365[order(addicts.cp365$id,addicts.cp365$start),  ] 

Next  we  print  the  first  10  observations  for  selected  vari¬ 
ables: 
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addicts.cp365[l  :10,c(‘id’ start’, ‘survt’, ‘status’, ‘clinic’, ‘hvl’, ‘hv2’)] 


id 

start 

survt 

status 

clinic 

hvl 

hv2 

1 

0 

365 

0 

1 

1 

0 

1 

365 

428 

1 

1 

0 

1 

10 

0 

365 

0 

1 

1 

0 

10 

365 

393 

1 

1 

0 

1 

100 

0 

146 

0 

2 

2 

0 

101 

0 

365 

0 

2 

2 

0 

101 

365 

450 

1 

2 

0 

2 

102 

0 

365 

0 

2 

2 

0 

102 

365 

555 

0 

2 

0 

2 

103 

0 

365 

0 

2 

2 

0 

Notice  the  sorted  order  of  the  ID  variable  is  1,  10,  and  100 
rather  than  1,2,  and  3.  The  ID  variable  is  a  character  rather 
than  numeric  variable  and  is  sorted  in  "alphabetical” 
rather  than  numerical  order.  The  first  subject  (ID=1)  had 
an  event  at  428  days,  so  was  censored  (STATUS=0)  during 
the  first  time  interval  (0,  365)  but  had  an  event  (STA- 
TUS=1)  during  the  second  interval  (365,  428).  This  subject 
has  the  value  CLINIC=1,  thus  has  the  time-dependent 
values  HV1  =  1  and  HV2=0  over  the  first  interval  and 
HV1=0  and  HV2=1  over  the  second  interval. 

Before  running  an  extended  Cox  model  with  these  heavi- 
side  functions  we  define  an  object  (called  Y365)  for  the 
response  variable  using  the  Surv  function.  This  object  is 
then  used  in  the  coxph  model  formula.  It  is  not  necessary 
to  explicitly  define  this  object  and  we  did  not  do  so  for  the 
previous  extended  Cox  model  that  we  ran  containing 
LOGTDOSE,  but  the  code  is  more  readable  with  the  nota¬ 
tion  for  the  response  variable  simplified.  The  code  follows: 

Y365=Surv(addicts.cp365$start,addicts.cp365$survt, 

addicts.cp365$status) 

Next  we  run  the  model  with  two  heaviside  functions  (code 
and  output  follow): 

coxph(Y365  ~  prison  +  dose  +  hvl  +  hv2  +  cluster(id), 
data=addicts.cp365) 


coef 

exp (coef) 

se  (coef) 

robust  se 

z 

P 

prison 

0.3780 

1.459 

0.16841 

0.16765 

2.25 

hJ 

CD 

1 

O 

to 

dose 

-0.0355 

0.965 

0.00643 

0.00652 

-5.44 

5 . 3e-08 

hvl 

-0.4594 

0.632 

0.25529 

0.25998 

-1.77 

7 . 7e-02 

hv2 

-1.8305 

0.160 

0.38595 

0.39838 

-4.59 

4 . 3e-06 

Likelihood  ratio  test=74.2  on  4  df,  p=2.89e-15  n=  360 
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The  estimated  hazard  ratio  (CLINIC=2  vs.  CLINIC=1)  is 
0.632  for  days  <365  and  0.160  for  days  ^365  (found  in 
the  second  numeric  column  under  exp(coef)).  If  we  wish 
to  match  the  SAS,  Stata,  and  SPSS  output,  we  could  run 
the  model  without  robust  standard  errors  and  use  the 
method=”breslow”  to  handle  simultaneous  events  (ties) 
in  the  Cox  likelihood.  The  code  follows  (output  omitted): 

coxph(Y365  ~  prison  +  dose  +  hvl  +  hv2,data=addicts. 
cp3 65 ,method=  “breslow” ) 

To  run  an  equivalent  model  with  one  heaviside  function, 
we  need  to  include  the  CLINIC  variable  in  the  model  (code 
and  output  shown  below): 


coxph(Y365  ~  prison  +  dose  +  clinic  +  hv2  +  cluster 
(id),data=addicts.cp365) 


coef 

exp (coef) 

se  (coef) 

robust  se 

z 

P 

prison 

0.3780 

1.459 

0 .16841 

0.16765 

2.25 

2 . 4e-02 

dose 

-0.0355 

0.965 

0.00643 

0.00652 

-5.44 

5 . 3e-08 

clinic 

-0.4594 

0.632 

0.25529 

0.25998 

-1.77 

-J 

-J 

CD 

1 

o 

ro 

hv2 

-1.3711 

0.254 

0.46140 

0.47054 

-2.91 

3 . 6e-03 

Likelihood  ratio  test=74.2  on  4  df,  p=2.89e-15  n=  360 

The  coefficient  estimates  are  different  with  this  model 
compared  to  the  model  with  two  heaviside  functions  but 
the  estimated  hazard  ratios  are  the  same.  The  estimated 
hazard  ratio  (CLINIC=2  vs.  CLINIC=1)  is  0.632  for  days 
<365  (exponentiate  the  coefficient  for  CLINIC).  In  order  to 
estimate  the  hazard  ratio  for  days  >  365,  we  need  to  sum 
the  coefficient  estimates  for  CLINIC  and  HV2  and  then 
exponentiate  (exp(-0.4594  +  -1.3711))  =  0.160).  The  signif¬ 
icant  p-value  for  the  estimated  coefficient  for  HV2  of  (p  = 
3.6e-10  or  p  =  0.0036)  suggests  that  the  hazard  ratios  for 
CLINIC  for  the  two  different  time  periods  are  not  equal.  In 
other  words,  the  significant  p-value  provides  evidence  that 
the  proportional  hazard  assumption  is  violated  for 
CLINIC. 
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8.  RUNNING  PARAMETRIC  MODELS 

The  survreg  function  in  R  runs  parametric  accelerated 
failure  time  (AFT)  models.  Whereas  the  key  assumption 
of  a  proportional  hazards  (PH)  model  is  that  hazard  ratios 
are  constant  over  time,  the  key  assumption  for  an  AFT 
model  is  that  survival  time  accelerates  (or  decelerates) 
by  a  constant  factor  when  comparing  different  levels  of 
covariates. 

The  most  common  distribution  for  parametric  modeling  of 
survival  data  is  the  Weibull  distribution.  The  hazard  func¬ 
tion  for  a  Weibull  distribution  is  Xpf~x.  If  p  =  1,  then  the 
Weibull  distribution  is  also  an  exponential  distribution. 
The  Weibull  distribution  has  the  desirable  property  in 
that  if  the  AFT  assumption  holds  then  the  PH  assumption 
also  holds.  The  exponential  distribution  is  a  special  case  of 
the  Weibull  distribution.  The  key  property  for  the  exponen¬ 
tial  distribution  is  that  the  hazard  is  constant  over  time  (di 
(t)  =  X).  In  R,  the  Weibull  and  exponential  model  are  run 
only  as  AFT  models. 

The  Weibull  distribution  has  the  property  that  the  log-log 
of  the  survival  function  is  linear  with  the  log  of  time.  Recall 
in  Section  2  (assessing  the  PH  assumption  graphical 
approach)  that  the  fun=“cloglog”  option  in  the  plot 
function  requested  Kaplan-Meier  log-log  survival  plot 
be  plotted  against  time  (on  the  log  scale)  for  the  variable 
CLINIC.  The  curves  from  this  plot  can  be  used  to  evaluate 
the  Weibull  assumption.  If  the  survival  curves  are  approxi¬ 
mately  straight  lines  (and  parallel),  then  the  Weibull 
assumption  is  reasonable  for  CLINIC.  Furthermore,  if  the 
straight  lines  have  a  slope  of  1 ,  then  the  exponential  distri¬ 
bution  is  appropriate.  We  repeat  and  condense  the  code 
that  was  given  in  Section  2  (see  outputted  plot  in  Section  2): 

plot(survfit(Y~addicts$clinic),  fun=“cloglog”,xlab=“time  in  days  using  log¬ 
arithmic  scale”, ylab=“log-log  survival”,  main=“log-log  curves  by  clinic”) 

The  log-log  curves  in  Section  2  do  not  look  straight  but  for 
illustration,  we  shall  proceed  as  if  the  Weibull  assumption 
were  appropriate.  First  an  exponential  model  is  run  with 
the  survreg  function.  In  this  model,  the  Weibull  shape 
parameter  (p)  is  forced  to  equal  1 ,  which  forces  the  hazard 
to  be  constant.  Well  save  the  results  in  an  object  called 
modparl: 

modparl=survreg(Surv(addicts$survt,addicts$status)  ~  prison  +  dose  + 

clinic, data=addicts,dist=“exponential”) 
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Next  we  apply  the  summary  function  to  the  object  we  just 
created  (code  and  output  shown  below): 

summary(modpar  1 ) 

Value  Std.  Error 
(Intercept)  3.6843  0.43072 

prison  -0.2526  0.16489 

dose  0.0289  0.00614 

clinic  0.8806  0.21063 

Scale  fixed  at  1 

Exponential  distribution 
Loglik (model ) =  -1094  Loglik ( intercept  only) =  -1118.9 

Chisq=  49.91  on  3  degrees  of  freedom,  p=  8.3e-ll 
Number  of  Newton-Raphson  Iterations:  5 
n=  238 


z  p 

8.55  1 . 1 9e-17 
-1.53  1.25e-01 
4.71  2 . 52e-0  6 
4.18  2 . 91e-05 


The  key  assumption  of  an  exponential  model  is  that  the 
hazard  is  constant  over  time.  This  is  indicated  in  the  out¬ 
put  by  the  statement  “Scale  fixed  at  1”  listed  under  the 
tables  of  parameter  estimates.  The  output  can  be  used  to 
estimate  the  hazard  ratio  for  any  subject  given  a  pattern  of 
covariates.  Note  that  R  outputs  the  parameter  estimates 
for  the  AFT  form  of  the  exponential  model.  Multiply  the 
estimated  coefficients  by  one  to  get  estimates  consistent 
with  the  PH  parameterization  of  the  model  (see  Chapter. 
7).  For  example,  the  estimated  hazard  ratio  comparing 
PRISON=l  vs  PRISON=0  is  exp(0.2526)  =  1.29.  The 
corresponding  acceleration  factor  for  an  exponential 
model  is  just  the  reciprocal  of  the  hazard  ratio,  exp(- 
0.2526)  =  0.78.  Having  a  prison  record  accelerates  the 
time  to  event  by  a  factor  of  0.78. 

Next  a  Weibull  AFT  model  is  run  with  the  survreg  func¬ 
tion.  The  results  are  saved  in  an  object  called  modpar2: 

modpar2=survreg(Surv(addicts$survt,addicts$status) 
~  prison  +  dose  +  clinic, data=addicts,dist=“weibull”) 
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Next  we  apply  the  summary  function  to  the  object 
modpar2  (code  and  output  follow): 


summary(modpar2) 

Value  Std.  Error  z 


(Intercept)  4.1048 
prison  -0.2295 
dose  0.0244 
clinic  0.7090 
Log(scale)  -0.3150 


0.32806  12.51  6.37e-36 
0.12079  -1.90  5 . 7 5e-02 
0.00459  5.32  1.03e-07 
0.15722  4.51  6 . 4  9e-0  6 
0.06756  -4.66  3.13e-06 


Scale=  0.73 


Weibull  distribution 

Loglik (model ) =  -1084.5  Loglik ( intercept  only) =  -1114.9 

Chisq=  60.89  on  3  degrees  of  freedom,  p=  3.8e-13 

n=  238 


The  Weibull  shape  parameter  is  the  reciprocal  of  what  R 
calls  the  Scale  parameter  (estimated  at  0.73).  An  estimate 
for  the  Weibull  shape  parameter  can  be  obtained  by  taking 
the  reciprocal,  1/0.73  =  1.37.  The  acceleration  factor  com¬ 
paring  CLINIC=2  to  CLINIC=1  is  estimated  at  exp(0.7090) 
=  2.03.  So,  the  estimated  median  survival  time  (time  off 
heroin)  is  double  for  patients  enrolled  in  CLINIC=2  com¬ 
pared  to  CLINIC  =  1 . 


We  can  use  the  model  results  and  the  predict  function  to 
estimate  the  median  (or  any  other  quantile)  time  to  event 
for  any  specified  pattern  of  covariates.  For  example,  we 
can  obtain  the  25th,  50th,  and  75th  percentile  of  survival 
time  estimated  from  the  Weibull  model  results  that  we 
saved  in  the  object  modpar2  for  an  individual  who  has 
the  covariate  pattern  PRISON=l,  DOSE=50,  and 
CLINIC=1.  The  code  follows: 


pattern  1  =  data,  frame  ( prison  =  1  ,dose=50,clinic=  1 ) 
pct=c(.25,.50,.75) 

days=predict(modpar2,newdata=patternl,type=“quantile”,p=pct) 

cbind(pct,days) 

The  first  statement  in  the  code  creates  a  dataframe  of  one 
observation  specifying  the  pattern  of  covariates  of  interest. 
This  dataframe  (called  pattern  1)  could  have  contained 
more  than  one  observation  if  we  were  interested  in  com¬ 
paring  different  patterns  of  covariates.  The  next  statement 
creates  a  vector  (called  pet)  which  contains  the  percentiles 
of  interest  (25th,  50th,  and  75th).  The  third  statement  cre¬ 
ates  an  object  (called  days)  that  contains  output  from  the 
predict  function.  The  first  argument  of  the  predict  func¬ 
tion  is  the  object  we  called  modpar2  that  contains  the 
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Weibull  model  results.  The  second  argument,  newdata= 
pattern  1,  inputs  the  pattern  of  covariates  of  interest.  The 
third  argument,  type=“quantile”,  requests  that  quantiles 
be  output.  The  fourth  argument,  p=pct,  inputs  the  vector 
of  quantiles  that  we  created  in  the  line  of  code  above  it.  The 
last  statement  of  code  uses  the  cbind  function  to  combine 
the  vectors  pet  and  days  side-by-side  in  columns.  The 
output  follows: 

pet  days 

0.25  133.8074 

0.50  254.2196 

0.75  421.6070 

The  estimated  median  survival  time  is  254.2196  days. 
We  can  use  similar  code  to  plot  the  survival  curve  for  an 
individual  who  has  the  covariate  pattern  PRISON=l, 
DOSE=50,  and  CLINIC=1  using  the  Weibull  model  results. 
The  code  follows: 

pct2=0: 100/1 00 

days2=predict(modpar2,newdata=patternl, 

type=“quantile”,p=pct2) 

survival=  1  -pct2 

plot(days2,survival,xlab=  “survival  time  in  days”,ylab=  “survival 
probabilities”, main=“Weibull  survival  estimates  for  prison=0, 
dose=40,clinic=l”,xlim=c(0,800)) 

The  first  statement  creates  a  vector  called  pct2  that  con¬ 
tains  a  sequence  of  percentiles  between  0  and  1  incremen¬ 
ted  by  0. 01,(0,  0.01,  0.02,...,0.99,  1).  The  second  statement 
creates  an  object,  called  days2,  containing  output  from  the 
predict  function.  The  third  argument  creates  a  vector 
called  survival  which  reverses  the  order  of  pct2.  Finally, 
the  plot  function  plots  the  vectors  days2  on  the  horizontal 
axis  and  survival  on  the  vertical  axis.  Axis  labels  and  a  title 
are  added  using  plot  function  options.  The  output  follows: 
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Weibull  survival  estimates  for  prison=0,  does=40,  cilinic=1 


Next  a  log-logistic  AFT  model  is  run  with  the  survreg 
function.  The  results  are  saved  in  an  object  called 

modpar3: 

modpar3=survreg(Surv(addicts$survt,addicts$status)~ 
prison  +  dose  +  clinic, data=addicts,dist=“loglogistic”) 


Next,  we  apply  the  summary  function  to  the  object 
modpar3  (code  and  output  shown  below): 


summary(modpar3) 


(Intercept) 

3 

prison 

-0 

dose 

0 

clinic 

0 

Log (scale) 

-0 

alue  Std.  Error 
5633  0.38945 
2913  0.14396 
0316  0.00552 
5806  0.17157 
5331  0.06863 


z  p 

9.15  5 . 72e-20 
-2.02  4 . 31e-02 
5.73  1 . 02e-08 
3.38  7 . 14e-04 
-7.77  7 . 95e-15 


Scale=  0.587 


Log  logistic  distribution 

Loglik (model ) =  -1093.9  Loglik ( intercept  only) =  -1120 

Chisq=  52.18  on  3  degrees  of  freedom,  p=  2.7e-ll 
Number  of  Newton-Raphson  Iterations:  4 
n=  238 
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From  this  output,  the  acceleration  factor  comparing 
CLINIC=2  to  CLINIC=1  is  estimated  as  exp(0.5806)  = 
1.79.  If  the  AFT  assumption  holds  for  a  log  logistic  model, 
then  the  proportional  odds  assumption  holds  for  the  sur¬ 
vival  function  (although  the  PH  assumption  will  not  hold). 
The  proportional  odds  assumption  can  be  evaluated  by 
plotting  the  log  odds  of  survival  (using  KM  estimates) 
against  the  log  of  survival  time.  If  the  plots  look  like 
straight  lines  for  each  pattern  of  covariates  then  the  log 
logistic  distribution  is  reasonable.  If  the  straight  lines  are 
also  parallel  then  the  proportional  odds  and  AFT  assump¬ 
tions  also  hold. 

In  Section.  2,  we  created  an  object  which  we  called  kmfit2 
that  contained  the  Kaplan-Meier  survival  estimates.  We 
repeat  the  code  to  recreate  that  object: 

kmfit2=survfit(Surv(addicts$survt,addicts$status)~addicts$clinic) 

The  vector  kmfit2$time  contains  the  survival  times  and 
the  vector  kmfit2$surv  contains  the  KM  survival  estimates 
by  CLINIC.  The  plot  function  can  be  used  to  plot  log  odds 
of  survival,  log[(S/(l  x  S)],  against  the  log  of  survival  time. 
The  code  and  output  follow: 

plot(log(kmfit2$time),log(kmfit2$surv/(l-kmfit2$surv))) 


log(kmfit2$time) 


The  curves  do  not  look  like  straight  lines  or  parallel  so  the 
proportional  odds  assumption  for  CLINIC  looks  to  be  vio¬ 
lated.  We  had  run  the  log-logistic  model  earlier  for  illustra¬ 
tion,  even  though  the  graph  suggests  that  it  is  not  the 
appropriate  model. 

Other  distributions  supported  by  the  survreg  function  are 
the  normal  (dist="gaussian”)  and  the  lognormal  (dist=”log- 
normal”)  distributions. 
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9.  RUNNING  FRAILTY  MODELS 

Frailty  models  contain  an  extra  random  component 
designed  to  account  for  individual-level  differences  in  the 
hazard  otherwise  unaccounted  for  by  the  model.  The 
frailty,  a,  is  a  multiplicative  effect  on  the  hazard  assumed 
to  follow  some  distribution.  The  hazard  function  condi¬ 
tional  on  the  frailty  can  be  expressed  as  h(t\a)  =  <x[h(t)]. 

R  offers  three  choices  for  the  distribution  of  the  frailty:  the 
gamma,  Gaussian,  and  t  distributions.  The  variance  (theta) 
of  the  frailty  component  is  a  parameter  typically  estimated 
by  the  model.  If  theta  =  0,  then  there  is  no  frailty. 

First,  we  rerun  a  stratified  Cox  model  without  frailty  (pre¬ 
viously  shown  in  Section.  4).  The  stratified  variable  is 
CLINIC  while  PRISON  and  DOSE  are  predictor  variables. 
A  stratified  Cox  model  is  appropriate  if  the  PH  assumption 
is  violated  for  CLINIC  and  met  for  PRISON  and  DOSE  and 
our  interest  is  in  estimating  a  hazard  ratio  for  PRISON  or 
DOSE.  The  code  and  output  follow: 

Y=Surv(addicts$survt,addicts$status==  1 ) 
coxph(Y~  prison  +  dose  +  strata(clinic),data=addicts) 

coef  exp(coef)  se(coef)  z  p 

prison  0.3896  1.476  0.16893  2.31  2 .  le-02 

dose  -0.0351  0.965  0.00646  -5.43  5.6e-08 

Likelihood  ratio  test=33 . 9  on  2  df,  p=4.32e-08  n=  238 

The  estimated  hazard  ratio  for  PRISON=l  versus 
PRISON=0  is  exp(0.3896)  =  1.476.  Next  we  illustrate  how 
to  include  a  frailty  component  in  this  model.  The  code 
follows: 

coxph(Y~  prison  +  dose  +  strata(clinic)  +  frailty(id,  distributions 
“gamma”),  data=addicts) 

The  term  +  frailty(id,  distributions  “gamma”)  is  included 
in  the  model  formula.  The  first  argument  of  the  frailty 
function  is  the  variable  id  and  indicates  that  the  unmea¬ 
sured  heterogeneity  (the  frailty)  is  at  the  individual  level. 
The  second  argument  indicates  that  the  distribution  of  the 
random  component  is  the  gamma  distribution.  The  output 
follows: 
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coef  se(coef)  se2  Chisq  DF  p 

prison  0.3900  0.16916  0.16893  5.32  1.00  2.1e-02 

dose  -0.0352  0.00647  0.00647  29.51  1.00  5.6e-08 

frailty(id,  distribution  0.34  0.32  3.1e-01 

Iterations:  5  outer,  41  Newton-Raphson 

Variance  of  random  effect=  0.00227  I-likelihood  =  -597.5 
Degrees  of  freedom  for  terms=  1.0  1.0  0.3 

Likelihood  ratio  test=34.6  on  2.32  df,  p=5.26e-08  n=  238 

Under  the  table  of  parameter  estimates  the  output  indi¬ 
cates  that  the  variance  of  random  effect  =  0.00227.  The  p- 
value  for  the  frailty  component  of  3.1e-01=  0.31  is  provided 
in  the  third  row  and  right  column  of  the  table  and  indicates 
that  the  frailty  component  is  not  significant.  We  conclude 
that  the  variance  of  the  random  component  is  zero  for  this 
model  (i.e.,  there  is  no  frailty).  The  parameter  estimates  for 
PRISON  and  DOSE  changed  minimally  in  this  model  com¬ 
pared  to  the  model  previously  run  without  the  frailty. 

Now,  suppose  the  variable  CLINIC  was  unmeasured.  Next 
we  consider  a  Cox  model  (without  frailty)  that  does  not 
contain  CLINIC.  The  code  and  output  follow: 

coxph(Y~  prison  +  dose,  data=addicts) 

coef  exp (coef)  se(coef)  z  p 

prison  0.1897  1.209  0.164  1.15  2.5e-01 

dose  -0.0361  0.965  0.006  -6.01  1.8e-09 

Likelihood  ratio  test=38.2  on  2  df,  p=5.04e-09  n=  238 

The  estimated  hazard  ratio  for  PRISON=l  versus 
PRISON=0  is  exp(0.1897)  =  1.209  as  compared  to  exp 
(0.3896)  =  1.476  that  was  observed  in  the  model  that 
contained  CLINIC  as  a  stratified  variable.  In  previous  sec¬ 
tions  CLINIC  was  shown  to  be  an  important  predictor  that 
violates  the  proportional  hazards  assumption.  If  CLINIC 
was  unaccounted  for  (as  in  the  model  above),  there  may  be 
a  source  of  unobserved  heterogeneity  that  a  frailty  compo¬ 
nent  might  address.  The  next  model  omits  CLINIC  but 
includes  a  frailty  component  and  the  predictors  PRISON 
and  DOSE.  The  code  and  output  follow: 
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coxph(Y~  prison  +  dose  +  frailty(id,  distribution  “gamma”),  data=addicts) 

coef  se(coef)  se2  Chisq  DF  p 

prison  0.4144  0.22160  0.17590  3.5  1.0  6.1e-02 

dose  -0.0517  0.00845  0.00699  37.4  1.0  9.6e-10 

frailty(id,  distribution  100.5  69.3  8.6e-03 

Iterations:  6  outer,  44  Newton-Raphson 

Variance  of  random  effect=  0.65  I-likelihood  =  -685.4 
Degrees  of  freedom  for  terms=  0.6  0.7  69.3 

Likelihood  ratio  test=190  on  70.7  df,  p=6.17e-13  n=  238 

The  variance  of  the  frailty  component  is  estimated  at  0.65 
compared  to  0.00227  for  the  model  that  we  showed  previ¬ 
ously  that  contained  CLINIC  as  the  stratified  variable.  The 
p-value  for  the  frailty  is  highly  significant  at  8.6e-3  = 
0.0086.  The  hazard  ratio  for  the  effect  of  PRISON  is  exp 
(0.4144)  =  1.51.  The  summary  function  can  be  applied  to 
the  coxph  function  to  get  R  to  exponentiate  the  parameter 
estimates  (with  95%  Cl)  when  a  frailty  component  is 
included  in  a  Cox  model.  The  code  and  output  follow: 

summary  (coxph(Y^  prison  +  dose  +  frailty  (id, 
distribution=“gamma”),  data=addicts)) 


coef 

se (coef) 

se2 

Chisq 

DF 

P 

prison 

0.4144 

0.22160 

0.17590 

3.5 

1.0 

6.  le-02 

dose 

-0.0517 

0.00845 

0.00699 

37 . 4 

1.0 

9. 6e-10 

frailty (id. 

distribution 

100.5 

69.3 

8 . 6e-03 

exp (coef)  exp (-coef)  lower  .95  upper  .95 
prison  1.51  0.661  0.980  2.337 

dose  0.95  1.053  0.934  0.966 

Iterations:  6  outer,  44  Newton-Raphson 

Variance  of  random  effect=  0.65  I-likelihood  =  -685.4 
Degrees  of  freedom  for  terms=  0.6  0.7  69.3 

Rsquare=  0.551  (max  possible=  0.997  ) 

Likelihood  ratio  test=  190  on  70.7  df,  p=6.17e-13 
Wald  test  =  38.8  on  70.7  df,  p=l 

It  is  interesting  that  the  estimated  hazard  ratio  for  PRISON 
(1.51)  obtained  in  this  model  (without  CLINIC  but  with  the 
frailty  component)  is  closer  to  the  corresponding  hazard 
ratio  obtained  from  the  model  that  included  CLINIC 
(1.476)  compared  to  the  one  that  did  not  include  CLINIC 
(1.209).  In  this  example,  the  frailty  component  might  be 
accounting  to  some  extent  for  the  fact  that  CLINIC  was 
omitted  from  the  model. 
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10.  MODELING  RECURRENT  EVENTS 

The  modeling  of  recurrent  events  is  illustrated  with  the 
bladder  cancer  dataset  (bladder.rda)  described  at  the 
start  of  this  appendix.  Recurrent  events  are  represented 
in  the  data  with  multiple  observations  for  subjects  having 
multiple  events.  The  data  layout  for  the  bladder  cancer 
dataset  is  in  the  counting  process  (start,  stop)  format 
with  time  intervals  defined  for  each  observation  (see  Chap¬ 
ter  8).  The  load  function  is  used  to  access  an  R  dataframe 
that  has  been  saved  as  a  file.  Suppose  the  bladder  dataset 
has  been  saved  on  your  C  drive  as  C:\crbladder.rda.  The 
following  code  will  load  the  bladder  data: 

load(  “C  :\\bladder.rda” ) 

The  following  code  prints  the  12th-20th  observation,  which 
contains  information  for  four  subjects: 

bladder [  12:20,  ] 

The  output  follows: 


id 

event 

interval 

inttime 

start  stop 

tx 

num 

size 

12 

10 

1 

1 

12 

0 

12 

0 

1 

1 

13 

10 

1 

2 

4 

12 

16 

0 

1 

1 

14 

10 

0 

3 

2 

16 

18 

0 

1 

1 

15 

11 

0 

1 

23 

0 

23 

0 

3 

3 

16 

12 

1 

1 

10 

0 

10 

0 

1 

3 

17 

12 

1 

2 

5 

10 

15 

0 

1 

3 

18 

12 

0 

3 

8 

15 

23 

0 

1 

3 

19 

13 

1 

1 

3 

0 

3 

0 

1 

1 

20 

13 

1 

2 

13 

3 

16 

0 

1 

1 

There  are  three  observations  for  ID=10,  one  observation 
for  ID=1 1,  three  observations  for  ID=12,  and  two  observa¬ 
tions  for  ID=13.  The  variables  START  and  STOP  represent 
the  time  interval  for  the  risk  period  specific  to  that  obser¬ 
vation.  The  variable  EVENT  indicates  whether  an  event 
(coded  1)  occurred.  The  first  three  observations  indicate 
that  the  subject  with  ID=10  had  an  event  at  12  months, 
another  event  at  16  months,  and  was  censored  at  18 
months. 

Recall  we  analyzed  data  in  the  counting  process  format 
when  we  ran  extended  Cox  models  (Section  7).  In  that 
section  we  saw  how  a  subject’s  covariate  can  change  values 
from  time-interval  to  time-interval.  With  the  bladder  data¬ 
set,  the  (start, stop)  data  format  provides  a  way  to  indicate 
that  a  subject  experienced  multiple  events. 
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As  mentioned  in  the  beginning  of  our  discussion  of  R,  the 
code  library(survival)  must  be  submitted  at  each  session 
before  survival  functions  in  R  can  be  accessed. 

library(survival) 

The  coxph  function  can  be  used  to  run  Cox  models  with 
recurrent  events.  First,  well  define  a  response  variable 
using  the  Surv  function  (called  Y): 

Y=Surv(bladder$start,bladder$stop,bladder$event==l) 

As  we  have  seen  in  Section  7,  the  Surv  function  requires 
three  arguments  with  data  in  the  counting  process  format: 
the  start  variable  (called  START),  the  stop  variable  (called 
STOP),  and  the  status  variable  (called  EVENT).  The  code 
bladder$event==l  indicates  that  an  event  is  coded  1.  R 
recognizes  the  value  1  as  the  default  coding  of  an  event,  so 
it  was  not  necessary  to  state  this  explicitly  in  the  Surv 
function  as  we  did.  Next,  a  recurrent-events  Cox  model  is 
run  with  the  predictors:  treatment  status  (TX),  initial  num¬ 
ber  of  tumors  (NUM),  and  the  initial  size  of  tumors  (SIZE): 

coxph(Y  ~  tx  4  num  +  size  +  cluster(id),  data=bladder) 

The  term  +  cluster(id)  in  the  model  formula  requests 
robust  standard  errors  for  the  parameter  estimates.  The 
model  output  follows: 

coef  exp(coef)  se(coef)  robust  se  z  p 

tx  -0.4116  0.663  0.1999  0.2488  -1.655  0.0980 

num  0.1637  1.178  0.0478  0.0584  2.801  0.0051 

size  -0.0411  0.960  0.0703  0.0742  -0.554  0.5800 

Likelihood  ratio  test=14.7  on  3  df,  p=0. 00213  n=190 

The  treatment  variable  (TX)  is  coded  1  for  treatment  with 
thiotepa  and  0  for  the  placebo.  The  estimated  hazard  ratio 
(TX=1  vs.  TX=0)  is  0.663  (with  ap-value  of  0.0980).  There 
are  two  sets  of  standard  errors  presented  in  the  table  under 
the  columns  labeled:  se(coef)  and  robust  se.  The  p -values 
and  "-test  statistics  in  this  table  are  calculated  using  the 
robust  standard  errors.  We  could  obtain  additional  model 
output  (including  95%  CIs)  by  applying  the  summary 
function  to  the  coxph  function. 

A  stratified  Cox  model  can  also  be  run  using  the  data  in  this 
format  with  the  variable  INTERVAL  as  the  stratified  vari¬ 
able.  The  stratified  variable  indicates  whether  the  subject 
was  at  risk  for  their  first,  second,  third,  or  fourth  event. 
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This  approach  is  called  a  Stratified  CP  recurrent  event 
model  (see  Chap.  8)  and  is  used  if  the  investigator  wants 
to  distinguish  the  order  in  which  recurrent  events  occur. 
The  bladder  data  is  in  the  proper  format  to  run  this  model. 
The  code  and  output  follow: 

coxph(Y  ~  tx  +  num  +  size  +  strata(interval)  +  cluster 
(id),data=bladder) 

coef  exp(coef)  se(coef)  robust  se  z  p 

tx  -0.3335  0.716  0.2162  0.2048  -1.628  0.10 

num  0.1196  1.127  0.0533  0.0514  2.328  0.02 

size  -0.0085  0.992  0.0728  0.0616  -0.138  0.89 

Likelihood  ratio  test=6.51  on  3  df,  p=0.0893  n=190 

The  only  additional  code  from  the  previous  model  is  the 
term  +  strata(interval)  in  the  model  formula  which  indi¬ 
cates  that  INTERVAL  is  the  stratified  variable.  Interaction 
terms  between  the  treatment  variable  (TX)  and  the  strati¬ 
fied  variable  could  be  created  to  examine  whether  the 
effect  of  treatment  differed  for  the  1st,  2nd,  3rd,  or  4th  event. 

Another  stratified  approach  (called  Gap  Time)  is  a  slight 
variation  of  the  Stratified  CP  approach.  The  difference  is  in 
the  way  the  time  intervals  for  the  recurrent  events  are 
defined.  There  is  no  difference  in  the  time  intervals  when 
subjects  are  at  risk  for  their  first  event.  However,  with  the 
Gap  Time  approach,  the  starting  time  at  risk  gets  reset  to 
zero  for  each  subsequent  event.  To  run  a  Gap  Time  model, 
we  need  to  create  two  new  (start,  stop)  variables  in  the 
bladder  dataset,  which  we'll  call  START2  and  STOP2: 

bladder  $  start2 = 0 

bladder$stop2=bladder$stop  -  bladder$start 

The  first  of  the  two  newly  defined  variables  (START2)  is 
always  zero.  The  second  (STOP2)  is  defined  as  the  time 
between  each  event  (STOP-START).  To  print  a  subset  of 
these  variables,  we  can  use  the  data.frame  function.  The 
attach  function  allows  variables  in  the  bladder  dataset  to 
be  listed  without  the  bladder$  prefix  (code  and  output  for 
printing  the  12th-20lh  observation  below). 

attach(bladder) 

data.frame(id,event,start,stop,start2,stop2)[12:20,  ] 
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id 

event 

start 

stop 

start2 

sto; 

12 

10 

1 

0 

12 

0 

12' 

13 

10 

1 

12 

16 

0 

4 

14 

10 

0 

16 

18 

0 

2 

15 

11 

0 

0 

23 

0 

23 

16 

12 

1 

0 

10 

0 

10 

17 

12 

1 

10 

15 

0 

5 

18 

12 

0 

15 

23 

0 

8 

19 

13 

1 

0 

3 

0 

3 

20 

13 

1 

3 

16 

0 

13 

Next  we  need  to  reset  our  response  variable  using  the  Surv 
function  by  changing  our  time  intervals  from  (START, 
STOP)  to  (START2,  STOP2): 

Y2=Surv(bladder$start2,bladder$stop2,bladder$event) 

Next  we  run  a  Gap  Time  model  with  the  bladder  data  using 
similar  code  that  was  used  for  the  Stratified  CP  model 
except  we  use  Y2  rather  than  Y  as  our  response  variable. 
The  code  and  output  follow: 

coxph(Y2  ~  tx  4-  num  +  size  +  strata(interval)  +  cluster(id),data=bladder) 


coef 

exp  (coef) 

se  (coef) 

robust  se 

z 

P 

tx 

-0.27900 

0.757 

0.2073 

0.2156 

-1.294 

0.2000 

num 

0.15805 

1.171 

0.0519 

0.0509 

3.103 

0 . 0019 

size 

0.00742 

1.007 

0.0700 

0.0643 

0 . 115 

0.9100 

Likelihood  ratio  test=9.33  on  3  df,  p=0.0252  n=190 

The  results  using  the  Gap  Time  approach  varies  slightly 
from  that  obtained  using  the  Stratified  CP  approach. 
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True-False  Questions: 

1.  T 

2.  T 

3.  T 

4.  F:  Step  function. 

5.  F:  Ranges  between  0  and  1. 

6.  T 

7.  T 

8.  T 

9.  T 

10.  F:  Median  survival  time  is  longer  for  group  1  than  for 
group  2. 

11.  F:  Six  weeks  or  greater. 

12.  F:  The  risk  set  at  7  weeks  contains  15  persons. 

13.  F:  Hazard  ratio. 

14.  T 

15.  T 

16.  h(t)  gives  the  instantaneous  potential  per  unit  time  for 
the  event  to  occur  given  that  the  individual  has 
survived  up  to  time  t;k(t)  is  greater  than  or  equal  to 
0;  h(t)  has  no  upper  bound. 

17.  Hazard  functions 

•  give  insight  about  conditional  failure  rates; 

•  help  to  identify  specific  model  forms  (e.g., 
exponential,  Weibull); 

•  are  used  to  specify  mathematical  models  for 
survival  analysis. 

18.  Three  goals  of  survival  analysis  are: 

•  to  estimate  and  interpret  survivor  and/or  hazard 
functions; 

•  to  compare  survivor  and/or  hazard  functions; 

•  to  assess  the  relationship  of  explanatory  variables 
to  survival  time. 
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?0') 

mj 

<7; 

R(f(/)) 

Group  1:  0 

0 

0 

25  persons  survive  >  0  years 

1.8 

1 

0 

25  persons  survive  >1.8  years 

2.2 

1 

0 

24  persons  survive  >  2.2  years 

2.5 

1 

0 

23  persons  survive  >  2.5  years 

2.6 

1 

0 

22  persons  survive  >  2.6  years 

3.0 

1 

0 

21  persons  suivive  >  3.0  years 

3.5 

1 

0 

20  persons  survive  >  3.5  years 

3.8 

1 

0 

19  persons  survive  >3.8  years 

5.3 

1 

0 

18  persons  survive  >5.3  years 

5.4 

1 

0 

17  persons  survive  >5.4  years 

5.7 

1 

0 

16  persons  survive  >5.7  years 

6.6 

1 

0 

15  persons  survive  >  6.6  years 

8.2 

1 

0 

14  persons  survive  >  8.2  years 

8.7 

1 

0 

13  persons  survive  >8.7  years 

9.2 

2 

0 

12  persons  survive  >  9.2  years 

9.8 

1 

0 

10  persons  survive  >  9.8  years 

10.0 

1 

0 

9  persons  survive  >  10.0  years 

10.2 

1 

0 

8  persons  survive  >  10.2  years 

10.7 

1 

0 

7  persons  survive  >  10.7  years 

11.0 

1 

0 

6  persons  survive  >11.0  years 

11.1 

1 

0 

5  persons  survive  >11.1  years 

11.7 

1 

3 

4  persons  survive  >11.7  years 

Chapter  2 


20.  a.  Group  1  has  a  better  survival  prognosis  than 
group  2  because  group  1  has  a  higher  average 
survival  time  and  a  correspondingly  lower 
average  hazard  rate  than  group  2. 
b.  The  average  survival  time  and  average  hazard  rates 
give  overall  descriptive  statistics.  The  survivor 
curves  allow  one  to  make  comparisons  over  time. 


1.  a.  KM  plots  and  the  log  rank  statistic  for  the  cell 
type  1  variable  in  the  vets. data  dataset  are  shown 
below. 


Group  0  1 
Group  1  2 


KM  Plots 


0 


200 


400 
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Group 

Events  observed 

Events  expected 

1 

102 

93.45 

2 

26 

34.55 

Total 

128 

128.00 

Log  rank  =  chi2(l)  =  3.02 
p-value  =  Pr  >  chi2  =  0.0822 

The  KM  curves  indicate  that  persons  with  large 
cell  type  have  a  consistently  better  prognosis  than 
persons  with  other  cell  types,  although  the  two 
curves  are  essentially  the  same  very  early  on  and 
after  250  days.  The  log  rank  test  is  not  significant 
at  the  .05  level,  which  gives  somewhat  equivocal 
findings. 

b.  KM  plots  and  the  log  rank  statistic  for  the  four 
categories  of  cell  type  are  shown  below. 


The  KM  curves  suggest  that  persons  with  adeno  or 
small  cell  types  have  a  poorer  survival  prognosis 
than  persons  with  large  or  squamous  cell  types. 
Moreover,  there  does  not  appear  to  be  a 
meaningful  difference  between  adeno  or  small 
cell  types.  Also,  persons  with  squamous  cell  type 
seem  to  have,  on  the  whole,  a  better  prognosis 
than  persons  with  large  cell  type. 

Computer  results  from  Stata  giving  log  rank 
statistics  are  now  shown. 


Group 

Events  observed 

Events  expected 

1 

26 

34.55 

2 

26 

15.69 

3 

45 

30.10 

4 

31 

47.65 

Total 

128 

128.00 

Log  rank  =  chi2(3)  =  25.40 
P-value  =  Pr  >  chi2  =  0.0000 
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The  log-rank  test  yields  highly  significant  p-values, 
indicating  that  there  is  some  overall  difference 
between  all  four  curves;  that  is,  the  null  hypothesis 
that  the  four  curves  have  a  common  survival  curve 
is  rejected, 

2.  a.  KM  plots  for  the  two  clinics  are  shown  below.  These 
plots  indicate  that  patients  in  clinic  2  have  a 
consistently  better  prognosis  for  remaining  under 
treatment  than  do  patients  in  clinic  1 .  Moreover,  it 
appears  that  the  difference  between  the  two  clinics 
is  small  before  one  year  of  follow-up  but  diverges 
after  one  year  of  follow  up. 


b.  The  log  rank  statistic  (27.893)  and  Wilcoxon 
statistic  (1 1.63)  are  both  significant  well  below  the 
.01  level,  indicating  that  the  survival  curves  for  the 
two  clinics  are  significantly  different.  The  log  rank 
statistic  is  nevertheless  much  larger  than  the 
Wilcoxon  statistic,  which  makes  sense  because  the 
log  rank  statistic  emphasizes  the  later  survival 
experience,  where  the  two  survival  curves  are  far 
apart,  whereas  the  Wilcoxon  statistic  emphasizes 
earlier  survival  experience,  where  the  two  survival 
curves  are  closer  together. 

c.  If  methadone  dose  is  categorized  into  high  (70+), 
medium  (55-70)  and  low  (<55),  we  obtain  the  KM 
curves  shown  below. 
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The  KM  curves  indicate  that  persons  with  high 
doses  have  a  consistently  better  survival  prognosis 
(i.e..  maintenance)  than  persons  with  medium  or 
low  doses.  The  latter  two  groups  are  not  very 
different  from  each  other,  although  the  medium 
dose  group  has  a  somewhat  better  prognosis  up  to 
the  first  400  days  of  follow-up. 

The  log  rank  test  statistic  is  shown  below  for  the 
above  categorization  scheme. 


Group 

Events  observed 

Events  expected 

0 

45 

30.93 

1 

74 

54.09 

2 

31 

64.99 

Total 

150 

150.00 

Log  rank  =  chi2(2)  =  33.02 
P-value  =  Pr>chi2  =  0.0000 


The  test  statistic  is  highly  significant,  indicating 
that  these  three  curves  are  not  equivalent. 

1.  a.  h(t,X)  =  hoWexpfpjTl  +  p2T2  +  p3PS  +  p4DC 
+  p5BF  +  p6(Tl  x  PS)  +  p7(T2  x  PS) 

+  p8(Tl  x  DC)  +  p9(T2  x  DC) 

+  p10(TlxBF)+p11(T2xBF)] 

b.  Intervention  A:  X*  =  (1,  0,  PS,  DC,  BF,  PS,  0, 
DC,  0,  BE  0) 

Intervention  C:  X  =  (—  1,-1,  PS,  DC,  BF,  -PS, 
-PS,  -DC,  -DC,  -BF,  -BF) 

HR  =  ^^y  =  exp[2p!+  P2  +  2p6PS+  P7PS 

+  2p8DC+  p9DC  +  2p10BF 
+  Pn  BF] 

c.  H0:  p6  =  p7  =  p8  =  p9  =  p10  =  Pn  =  0  in  the  full 
model. 

Likelihood  ratio  test  statistic:  -  2  In  LR  -  (— 2InLF), 
which  is  approximately  /\  under  H0,  where  R 
denotes  the  reduced  model  (containing  no  product 
terms)  under  H0,  and  F  denotes  the  full  model 
(given  in  Part  la  above) 

d.  The  two  models  being  compared  are: 

Full  model  (F):  h(t,X)  =  FoWexpl^Tl  +  p2T2  +  p3PS 
+  p4DC  +  p5BF] 
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Reduced  model  ( R ):  h(t,X )  =  /20(t)exp[p3PS  +  p4DC 
+  PsBF] 

H0:  Pj  =  p2  =  0  in  the  full  model 
Likelihood  ratio  test  statistic:  2\nLR  —  (-21nLF), 
which  is  approximately  under  H0. 
e. 

Intervention  A:  _  _ 

_  J^^^jexP[A+(PS)fe+(DC)^4+(BF)ft] 

Intervention  B:  _  _  _ 

S(t.X)  =  [S0(f)]ex  p[A+(ps)^+(dc)A+(bf)A] 

Intervention  C:  _  _  _ 

S(t.X)  =  [S0(?)]ex p[— A— A+(ps)A+(DC)ft+(BF)fe] 

a.  /z(f,X)  =  ho(t)exp[p!  CHR  +  p2  AGE  +  p3(CHR  x 

AGE)] 

b.  H0:  p3  =  0 

LR  statistic  =  264.90  -  264.70  =  0.21;  j2  with  1  d.f. 
under  H0;  not  significant. 

Wald  statistic  gives  a  chi-square  value  of  .01,  also 
not  significant.  Conclusions  about  interaction:  the 
model  should  not  contain  an  interaction  term. 

c.  When  AGE  is  controlled  (using  the  gold  standard 
model  2),  the  hazard  ratio  for  the  effect  of  CHR  is 
exp(.8051)  =  2.24,  whereas  when  AGE  is  not 
controlled,  the  hazard  ratio  for  the  effect  of  CHR 
(using  Model  1)  is  exp(.8595)  =  2.36.  Thus,  the 
hazard  ratios  are  not  appreciably  different,  so 
AGE  is  not  a  confounder. 

Regarding  precision,  the  95%  confidence  interval 
for  the  effect  of  CHR  in  the  gold  standard  model 
(Model  2)is  given  by  exp[.8051  ±  1.96(.3252)]  = 
(1.183,  4.231)  whereas  the  corresponding  95% 
confidence  interval  in  the  model  without  AGE 
(Model  1)  is  given  by  exp[.8595  ±  1.96(.3116)]  = 
(1.282,  4.350).  Both  confidence  intervals  have 
about  the  same  width,  with  the  latter  interval 
being  slightly  wider.  Thus,  controlling  for  AGE  has 
little  effect  on  the  final  point  and  interval  estimates 
of  interest. 

d.  If  the  hazard  functions  cross  for  the  two  levels 
of  the  CHR  variable,  this  would  mean  that  none 
of  the  models  provided  is  appropriate,  because 
each  model  assumes  that  the  proportional  hazards 
assumption  is  met  for  each  predictor  in  the  model. 
If  hazard  functions  cross  for  CHR,  however, 
the  proportional  hazards  assumption  cannot  be 
satisfied  for  this  variable. 
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e. 


f. 


for  CHR  =  1  :  S(>,  X)  =  W*>E)] 

For  CHR  =  0  :  X)  =  [SoVi  “PI““,4CE,1 

Using  Model  1,  which  is  the  best  model,  there  is 
evidence  of  a  moderate  effect  of  CHR  on  survival 
time,  because  the  hazard  ratio  is  about  2.4  with  a 
95%  confidence  interval  between  1.3  and  4.4,  and 
the  Wald  text  for  significance  of  this  variable  is 
significant  below  the  .01  level. 
a3.  Full  model  (F  =  Model  1):  h(t,X)=h0(t)exTp[fiiRx 
+  B2Sex+R3log  WBC+B4(iCr  x  Sex) 

+  p5(Pv  x  log  WBC)] 

Reduced  model  ( R  =  model  4): 

h(t,X.)  =  h0(t)  expfPjRx  +  p2Sex+  p3logWBC] 

H0:  P4— P5=0 


LR  statistic  =  144.218  -  139.030  =  5.19;  x2  with 
2  d.f.  under  H0\  not  significant  at  0.05,  though 
signilicant  at  0.10.  The  chunk  test  indicates  some 
(though  mild)  evidence  of  interaction. 

b.  Using  either  a  Wald  test  (p-value  =  .776)  or  a  LR 
test,  the  product  term  Rx  x  log  WBC  is  clearly  not 
significant,  and  thus  should  be  dropped  from 
Model  1.  Thus,  Model  2  is  preferred  to  Model  1. 

c.  Using  Model  2,  the  hazard  ratio  for  the  effect  of 
Rx  is  given  by  HR  -  (h(t,X*))/(h(t,X))  =  exp[0.405 
+  2.013  Sex]  _ 

d.  Males  (Sex  =  0)  :  HR  =  exp[0.405]  =  1.499 
Females  (Sex  =  1)  :  HR  =  exp[0.405  +  2.013(1)]  = 


11.223 


e.  Model  2  is  preferred  to  Model  3  if  one  decides  that 
the  coefficients  for  the  variables  Rx  and  Rx  x  Sex 
are  meaningfully  different  for  the  two  models. 
It  appears  that  such  corresponding  coefficients 
(0.405  vs.  0.587  and  2.013  vs.  1.906)  are  different. 
The  estimated  hazard  ratios  tor  Model  3  are  1.799 
(males)  and  12.098  (females),  which  are  different, 
but  not  very  different  from  the  estimates  computed 
in  Part  3d  for  Model  2.  If  it  is  decided  that  there 
is  a  meaningful  difference  here,  then  we  would 
conclude  that  log  WBC  is  a  confounder;  otherwise 
log  WBC  is  not  a  confounder.  Note  that  the  log  WBC 
variable  is  significant  in  Model  2  (P  =  .000),  but  this 
addresses  precision  and  not  confounding.  When  in 
doubt,  as  in  this  case,  the  safest  thing  to  do  (for 
validity  reasons)  is  to  control  for  log  WBC. 

f.  Model  2  appears  to  be  best,  because  there  is 
significant  interaction  of  Rx  x  Sex  (P  =  .023)  and 
because  log  WBC  is  a  likely  confounder  (from  Part  e). 
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1 .  The  P(PH )  values  in  the  printout  provide  GOF 
statistics  for  each  variable  adjusted  for  the  other 
variables  in  the  model  These  P{PH)  values  indicate 
that  the  clinic  variable  does  not  satisfy  the  PH 
assumption  (P  <<  .01),  whereas  the  prison  and  dose 
variables  satisfy  the  PH  assumption  (P>.10). 

2.  The  log-log  plots  shown  are  parallel.  However,  the 
reason  why  they  are  parallel  is  because  the  clinic 
variable  has  been  included  in  the  model,  and  log-log 
curves  for  any  variable  in  a  PH  model  must  always  be 
parallel.  If,  instead,  the  clinic  variable  had  been 
stratified  (i.e.,  not  included  in  the  model),  then  the 
log-log  plots  comparing  the  two  clinics  adjusted  for 
the  prison  and  dose  variables  might  not  be  parallel. 

3.  The  log-log  plots  obtained  when  the  clinic  variable  is 
stratified  (i.e.,  using  a  stratified  Cox  PH  model)  are  not 
parallel.  They  intersect  early  on  in  follow-up  and 
diverge  from  each  other  later  in  follow-up.  These 
plots  therefore  indicate  that  the  PH  assumption  is 
not  satisfied  for  the  clinic  variable. 

4.  Both  graphs  of  log-log  plots  for  the  prison  variable 
show  curves  that  intersect  and  then  diverge  from  each 
other  and  then  intersect  again.  Thus,  the  plots  on  each 
graph  appear  to  be  quite  nonparallel,  indicating  that 
the  PH  assumption  is  not  satisfied  for  the  prison 
variable.  Note,  however,  that  on  each  graph,  the 
plots  are  quite  close  to  each  other,  so  that  one  might 
conclude  that,  allowing  for  random  variation,  the  two 
plots  are  essentially  coincident;  with  this  latter  point 
of  view,  one  would  conclude  that  the  PH  assumption 
is  satisfied  for  the  prison  variable. 

5.  The  conclusion  of  nonparallel  log-log  plots  in 
Question  4  gives  a  different  result  about  the  PH 
assumption  for  the  prison  variable  than  determined 
from  the  GOF  tests  provided  in  Question  1.  That  is, 
the  log-log  plots  suggest  that  the  prison  variable 
does  not  satisfy  the  PH  assumption,  whereas  the 
GOF  test  suggests  that  the  prison  variable  satisfies 
the  assumption.  Note,  however,  if  the  point  of  view  is 
taken  that  the  two  plots  are  close  enough  to  suggest 
coincidence,  the  graphical  conclusion  would  be  the 
same  as  the  GOF  conclusion.  Although  the  final 
decision  is  somewhat  equivocal  here,  we  prefer  to 
conclude  that  the  PH  assumption  is  satisfied  for  the 
prison  variable  because  this  is  strongly  indicated  from 
the  GOF  test  and  questionably  counterindicated  by 
the  log-log  curves. 
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6.  Because  maximum  methadone  dose  is  a  continuous 
variable,  we  must  categorize  this  variable  into  two  or 
more  groups  in  order  to  graphically  evaluate  whether 
it  satisfies  the  PH  assumption.  Assume  that  we  have 
categorized  this  variable  into  two  groups,  say,  low 
versus  high.  Then,  observed  survival  plots  can  be 
obtained  as  KM  curves  for  low  and  high  groups 
separately  To  obtain  expected  plots,  we  can  fit  a  Cox 
model  containing  the  dose  variable  and  then 
substitute  suitably  chosen  values  for  dose  into  the 
formula  for  the  estimated  survival  curve.  Typically, 
the  values  substituted  would  be  either  the  mean  or 
median  (maximum)  dose  in  each  group. 

After  obtaining  observed  and  expected  plots  for  low 
and  high  dose  groups,  we  would  conclude  that  the  PH 
assumption  is  satisfied  if  corresponding  observed  and 
expected  plots  art;  not  widely  discrepant  from  each 
other.  If  a  noticeable  discrepancy  is  found  for  at  least 
one  pair  of  observed  versus  expected  plots,  we 
conclude  that  the  PH  assumption  is  not  satisfied. 

7.  /z(t,X)  =  /zo^lexptPj  clinic  +  p2  prison  +  p3  dose 

+  §!  (clinic  x  g(t))  +  §2  (prison  x  g(t)) 

+  §3  (dose  x  g(t))] 

where  g(t)  is  some  function  of  time.  The  null 
hypothesis  is  given  by  H0:  8j  =  82  =  83  =  0.  The  test 
statistic  is  a  likelihood  ratio  statistic  of  the  form  LR  = 
— 21n Lr  —  (— 2In LF)  where  R  denotes  the  reduced  (PH) 
model  obtained  when  all  8s  are  0,  and  F  denotes  the 
full  model  given  above.  Under  H0,  the  LR  statistic  is 
approximately  chi-square  with  3  d.f. 

8.  Drawbacks  of  the  extended  Cox  model  approach: 

•  Not  always  clear  how  to  specify  g(t);  different 
choices  may  give  different  conclusions; 

•  Different  modeling  strategies  to  choose  from,  for 
example,  might  consider  g(t)  to  be  a  polynomial 
in  t  and  do  a  backward  elimination  to  eliminate 
nonsignificant  higher-order  terms;  alternatively, 
might  consider  g(t)  to  be  linear  in  t  without 
evaluating  higher-order  terms. 

Different  strategies  may  yield  different  conclusions. 
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9.  h(t,X )  =  /20(f)exp[|T|  clinic  +  p2prison  +  p3  dose  + 
81(clinic  x  g(/))]  where  git)  is  some  function  of  time. 
The  null  hypothesis  is  given  by  H0:  Sj  =  0,  and  the 
test  statistic  is  either  a  Wald  statistic  or  a  likelihood 
ratio  statistic.  The  LR  statistic  would  be  of  the  form 
LR  =  —2  In  Lr  —  (— 2In  LF),  where  R  denotes  the 
reduced  (PH)  model  obtained  when  §!  =  0,  and  F 
denotes  the  full  model  given  above.  Either  statistic  is 
approximately  chi-square  with  1  d.f.  under  the  null 
hypothesis. 

10.  t  >  365  days:  HR  =  exp[Pi  +  §)] 
t  <  365  days:  HR  =  expfp!] 

If  §i  is  not  equal  to  zero,  then  the  model  does  not  satisfy 
the  PH  assumption  for  the  clinic  variable.  Thus,  a  test 
of  H0:  8 1  0  evaluates  the  PH  assumption;  a  significant 
result  would  indicate  that  the  PH  assumption  is 
violated.  Note  that  if  §i  is  not  equal  to  zero,  then  the 
model  assumes  that  the  hazard  ratio  is  not  constant 
over  time  by  giving  a  different  hazard  ratio  value 
depending  on  whether  t  is  greater  than  365  days  or 
t  is  less  than  or  equal  to  365  days. 

1 .  By  fitting  a  stratified  Cox  (SC)  model  that  stratifies  on 
clinic,  we  can  compare  adjusted  survival  curves  for 
each  clinic,  adjusted  for  the  prison  and  dose 
variables.  This  will  allow  us  to  visually  describe  the 
extent  of  clinic  differences  on  survival  over  time. 
However,  a  drawback  to  stratifying  on  clinic  is  that  it 
will  not  be  possible  to  obtain  an  estimate  of  the  hazard 
ratio  for  the  effect  of  clinic,  because  clinic  will  not  be 
included  in  the  model. 

2.  The  adjusted  survival  surves  indicate  that  clinic  2  has 
a  better  survival  prognosis  than  clinic  1  consistently 
over  time.  Moreover,  it  seems  that  the  difference 
between  the  effects  of  clinic  2  and  clinic  1  increases 
over  lime. 

3.  hg(t,  X)  =  /i0  (t)exp[  /ij  prison  +  /?2  dose],  g  =  1,2 
This  is  a  no-interaction  model  because  the  regression 
coefficients  for  prison  and  dose  are  the  same  for  each 
stratum. 

4.  Effect  of  prison,  adjusted  for  clinic  and  dose:  HR  = 
1.475,  95%  Cl:  (1.059,  2.054).  It  appears  that  having 
a  prison  record  gives  a  1.475  increased  hazard  for 
failure  than  not  having  a  prison  record.  The  p-value 
is  0.021,  which  is  significant  at  the  0.05  level. 

Version  1:  hg  (t,  X)  =  /?os(f)exp[  plg  prison  +  p2g  dose], 
5=1,2 

Version  2:  hg  (r,  X)  =  /*os(/jexp[  prison  +  f}2  dose 
+  /?3  (clinic  x  prison)  +  /?4  (clinic  x  dose)],  g  =  1,2 


5. 
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6.  g  =  1  (clinic  1): 

hi(t,X)  =  /ioi(t)exp[(0.502)prison  +  (—0.036)  dose] 
g  =  2  (clinic  2): 

h2(t,X)  =  /z02(t)exp[(— 0.083)prison  +  (— 0.037)dose] 

7.  The  adjusted  survival  curves  stratified  by  clinic 
are  virtually  identical  for  the  no-interaction  and 
interaction  models.  Consequently,  both  graphs  (no¬ 
interaction  versus  interaction)  indicate  the  same 
conclusion  that  clinic  2  has  consistently  larger 
survival  (i.e.,  retention)  probabilities  than  clinic  1  as 
time  increases. 

8.  H0:  p3  —  p4  =  0  in  the  version  2  model  (i.e.,  the  no¬ 

interaction  assumption  is  satisfied).  LR  =  -  2In 
Lk .  (—2  In  LF)  where  R  denotes  the  reduced  (no¬ 

interaction)  model  and  F  denotes  the  full  (interaction) 
model.  Under  the  null  hypothesis,  LR  is  approximately 
a  chi  square  with  2  degrees  of  freedom. 

Computed  LR  =  1195.428  —  1193.558=  1.87;  p-value  = 
0.395;  thus,  the  null  hypothesis  is  not  rejected  and 
we  conclude  that  the  no  interaction  model  is 
preferable  to  the  interaction  model. 


1 .  For  the  chemo  data,  the  -log-log  KM  curves  intersect 
at  around  600  days;  thus  the  curves  are  not  parallel, 
and  this  suggests  that  the  treatment  variable  does  not 
satisfy  the  PH  assumption. 

2.  The  P  (PH)  value  for  the  Tx  variable  is  0,  indicating 
that  the  PH  assumption  is  not  satisfied  for  the 
treatment  variable  based  on  this  goodness-of-fit  test. 

3.  h(t,X)  =  /zoOOexptPdT x)g1(t)  +  p 2(T x)g2(t) 

+  P  3(T  x)g3(t)] 

where 


*lW  =  {o 
*2(')  =  {o 
*1 2 3 4(')  =  {o 


if  0  <  t  <  250  days 

if  otherwise 

if  25  0  <  t  <  500  days 

if  otherwise 

if  t  >  500  days 

if  otherwise 


4.  Based  on  the  printout  the  hazard  ratio  estimates  and 
corresponding  p-values  and  95%  confidence  intervals 
are  given  as  follows  for  each  time  interval: 
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Haz.  Ratio 

p>  M 

[95%  Conf. 
Interval] 

0  <  t  <  250  days: 

0.221 

0.001 

0.089  0.545 

250  <  t  <  500  days: 

1.629 

0.278 

0.675  3.934 

t  >  500  days: 

1.441 

0.411 

0.604  3.440 

The  results  show  a  significant  effect  of  treatment 
below  250  days  and  a  nonsignificant  effect  of 
treatment  in  each  of  the  two  intervals  after  250 
days.  Because  the  coding  for  treatment  was  1  = 
chemotherapy  plus  radiation  versus  2  =  chemo¬ 
therapy  alone,  the  results  indicate  that  the  hazard 
for  chemotherapy  plus  radiation  is  1/0.221  =  4.52 
times  the  hazard  for  chemotherapy  alone.  The 
hazard  ratio  inverts  to  a  value  less  than  1  (in  favor 
of  chemotherapy  plus  radiation  after  250  days), 
but  this  result  is  nonsignificant.  Note  that  for 
the  significant  effect  of  1/0.221  =  4.52  below  250 
days,  the  95%  confidence  interval  ranges  between 
1/0.545=  1.83  and  1/0.089=  11.24  when  inverted, 
which  is  a  very  wide  interval. 

5.  Model  with  two  Heaviside  functions: 

h(t,X)  =  h0(t)exTp[^1(Tx)g1(t)  +  p2(7x)g2(/)] 
where 


*l(f)  =  {o 
82(t)  =  {  J 


if  0  <  t  <  250  days 
if  otherwise 
if  t  >  250  days 
if  otherwise 


Model  with  one  Heaviside  function: 

Ht,X)  =  h0(t)exp\$\(Tx)  +  p2(7h:)gi(/)] 
where  gi(f)  is  defined  above. 

6.  The  results  for  two  time  inteivals  give  hazard  ratios 
that  are  on  the  opposite  side  of  the  null  value  (i.e.,  1). 
Below  250  days,  the  use  of  chemotherapy  plus 
radiation  is,  as  in  the  previous  analysis,  4.52  times 
the  hazard  when  chemother  apy  is  used  alone.  This 
result  is  significant  and  the  same  confidence  interval 
is  obtained  as  before.  Above  250  days,  the  use  of 
chemotherapy  alone  has  1.532  times  the  hazard  of 
chemotherapy  plus  radiation,  but  this  result  is 
nonsignificant. 


678  Test  Answers 


Chapter  7 


1.  F:  They  are  multiplicative  models,  although  additive 
on  the  log  scale. 

2.  T 

3.  T 

4.  F:  If  the  AFT  assumption  holds  in  a  log-logistic  model, 
the  proportional  odds  assumption  holds. 

5 .  F:  An  acceleration  factor  greater  than  one  suggests  the 
exposure  is  beneficial  to  survival. 

6.  T 

7.  T 

8.  T 

9.  F:  ln(T)  follows  an  extreme  value  minimum 
distribution. 

10.  F:  The  subject  is  right-censored. 

11.  _  exp[a0  +  0£i (2)  +  a2PRISON  +  a 3DOSE  +  a4PRISDOSE\ 
1  exp[ao  +  oci  (1)  +  a2PRISON  +  a2DOSE  +  01.4PRISDOSE } 

=  exp(ai) 

y  =  exp(0.698)  =  2.01 

95%  Cl  =  exp[0.698  ±  1.96(0.158)]  =  (1.47,  2.74) 

The  point  estimate  for  the  acceleration  factor  (2.01) 
suggests  that  the  survival  time  (time  off  heroin)  is 
double  for  those  enrolled  in  CLINIC  =  2  compared  to 
CLINIC  =  1.  The  95%  confidence  interval  does  not 
include  the  null  value  of  1.0  indicating  a  statistically 
significant  preventive  effect  for  CLINIC  =  2  compared 
to  CLINIC  =  1. 

12.  _  exp[/?0  +  /h (2)  +  [i2PRISON  +  fcDOSE  +  fiAPRISDOSE \ 
~  exp[/?0  +  ft(l)  +  IMPRISON  +  fS3DOSE  +  fl4PRISDOSE\ 
=  exP(/h) 

HR  =  exp(— 0.957)  =  0.38 

95%  Cl  =  exp[— 0.957  ±  1.96(0.213)]  =  (0.25,  0.58) 

The  point  estimate  of  0.38  suggests  the  hazard  of  going 
back  on  heroin  is  reduced  by  a  factor  of  0.38  for  those 
enrolled  in  CLINIC  =  2  compared  to  CLINIC  =  1. 
Or  from  the  other  perspective:  the  estimated  hazard 
is  elevated  for  those  in  CLINIC  =  1  by  a  factor  of  exp 
(+0  957)  =  2.60. 

13.  pj  =  -a.xp  for  CLINIC,  so  (h  =  -(0.698X1.370467)  = 
—0.957,  which  matches  the  output  for  the  PH  form  of 
the  model. 
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14.  The  product  term  PRISDOSE  is  included  in  the  model 
as  a  potential  confounder  of  the  effect  of  CLINIC  on 
survival.  It  is  not  an  effect  modifier  because  under  this 
model  the  hazard  ratio  or  acceleration  factor  for 
CLINIC  does  not  depend  on  the  value  of  PRISDOSE. 
The  PRISDOSE  term  would  cancel  in  the  estimation 
of  the  hazard  ratio  or  acceleration  factor  (see 
Questions  11  and  12).  On  the  other  hand,  a  product 
term  involving  CLINIC  would  be  a  potential  effect 
modifier. 

15.  Using  the  AFT  form  of  the  model: 

=  exp[  «o  +  a i  CLINIC  +  a2  PRISON  +  0:3  DOSE 
+  a4  PRISDOSE] 

Median  survival  time  for  CLINIC  =  2,  PRISON  =  1, 
DOSE  =  50,  PRISDOSE  =  100: 

t=[-lnS(t)]^x^=[-ln(0.5)]^ 

x  exp[/?(l  +  2/1,  +  p2  +  50/?3  +  100^] 

t  (median)  =  403.66  days  (obtained  by  substituting 
parameter  estimates  from  output). 

16.  Using  the  same  approach  as  the  previous  question: 
Median  survival  time  for  CLINIC  =  1,  PRISON  =  1, 
DOSE  =  50,  PRISDOSE  =  100: 

t  =  [— ln(0.5)]1/p  x  exp[p0  +  lp]  +  p2  +  50p3  +  100S4] 
t  (median)  =  200.85  days. 

17.  The  ratio  of  the  median  survival  times  is  403.66/200.85 
=  2.01.  This  is  the  estimated  acceleration  factor  for 
CLINIC  =  2  vs.  CLINIC  =  1  calculated  in  Question  11. 
Note  that  if  we  used  any  survival  probability  (i.e.,  any 
quantile  of  survival  time),  not  just  S(i)  -=  0.5  (the 
median),  we  would  have  obtained  the  same  ratio. 

18.  The  addition  of  the  frailty  component  did  not  change 
any  of  the  other  parameter  estimates  nor  did  it  change 
the  log  likelihood  of  —260.74854. 

19.  If  the  variance  of  the  frailty  is  zero  (theta  =  0),  then 
the  frailty  has  no  effect  on  the  model.  A  variance  of 
zero  means  the  frailty  (a)  is  constant  at  1.  Frailty  is 
defined  as  a  multiplicative  random  effect  on  the 
hazard  h(t|a)  =  ah(t).  If  a  =  1  then  h(t|a)  =  h(t),  and 
there  is  no  frailty. 
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1.  a.  Survival  time  (say,  in  weeks)  to  the  first  event 
(stratum  1): 


t(f) 

nf 

rrif 

Of 

R(t(f>) 

0 

2 

0 

0 

[B,L] 

12 

2 

1 

0 

[B,L] 

20 

1 

1 

0 

[L] 

b.  For  each  approach,  the  observation  for  the  first 
event  is  identical. 

c.  Survival  rime  (say,  in  weeks)  from  the  first  to  the 
second  event  (stratum  2)  using  the  Stratified  CP 
approach: 


t(f) 

nf 

nif 

Of 

R(t(f>) 

0 

0 

0 

0 

- 

16 

1 

1 

0 

[B] 

23 

1 

1 

0 

[L] 

d.  Survival  time  (say,  in  weeks)  from  the  first  to  the 
second  event  (stratum  2)  using  the  Gap  Time 
approach: 


t(f) 

nf 

nif 

Of 

R((f)) 

0 

2 

0 

0 

[B,L] 

3 

2 

1 

0 

[B,L] 

4 

1 

1 

0 

[B] 

e.  Survival  time  (say,  in  weeks)  from  the  first  to  the 
second  event  using  the  Marginal  approach: 


t(f) 

nf 

nif 

Of 

R(t(f>) 

0 

2 

0 

0 

[B,L] 

16 

2 

1 

0 

[B.L] 

23 

1 

1 

0 

[L] 

f.  Correct  choice  is  iii. 

Bonnie  is  at  risk  for  a  second  event  between  times 
12  to  16. 

Lonnie  is  at  risk  for  a  second  event  between  times 
20  to  23. 

Neither  is  in  the  risk  set  for  the  other’s  second  event. 

g.  Correct  choice  is  ii. 

Bonnie  is  at  risk  for  a  second  event  between  times 
0  to  4. 

Lonnie  is  at  risk  for  a  second  event  between  times 
0  to  3. 

Bonnie  is  in  the  risk  set  when  Lonnie  gets  her 
second  event. 
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h.  Correct  choice  is  i. 

Bonnie  is  at  risk  for  a  second  event  between  times 
Oto  16. 

Lonnie  is  at  risk  for  a  second  event  between  times 
0  to  23. 

Lonnie  is  in  the  risk  set  when  Bonnie  gets  her 
second  event. 

2.  a.  Cox  PH  Model  for  CP  approach  to  Defibrillator 
Study: 

h(t,X)  =  h0(t)exp[p  tx  +  y  smoking] 

where  tx  =  1  if  treatment  A,  0  if  treatment  B. 
smoking  status  =  1  if  ever  smoked,  0  if  never 
smoked. 

b.  Using  the  CP  approach,  there  is  no  significant 
effect  of  treatment  status  adjusted  for  smoking. 
The  estimated  hazard  ratio  for  the  effect  of 
treatment  is  1.09,  the  corresponding  P-value  is 
0.42  and  a  95%  Cl  for  the  hazard  ratio  is  (0.88, 
1.33). 

c.  No-interaction  SC  model  for  Marginal  approach: 

hg(t,X)  =  h0g(t)exp[p  tx  +  y  smoking],  g  =  1,  2,3 
Interaction  SC  model  for  Marginal  approach: 
hg{t,X)  =h0g(t)exp[pg  tx  +  yg  smoking],  g=  1,  2,  3 

d.  LR  -  —  21nLR  -(—2 In  LF)  is  approximately  y2  with 
4  df  under 

H  0:no-interaction  SC  model  is  appropriate,  where  R 
denotes  the  reduced  (no  interaction  SC)  model  and 
F  denotes  the  full  (interaction  SC)  model 

e.  The  use  of  a  no-interaction  model  does  not  allow 
you  to  obtain  stratum-specific  HR  estimates,  even 
though  you  are  assuming  that  strata  are  important. 

f.  The  CP  approach  makes  sense  for  these  data 
because  recurrent  defibrillator  (shock)  events  on 
the  same  subject  are  the  same  kind  of  event  no 
matter  when  it  occurred. 

g.  You  might  use  the  Marginal  approach  if  you 
determined  that  different  recurrent  events  on  the 
same  subject  were  different  because  they  were  of 
different  order. 

h.  The  number  in  the  risk  set  (nf)  remains  unchanged 
through  day  68  because  every  subject  who  failed  by 
this  time  was  still  at  risk  for  a  later  event. 

i.  Subjects  3,6,  10,26,  and  31  all  fail  for  the  third  time 
at  day  98  and  are  not  followed  afterwards. 

j.  Subjects  9,  15,  and  28  fail  for  the  second  time  at 
79  days,  whereas  subject  #16  is  censored  at  79  days. 

k.  Subjects  4,  14,  15,  24,  and  29  were  censored 
between  days  111  and  112. 
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l.  Subject  #5  gets  his  first  event  at  45  days  and  his 
second  event  at  68  days,  after  which  he  drops  out 
of  the  study.  This  subject  is  the  first  of  the  36 
subjects  to  drop  out  of  the  study,  so  the  number  in 
the  risk  set  changes  from  36  to  35  after  68  days. 

m.  None  of  the  above. 

n.  The  product  limit  formula  is  not  applicable  to  the 
CP  data;  in  particular,  P(T  >  t|T  >  t)  does  not  equal 
“#  failing  in  time  interval  /#  in  the  risk  set  at  start  of 
interval.” 

o.  Use  the  information  provided  in  Table  T.2  to 
complete  the  data  layouts  for  plotting  the  following 
survival  curves. 

i.  S  i  (t)  =  Pr(T  i  >  t)  where  T,  =  time  to  first  event 
from  study  entry 


t(f) 

nf 

mf 

Of  S(tp)  = 

=  S(tf — ! )  x  Pr  (Tj  >  1 1  Tj  >  t) 

0 

36 

0 

0 

1.00 

33 

36 

2 

0 

0.94 

34 

34 

3 

0 

0.86 

36 

31 

3 

0 

0.78 

37 

28 

2 

0 

0.72 

38 

26 

4 

0 

0.61 

39 

22 

5 

0 

0.47 

40 

17 

1 

0 

0.44 

41 

16 

1 

0 

0.42 

43 

15 

1 

0 

0.39 

44 

14 

1 

0 

0.36 

45 

13 

2 

0 

0.31 

46 

11 

2 

0 

0.25 

48 

9 

1 

0 

0.22 

49 

8 

1 

0 

0.19 

51 

7 

2 

0 

0.19  x  5/7  =  0.14 

57 

5 

2 

0 

0.14  x  3/5  =  0.08 

58 

3 

2 

0 

0.08  x  1/3  =  0.03 

61 

1 

1 

0 

0.03  x  0/1  =  0.00 

ii. 

Gap  Time  S2c(t)  = 

Pr(T2c  >  t)  where  T2c  =  time 

to  second  event  from  first  event. 

ten 

nf 

mf 

Of  S2(t(f)) 

=  S2(t(f_i))xPr(T2  >  1 1  T2  >  t) 

0 

36 

0 

0 

1.00 

5 

36 

1 

0 

0.97 

9 

35 

1 

0 

0.94 

18 

34 

2 

0 

0.89 

20 

32 

1 

0 

0.86 

21 

31 

2 

1 

0.81 

23 

28 

1 

0 

0.78 

24 

27 

1 

0 

0.75 

( Continued  on  next  page) 
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( Continued ) 


t(f) 

nf  ] 

mf  qf 

S2(t(f))  =  S2(t(f_i))xPr(T2  >  1 1  T2  >  t) 

25 

26 

1 

0 

0.72 

26 

25 

2 

0 

0.66 

27 

23 

2 

0 

0.60 

28 

21 

1 

0 

0.58 

29 

20 

1 

0 

0.55 

30 

19 

1 

0 

0.52 

31 

18 

3 

0 

0.43 

32 

15 

1 

0 

0.40 

33 

14 

5 

0 

0  26 

35 

9 

1 

0 

0.23 

39 

8 

2 

0 

0.17 

40 

6 

2 

0 

0.17  x  4/6 

=  0.12 

41 

4 

1 

0 

0.12  x  3/4 

=  0.09 

42 

3 

1 

0 

0.09  x  2/3 

=  0.06 

46 

2 

1 

0 

0.06  x  1/2 

=  0.03 

47 

1 

1 

0 

0.03  x  0/1 

=  0.00 

iii. 

Marginal  S2m 

(0  =  Pr(T2m  >  t)  where  T2m  = 

time  to  second  event  from  study  entry. 

ten 

nf 

mf 

qf 

S(t(f))=S2(t(f-i)) 

xPr(T2>t|T2>t) 

0 

36 

0 

0 

1.00 

63 

36 

2 

0 

0.94 

64 

34 

3 

0 

0.86 

65 

31 

2 

0 

0.81 

66 

29 

3 

0 

0.72 

67 

26 

4 

0 

0.61 

68 

22 

2 

0 

0.56 

69 

20 

1 

0 

0.53 

70 

19 

1 

0 

0.50 

71 

18 

1 

0 

0.47 

72 

17 

2 

0 

0.42 

73 

15 

1 

0 

0.39 

74 

14 

1 

0 

0.36 

76 

13 

1 

0 

0.33 

77 

12 

1 

0 

0.31 

78 

11 

2 

0 

0.25 

79 

9 

3 

1 

0.25  x 

6/9  =  0.17 

80 

5 

2 

0 

0.17  x 

3/5  =  0.10 

81 

3 

2 

0 

0.10  x 

1/3  =  0.03 

97 

1 

1 

0 

0.03  x 

0/1  =  0.00 

The  survival  curves  corresponding  to  the  above  data 
layouts  will  differ  because  they  are  describing 
different  survival  functions.  In  particular,  the 
composition  of  the  risk  set  differs  in  all  three  data 
layouts  and  the  ordered  survival  times  being  plotted 
are  different  as  well. 
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1 .  Cause-specific  no  interaction  model  for  local 
recurrence  of  bladder  cancer  (event  =1): 

h,(t,X)  =  h0i(t)exp[pntx  +  p21num  +  p3,size] 

2.  Censored  subjects  have  bladder  metastasis  (event  =  2) 
or  other  metastasis  (event  =  3). 

3.  Cause-specific  no-interaction  model  for  bladder 
metastasis  (event  =2): 

h2(t,X)  =  h02(t)exp[p12tx  +  p22  +  p32size] 

where  censored  subjects  have  local  recurrence  of 
bladder  cancer  (event  =  1)  or  other  metastasis 
(event  =  3). 

4.  A  sensitivity  analysis  would  consider  worst-case 
violations  of  the  independence  assumption.  For 
example,  subjects  censored  from  failing  from  events 
=  2  or  3  might  be  treated  in  the  analysis  as  either  all 
being  event-free  (i.e.,  change  event  status  to  0  and 
time  to  53)  or  all  experiencing  the  event  of  interest 
(i.e.,  change  event  status  to  1  and  leave  time  as  is). 

5 .  a .  V erify  the  CIC  i  calculation  provided  at  failure  time 

tf=8  for  persons  in  the  treatment  group  (tx  =1): 

11,(8)  =  1/23  =  0.0435 

S(4)  =  S(3)Pr(T  >  4|T  >  4)  =  0.9630(1  -  2/26) 
=  0.9630(0.9231)  =  0.8889 
i,  (8)  =  hi(8)S(4)  =  0.0435(.8889)  =  0.0387 
CIC,  (8)  =  CIC,  (4)  +  0.0387  =  0  +  0.0387  =  0.0387 

b.  Verify  the  CIC,  calculation  provided  at  failure  time 
tf=  25  for  persons  in  the  placebo  group  (tx  =  0): 

h,  (25)  =  1/6  =  0.1667 

S(23)  =  S(21)Pr(T  >  23|T>  23)  =  0.4150(1  -  1/8) 
=  0.4150(0.875)  =  0.3631 

1,(25)  =  h,(25)S(23)  =  0. 1667(3631)  =  0.0605 
CIC,  (25)  =  CIC,  (23)  +  0.0605  =  0.2949  +  0.0605 
=  0.3554 

c.  interpret  the  CIC,  values  obtained  for  both  the  treat 
ment  and  placebo  groups  at  tf  =  30. 

For  tx  =  1,  CIC,(tf  =  30)  =  0.3087  and  for  tx  =  0, 
CIC,(tf  =30)  =  0.3554. 
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Thus,  for  treated  subjects  (tx  =  1),  the  cumulative 
risk  (i.e.,  marginal  probability)  for  local  bladder 
cancer  recurrence  is  about  30.1  %  at  30  months 
when  allowing  for  the  presence  of  competing  risks 
for  bladder  metastasis  or  other  metastasis. 

For  placebo  subjects  (tx  =1),  the  cumulative  risk 
(i.e.,  marginal  probability)  for  local  bladder  cancer 
recurrence  is  about  35.5%  at  30  months  when 
allowing  for  the  presence  of  competing  risks  for 
bladder  metastasis  or  other  metastasis. 

The  placebo  group  therefore  has  a  5%  increased 
risk  of  failure  than  the  treatment  group  by  30 
months  of  follow-up. 

d.  Calculating  the  CPCi  values  for  both  treatment  and 
placebo  groups  at  tf  =  30: 

The  formula  relating  CPC  to  CIC  is  given  by 
CPCc  =  CICc/(l  -  CTCc0  where  CICc  =  CIC  for 
cause-specific  risk  event  =  1  and  CICc'  =  CIC  from 
risks  for  events  =  2  or  3  combined 
For  tx  =  1,  ClCjftf  =  30)  =  0.3087  and  for  tx  =  0, 
CICi(tf=  30)  =  0.3554. 

The  calculation  of  CICc'  involves  recoding  the  event 
variable  to  1  for  subjects  with  bladder  metastasis 
or  other  metastasis  and  0  otherwise  and  then 
computing  CICc'.  Calculation  of  CICc'  involves  the 
following  calculations. 


tx  =  1  (Treatment  A) 


tf 

nf 

dif 

hi(tf) 

S(tf_i) 

Ii(tf) 

CICr(tf) 

0 

27 

0 

0 

— 

— 

— 

2 

27 

1 

.0370 

1 

.0370 

.0370 

3 

26 

2 

.0769 

.9630 

.0741 

.1111 

4 

24 

0 

0 

.8889 

0 

.1111 

8 

23 

1 

.0435 

.8889 

.0387 

.1498 

9 

21 

1 

.0476 

.8116 

.0386 

.1884 

10 

20 

1 

.0500 

.7729 

.0386 

.2270 

15 

17 

1 

.0588 

.7343 

.0432 

.2702 

16 

15 

1 

.0667 

.6479 

.0432 

.3134 

18 

14 

0 

0 

.6047 

0 

.3134 

22 

12 

0 

0 

.6047 

0 

.3134 

23 

11 

0 

0 

.5543 

0 

.3134 

24 

8 

0 

0 

.5039 

0 

.3134 

26 

7 

0 

0 

.4409 

0 

.3134 

28 

4 

1 

.2500 

.3779 

.0945 

.4079 

29 

2 

0 

0 

.2835 

0 

.4079 

30 

1 

0 

0 

.2835 

0 

.4079 
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tx  =  0  (Placebo) 


tf 

nf 

dit 

hi  (tf) 

S(tf_f) 

Ii(tf) 

CICr(tf) 

0 

26 

0 

0 

— 

— 

— 

1 

26 

0 

0 

1 

0 

0 

2 

24 

0 

0 

.9615 

0 

0 

3 

23 

0 

0 

.9215 

0 

0 

5 

21 

1 

.0476 

.8413 

.0400 

.0400 

6 

20 

2 

.1000 

.8013 

.0801 

.1201 

7 

18 

1 

.0556 

.7212 

.0401 

.1602 

10 

16 

1 

.0625 

.6811 

.0426 

.2028 

12 

15 

1 

.0667 

.6385 

.0426 

.2454 

14 

13 

0 

0 

6835 

0 

.2454 

16 

12 

1 

.0833 

.5534 

.0461 

.2915 

17 

10 

0 

0 

.4612 

0 

.2915 

18 

9 

0 

0 

.4150 

0 

.2915 

21 

8 

1 

.1250 

.4150 

.0519 

.3434 

23 

7 

0 

0 

.3632 

0 

.3434 

25 

6 

1 

.1667 

.3632 

.0605 

.4039 

29 

4 

0 

0 

.2421 

0 

.4039 

30 

2 

0 

0 

.2421 

0 

.4039 

From  these  tables,  for  tx  =  1,  CICi'((tf)  =  30)  =  0.4079, 
and  for  tx  =  0,  CICr((tf)=  30)  =  0.4039. 

Thus,  for  tx  =1,  CPC1((tf)=30)=  0.3087/(1  -  0.4079) 
=  0.5213,  and  for  tx  =  0,  CPCi((tf)  =  30)  =  0.3554/ 
(1  -  0.4039)  =  0.5962. 

6.  a.  HR] (tx  =  1  vs.  tx  =  0)  =  0.535(=  1/1.87), 

p-value  =  0.250,  N.S. 

b.  HR2 (tx  =  1  vs.  tx  =  0)  =  0.987, 
p-value  =  .985,  N.S. 

c.  HR3(tx  =  1  vs.  tx  =  0)  =  0.684  (=  1/1.46), 
p-value  =  .575,  N.S. 

7.  a.  Hazard  model  formula  for  the  LM  model: 

h*(t,  X)  =  hog(t)  exp[j8,  tx  +  //,  mini  +  /?3  size 
g  =  1,2,3  +  (5i(txd2)+  <52(numd2) 

+  />3(sized2)  +  c>4(txd3) 

+  c>5(numd3)  +  <5g(sized3)] 

where 

d2  =  1  if  bladder  metastasis  and  0  otherwise, 
and 

d3  =  1  if  or  other  metastasis  and  0  otherwise 
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b.  Hazard  ratios  for  the  effect  of  each  of  the  3 
cause-specific  events: 

HRi(tx  =  1  vs.  tx  =  0)  =  exp(— 0.6258) 

=  0.535(=  1/1.87) 

HR2(tx  =  1  vs.  tx  =  0)  =  exp(— 0.6258  +  .6132) 

=  0.987(=  1/1.01) 

HR3(tx  =  1  vs.  tx  =  0)  =  exp(— 0.6258  +  .2463) 

=  0.684(=  1/1.46) 

c.  Corresponding  HRs  are  identical. 

8.  a.  Hazard  model  formula  for  the  LMait  model: 

hg(t,  X)  =  h'0g(t)  exp[  (5U  txdl -(-  <512  numdl  +  <513sizedl 
g  =  1,2,3  +  <521txd2  +  <522numd2 

+  <523sized2  +  <531txd3 
+  <532numd3  +  (533sized3] 

where 

dl  =  1  if  local  bladder  cancer  recurrence  and  0 
otherwise 

d2  =  1  if  bladder  metastasis  and  0  otherwise, 
and 

d3  =  1  if  or  other  metastasis  and  0  otherwise 

b.  Hazard  ratios  for  the  effect  of  each  of  the  three 
cause-specific  events: 

output. 

HRi(tx  =  1  vs.  tx  =  0)  =  exp(— 0.6258) 

=  0.535(=  1/1.87) 

HR2(tx  =  1  vs.  tx  =  0)  =  exp(— 0.0127) 

=  0.987(=  1/1.01) 

HR3(tx  =  1  vs.  tx  =  0)  =  exp(-0.3796) 

=  0.684(=  1/1.46) 

c.  Corresponding  hazard  ratios  arc  identical. 

9.  No  interaction  SC  LM  model: 

h*  (t,  X)  =  hog(t)  exp[  (1,  tx  +  P2  num  +  p8  size] 
g  =  1,2,3 

Assumes  HR,(X)  =  HR2(X)  =  HR3(X)  for  any  X 
variable  e.g.,  Rx  =  0  vs.  Rx  =  1: 

1  lR|(tx)  =  HR2(tx)  =  HR3(tx)  =  exp[p,] 

10.  Carry  out  the  following  likelihood  ratio  test: 

H0:  <5gj  =  0  g  =  2,3;  j  =  1,2,3 

where  8gj  is  coefficient  of  DgXj  in  the  interaction  SC 
LM  model 

LR  =  21og  Lr  -  (-2LogLF)  approx  xl  under  H0 
R  =  no-interaction  SC  (reduced)  model 
F  =  interaction  SC  (full)  model 
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1.  Example:  A=2,  F=2,  so  Mt=A/2  +  F  =3,  R=2 
cc=0.05,  (3=0.10 

;  0.10,  =  0.05,  A  =  Xq/  7.j=  2 

Nev  =  {(1.96  +  1.282) [2(2)  +  l]/[x/2(2  -  l)]}2 
=  131.382  «  132 


Using  Formula  1 : 
N  =  9 

'{  1  -e 


131.382 


-2(0.05)(3) 


2+1 
131.832 


}  + 


2+1 


{  1  —  e 


.(0.05)3! 


0.2192 


-=  601.4 


602 


Ni  =  [2/3J601.4  =  400.93  «  401  and 
N0  =  400.9/2  =  200.45  ss  200 

2.  Nev  =131.383  =  132  from  question  1. 
1 


Pf.vi  =  1  — 


(0.05)(2)  L 
=  1  —  0.861 1  =  0.1389 


e  (0.05)(2)  _  e-0.05)(2+2) 


3. 


4. 

5. 

6. 


Pevo 


(0.10)(2) 
=  1  -  0.7421 


g— (0.10)(2)  _  e-(0.10)(2+2) 

=  0.2579 


N  = 


Ni  = 


131.382 
2  (0.1389)  1 


2+1 
131.832 


2+1 
=  738.14  a 

0.1786 

[2/3J738.14  =  492.09 


(0.2579) 

;  739 

ss  492  and 


N0  =  492.09/2  =  246.04  «  246. 


The  results  using  Formulae  1  and  2  are  somewhat 
different  since  Formula  1  yields  N=602  whereas 
Formula  2  yields  N=739.  Formula  1  uses  the  median 
follow-up  time  MF  in  the  computation  of  pEVi  whereas 
Formula  2  computes  pEVi  by  assuming  that  the  time  X 
at  which  any  subject  enters  the  study  has  the  uniform 
distribution  over  the  accrual  period. 


NLOFadj  =  739/(1  -  0.25)  =  985.33  w  986 

Ni  =  [2/31985.33=  656.89  w  657  and  N0  =  656.89/2  = 
328.44  «  328 


NiTTadj  =  986/(1  -  0  05  -0.10)2  =  1364.71  w  1365 
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7.  Ni  =  [2/(2+1)11364.71  =  909.81  «910  and 
N0  =  909.81/2  =  454.91  sa  455 

8.  From  question  6,  the  required  accrual  rate  is  r  =  N/A  = 
1365/2  =  682.5  «  683  subjects  per  year.  If  this  accrual 
rate  is  not  feasible,  but  r*  was  considered  feasible,  then 
you  can  adjust  your  sample  size  by  reducing  the  accrual 
period  to  A*  =  N/r*.  For  example,  if  the  maximum  for 
r  is  rmax  =  600/yr,  then  the  required  accrual  period  is 
modified  from  A=2  to  A*  =  2.275  years. 

Now  suppose,  we  keep  NEV  (=131.382),  F=2,  R=2, 
a=0.05,  (3=0.10,  L0  =  0.10,  Lj  =  0.05,  and  A  =  Lq/ai  =  2 
all  constant,  but  increase  the  accrual  time  to  A*=2.275 
years.  Then  we  would  need  to  re-compute  pEVi,  Pevo 
and  N  to  obtain  pEVi=  0.2677,  pEVo=  0.1447,  and  N  = 
579.541  (prior  to  adjusting  for  LOF  and  Crossovers), 
which  is  modified  to  N*  =  1069.51  after  adjusting  for 
25%  LOF  rate,  5%  dc  rate  and  10%  dt,  rate.  For  this 
modified  sample  size,  the  modified  required  accrual 
rate  is  r*  =  N*/A*  =  1069.71/2.275  =  470.11,  which  is 
less  than  rmax  =  600,  so  that  the  study  is  feasible. 

Note,  however,  it  is  also  possible  to  obtain  a  feasible 
study  if  the  accrual  period  remains  at  A=2,  but  the 
follow-up  period  increases  to,  say  F=4,  again  keeping 
Nev(=131.382),  R  =  2,  a  =  0.05,  (3  =  0.10,  Lo0.10,  a,  = 
0.05,  and  A  =  Aq/aj  =  2  all  constant.  This  will  require 
re-computing  pF\/i ,  Pevo  and  N  again,  followed  by 
adjustments  for  LOF  and  Crossovers.  In  particular,  if 
F  is  increased  (to  say  F=4),  then  pEVi  and  pEVo  should 
correspondingly  increase  from  previously  calculated 
values  because  the  probability  for  an  event  occurring 
should  increase  if  follow-up  time  is  increased. 
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Gamma  distribution,  328 
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Goodness-of-fit  (GOF) 
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Hazard  function,  9,  10 
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Kaplan-Meier  (KM)  curves,  56 
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log-log  survival  curves,  171 
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Left  truncation,  132-133,  136-140,  144, 
147 


698  Index 


Leukemia  remission-time  data, 

18-20,  30 

Cox  proportional  hazards  (PH)  model 
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Kaplan-Meier  survival  curves  for, 
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log-log  KM  curves  for,  171-175 
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Likelihood  function 
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LM  approach.  See  Lunn-McNeil 
approach 

Logit  link  function,  324 
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Log-log 
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Lognormal  survival  models,  14,  316 
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Lunn-McNeil  (LM)  approach,  433, 
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marginal  probability,  446 
results  for,  393 

Maximum  likelihood  (ML)  estimation 
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519 
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Non-informative  censoring,  37,  41-42, 
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Observed  vs.  expected  plots,  175-180 
Open  cohort,  135,  140,  144 
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Parametric  approach  using  shared 
frailty,  389-391 
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defined,  292 
examples 

exponential  model,  295-297,  300-304 
log-logistic  model,  309-314 
Weibull  model,  304-309 
likelihood  function,  318-321 
other  models,  316-318 
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Pepe-Mori  test,  471 
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goodness  of  fit  test  with  Schonfield 
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Kaplan-Meier  log-log  survival 
curves,  611-612 

observed  vs.  expected  plots,  175-180 
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SPSS, 615-617 
Stata,  535-538,  545-547 
time-dependent  covariates,  183-187, 
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evaluating,  161-200 
meaning  of,  123-127 
PH  model,  Cox.  See  Cox  proportional 
hazards  (PH)  model 
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Probability,  12 

density  function,  294-295 
PROC  LIFEREG  (SAS),  598-603 
PROC  LIFETEST  (SAS),  572-576 
PROC  PHREG,  576-580 
Product  limit  formula,  56 
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R  software  620-663 
assessing  PH  assumption 

using  graphical  approaches,  631-633 
using  statistical  tests,  640-641 
estimating  survival  functions,  626-631 
modeling  recurrent  events,  660-663 
obtaining  Cox  adjusted  survival  curves 
with,  641-645 

running  Cox  proportional  hazards  (PH) 
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running  extended  Cox  model,  646-650 
running  frailty  models,  657-659 
running  parametric  models,  651-656 
running  stratified  Cox  (SC)  model, 
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Random  censoring,  37-41,  49 
Recurrent  event  survival  analysis, 
363-423 

counting  process  approach,  368-376 
definition  of  recurrent  events,  4,  364 
examples  of  recurrent  event  data, 
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other  approaches  for  analysis,  379-385 
parametric  approach  using  shared 
frailty,  389-391 
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Stata  modeling,  564-570 
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Right-censored  data,  8,  321 
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Robust  estimation,  376-378 
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Sample  size  inflation  factor,  513,  517, 
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SAS,  570-607 

assessing  PH  assumption,  with 
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572-576 

modeling  recurrent  events,  603-607 
obtaining  Cox  adjusted  survival 
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running  Cox  proportional  hazards 
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running  parametric  models,  with  PROC 
L1FEREG,  598-603 
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recurrent  events  analysis  using, 
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Shared  frailty  model,  338 
SPSS,  607-620 

assessing  PH  assumption 
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estimating  survival  functions,  626-628 
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Stanford  Heart  Transplant  Study 
extended  Cox  model  application  to, 
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using  graphical  approaches, 

535-538 
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estimating  survival  functions,  531-535 
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obtaining  Cox  adjusted  survival  curves 
with,  547-550 
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Survival  curves 
adjusted,  120 
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R,  626-631 
SAS,  572-576 
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Tarone-Ware  test  statistic,  74 
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